Re: Extremely slow HashAggregate in simple UNION query - Mailing list pgsql-performance

From Jeff Janes
Subject Re: Extremely slow HashAggregate in simple UNION query
Date
Msg-id CAMkU=1yiC23_F5OvQfh2qN2OMTx-95njbz2uskPB01dt2d_BNA@mail.gmail.com
Whole thread Raw
In response to Re: Extremely slow HashAggregate in simple UNION query  (Pavel Stehule <pavel.stehule@gmail.com>)
Responses Re: Extremely slow HashAggregate in simple UNION query  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-performance
On Thu, Aug 22, 2019 at 1:09 AM Pavel Stehule <pavel.stehule@gmail.com> wrote:
čt 22. 8. 2019 v 3:11 odesílatel Jeff Janes <jeff.janes@gmail.com> napsal:
...
 
But the same advance in v12 which makes it harder to fool with your test case also opens the possibility of fixing your real case.

I think so much more interesting should be long time after query processing - last row was processed in 13ms, but Execution Time was 69ms .. so some cleaning is 56ms - that is pretty long.

Most of the time is not after the clock stops, but before the stepwise ANALYZE clock starts.  If you just do an EXPLAIN rather than EXPLAIN ANALYZE, that is also slow.  The giant hash table is created during the planning step (or somewhere around there--I notice that EXPLAIN ANALYZE output doesn't count it in what it labels as the planning step--but it is some step that EXPLAIN without ANALYZE does execute, which to me makes it a planning step).

For me, "perf top" shows kernel's __do_page_fault as the top function.  tuplehash_iterate does show up at 20% (which I think is overattributed, considering how little the speedup is when dropping ANALYZE), but everything else just looks like kernel memory management code.

Cheers,

Jeff

pgsql-performance by date:

Previous
From: Gunther
Date:
Subject: Re: Out of Memory errors are frustrating as heck!
Next
From: Tom Lane
Date:
Subject: Re: Extremely slow HashAggregate in simple UNION query