From: | Andres Freund <andres(at)anarazel(dot)de> |
---|---|
To: | Felix Geisendörfer <felix(at)felixge(dot)de> |
Cc: | pgsql-performance(at)postgresql(dot)org |
Subject: | Re: Extremely slow HashAggregate in simple UNION query |
Date: | 2019-08-20 17:32:19 |
Message-ID: | 20190820173219.54dp463d7pkgg6un@alap3.anarazel.de |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
Hi,
On 2019-08-20 17:11:58 +0200, Felix Geisendörfer wrote:
> today I debugged a query that was executing about 100x slower than expected, and was very surprised by what I found.
>
> I'm posting to this list to see if this might be an issue that should be fixed in PostgreSQL itself.
>
> Below is a simplified version of the query in question:
>
> SET work_mem='64MB';
> EXPLAIN ANALYZE
> SELECT * FROM generate_series(1, 1) a, generate_series(1, 1) b
> UNION
> SELECT * FROM generate_series(1, 1) a, generate_series(1, 1) b;
>
> HashAggregate (cost=80020.01..100020.01 rows=2000000 width=8) (actual time=19.349..23.123 rows=1 loops=1)
FWIW, that's not a mis-estimate I'm getting on master ;). Obviously
that doesn't actually address your concern...
> 1. The query overestimates the final output rows by a factor of 2 million. [1]
Right. There's not really that much we can do about that in
general. That'll always be possible. Although we can obviously improve
the estimates a good bit more.
> I'm certainly a novice when it comes to PostgreSQL internals, but I'm
> wondering if this could be fixed by taking a more dynamic approach for
> allocating HashAggregate hash tables?
Under-sizing the hashtable just out of caution will have add overhead to
a lot more common cases. That requires copying data around during
growth, which is far far from free. Or you can use hashtables that don't
need to copy, but they're also considerably slower in the more common
cases.
> 3. Somehow EXPLAIN gets confused by this and only ends up tracking 23ms of the query execution instead of 45ms [5].
Well, there's plenty work that's not attributed to nodes. IIRC we don't
track executor startup/shutdown overhead on a per-node basis. So I don't
really think this is necessarily something that suspicious. Which
indeed seems to be what's happening here (this is with 11, to be able to
hit the problem with your reproducer):
+ 33.01% postgres postgres [.] tuplehash_iterate
- 18.39% postgres libc-2.28.so [.] __memset_avx2_erms
- 90.94% page_fault
__memset_avx2_erms
tuplehash_allocate
tuplehash_create
BuildTupleHashTableExt
build_hash_table
ExecInitAgg
ExecInitNode
InitPlan
standard_ExecutorStart
Greetings,
Andres Freund
From | Date | Subject | |
---|---|---|---|
Next Message | Felix Geisendörfer | 2019-08-20 17:55:56 | Re: Extremely slow HashAggregate in simple UNION query |
Previous Message | Pavel Stehule | 2019-08-20 16:57:03 | Re: Extremely slow HashAggregate in simple UNION query |