From: | Thomas Kellerer <spam_eater(at)gmx(dot)net> |
---|---|
To: | pgsql-general(at)lists(dot)postgresql(dot)org |
Subject: | Re: Plan output: actual execution time not considering loops? |
Date: | 2018-06-20 14:34:42 |
Message-ID: | 9e141f7e-74a6-7311-2630-3ea75a72c800@gmx.net |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
Tom Lane schrieb am 20.06.2018 um 16:03:
>> Consider the following execution plan:
>> ...
>> -> Aggregate (cost=26.87..26.87 rows=1 width=32) (actual time=0.012..0.012 rows=1 loops=700000)
>> -> Bitmap Heap Scan on orders o2 (cost=3.45..26.85 rows=8 width=8) (actual time=0.004..0.008 rows=8 loops=700000)
>> -> Bitmap Index Scan on orders_customer_id_order_date_idx (cost=0.00..3.45 rows=8 width=0) (actual time=0.003..0.003 rows=8 loops=700000)
>
>> My expectation would have been that the "Aggregate" step shows the actual time as a product of the number of loops.
>
> No, that looks fine to me. The rule of thumb for reading this is total
> time spent in/below this node is "actual time" times "number of loops".
OK, if that is the rule I can live with that ;)
> It seems a bit odd that the Agg node would account for a third of the
> total execution time when it's only processing 8 rows on average ...
> but maybe it's a really expensive aggregate.
But it's processing those 8 rows 700.000 times - so the total time seems correct.
FWIW, the query looks like this:
select customer_id,
amount,
sales_person_id
from orders o1
where amount = (select max(o2.amount)
from orders o2
where o2.customer_id = o1.customer_id);
It's not a real world query - it's just there to illustrate the drawbacks of co-related sub-queries.
> Another thought is that the EXPLAIN ANALYZE instrumentation itself
> can account for significant per-node-invocation overhead. If the
> total execution time drops significantly when you add "timing off"
> to the EXPLAIN options, then that's probably a factor in making
> the Agg node look relatively expensive.
"timing off" doesn't really change the execution time (it's about 60ms faster without)
Thanks for the answer, I am not really concerned about the query performance itself, just about the plan ;)
Thomas
From | Date | Subject | |
---|---|---|---|
Next Message | Tom Lane | 2018-06-20 14:49:04 | Re: tsvector field length limitation |
Previous Message | Jonathan Marks | 2018-06-20 14:24:13 | tsvector field length limitation |