| From: | Thomas Kellerer <spam_eater(at)gmx(dot)net> | 
|---|---|
| To: | pgsql-general(at)lists(dot)postgresql(dot)org | 
| Subject: | Re: Plan output: actual execution time not considering loops? | 
| Date: | 2018-06-20 14:34:42 | 
| Message-ID: | 9e141f7e-74a6-7311-2630-3ea75a72c800@gmx.net | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-general | 
Tom Lane schrieb am 20.06.2018 um 16:03:
>> Consider the following execution plan:
>> ...
>>     ->  Aggregate  (cost=26.87..26.87 rows=1 width=32) (actual time=0.012..0.012 rows=1 loops=700000)
>>           ->  Bitmap Heap Scan on orders o2  (cost=3.45..26.85 rows=8 width=8) (actual time=0.004..0.008 rows=8 loops=700000)
>>                 ->  Bitmap Index Scan on orders_customer_id_order_date_idx  (cost=0.00..3.45 rows=8 width=0) (actual time=0.003..0.003 rows=8 loops=700000)
> 
>> My expectation would have been that the "Aggregate" step shows the actual time as a product of the number of loops.
> 
> No, that looks fine to me.  The rule of thumb for reading this is total
> time spent in/below this node is "actual time" times "number of loops".
OK, if that is the rule I can live with that ;)
> It seems a bit odd that the Agg node would account for a third of the
> total execution time when it's only processing 8 rows on average ...
> but maybe it's a really expensive aggregate.
But it's processing those 8 rows 700.000 times - so the total time seems correct.
FWIW, the query looks like this:
    select customer_id, 
           amount, 
           sales_person_id
    from orders o1
    where amount = (select max(o2.amount)
                    from orders o2
                    where o2.customer_id = o1.customer_id);
It's not a real world query - it's just there to illustrate the drawbacks of co-related sub-queries.
 
> Another thought is that the EXPLAIN ANALYZE instrumentation itself
> can account for significant per-node-invocation overhead.  If the
> total execution time drops significantly when you add "timing off"
> to the EXPLAIN options, then that's probably a factor in making
> the Agg node look relatively expensive.
"timing off" doesn't really change the execution time (it's about 60ms faster without)
Thanks for the answer, I am not really concerned about the query performance itself, just about the plan ;)
Thomas
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Tom Lane | 2018-06-20 14:49:04 | Re: tsvector field length limitation | 
| Previous Message | Jonathan Marks | 2018-06-20 14:24:13 | tsvector field length limitation |