Quick Links

nodeAgg perf tweak

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	nodeAgg perf tweak
Date:	2004-12-01 02:48:31
Message-ID:	1101869311.22124.100.camel@localhost.localdomain
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

I noticed that we have a bottleneck in aggregate performance in
advance_transition_function(): according to callgrind, doing datumCopy()
and pfree() for every tuple produced by the transition function is
pretty expensive. Some queries bare this out:

dvl=# SELECT W.element_id, count(W.element_ID) FROM watch_list_element W
GROUP by W.element_id ORDER by count(W.element_ID) LIMIT 5;
element_id | count
------------+-------
65525 | 1
163816 | 1
16341 | 1
131023 | 1
65469 | 1
(5 rows)

Time: 176.723 ms

dvl=# select count(*) from watch_list_element;
count
--------
138044
(1 row)

Time: 94.246 ms

I've attached a quick and dirty hack that avoids the need to palloc()
and pfree() for every tuple produced by the aggregate's transition
function. This results in:

dvl=# select count(*) from watch_list_element;
count
--------
138044
(1 row)

Time: 73.975 ms

I can reproduce this performance difference consistently. I thought this
might have been attributable to memory checking overhead because
assertions were enabled, but that doesn't appear to be the case (the
above results are without --enable-cassert).

The attached patch invokes the transition function in the current memory
context. I don't think that's right: a memory leak in an aggregate's
transition function would be problematic when we're invoked from a
per-query memory context, as is the case with advance_aggregates().
Perhaps we need an additional short-lived memory context in
AggStatePerAggData: we could invoke the transition function in that
context, and reset it once per, say, 1000 tuples.

Alternatively we could just mandate that aggregate transition function's
not leak memory and then invoke the transition function in, say, the
aggregate's memory context, but that seems a little fragile.

Comments?

-Neil

Attachment	Content-Type	Size
node_agg_perf-1.patch	text/x-patch	989 bytes

Responses

Re: nodeAgg perf tweak at 2004-12-01 04:15:34 from Tom Lane
Re: nodeAgg perf tweak at 2004-12-01 08:25:26 from Simon Riggs
Re: nodeAgg perf tweak at 2004-12-01 20:54:11 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Bruce Momjian	2004-12-01 03:24:14	Re: Increasing the length of
Previous Message	Woodchuck Bill	2004-12-01 02:39:50	Re: USENET vs Mailing Lists Poll ...