Quick Links

Re: explain HashAggregate to report bucket and memory stats

From:	Jeff Davis <pgsql(at)j-davis(dot)com>
To:	Justin Pryzby <pryzby(at)telsasoft(dot)com>, Andres Freund <andres(at)anarazel(dot)de>
Cc:	Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>, pgsql-hackers(at)lists(dot)postgresql(dot)org, Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
Subject:	Re: explain HashAggregate to report bucket and memory stats
Date:	2020-03-13 17:15:46
Message-ID:	f95260dc8cd3b5a4de85d9f89b5bae19dfbc4c13.camel@j-davis.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

+ /* hashtable->entrysize includes additionalsize */
+ hashtable->instrument.space_peak_hash = Max(
+ hashtable->instrument.space_peak_hash,
+ hashtable->hashtab->size *
sizeof(TupleHashEntryData));
+
+ hashtable->instrument.space_peak_tuples = Max(
+ hashtable->instrument.space_peak_tuples,
+ hashtable->hashtab->members *
hashtable->entrysize);

I think, in general, we should avoid estimates/projections for
reporting and try to get at a real number, like
MemoryContextMemAllocated(). (Aside: I may want to tweak exactly what
that function reports so that it doesn't count the unused portion of
the last block.)

For instance, the report is still not accurate, because it doesn't
account for pass-by-ref transition state values.

To use memory-context-based reporting, it's hard to make the stats a
part of the tuple hash table, because the tuple hash table doesn't own
the memory contexts (they are passed in). It's also hard to make it
per-hashtable (e.g. for grouping sets), unless we put each grouping set
in its own memory context.

Also, is there a reason you report two different memory values
(hashtable and tuples)? I don't object, but it seems like a little too
much detail.

Regards,
Jeff Davis

In response to

Re: explain HashAggregate to report bucket and memory stats at 2020-03-06 21:33:10 from Justin Pryzby

Responses

Re: explain HashAggregate to report bucket and memory stats at 2020-03-13 17:27:07 from Andres Freund

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	James Coleman	2020-03-13 17:16:44	Re: [PATCH] Incremental sort (was: PoC: Partial sort)
Previous Message	Andres Freund	2020-03-13 17:09:00	Re: shared-memory based stats collector