Quick Links

Re: tweaking NTUP_PER_BUCKET

From:	Tomas Vondra <tv(at)fuzzy(dot)cz>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: tweaking NTUP_PER_BUCKET
Date:	2014-07-08 21:16:50
Message-ID:	53BC5FC2.6020806@fuzzy.cz
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi,

Thinking about this a bit more, do we really need to build the hash
table on the first pass? Why not to do this:

(1) batching
- read the tuples, stuff them into a simple list
- don't build the hash table yet

(2) building the hash table
- we have all the tuples in a simple list, batching is done
- we know exact row count, can size the table properly
- build the table

Also, maybe we could use a regular linear hash table [1], instead of
using the current implementation with NTUP_PER_BUCKET=1. (Although,
that'd be absolutely awful with duplicates.)

regards
Tomas

[1] http://en.wikipedia.org/wiki/Linear_probing

In response to

Re: tweaking NTUP_PER_BUCKET at 2014-07-08 18:28:41 from Tomas Vondra

Responses

Re: tweaking NTUP_PER_BUCKET at 2014-07-09 14:07:45 from Robert Haas

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2014-07-08 21:27:22	Re: Allowing join removals for more join types
Previous Message	Tomas Vondra	2014-07-08 21:04:22	Re: tweaking NTUP_PER_BUCKET