From: | Jeff Janes <jeff(dot)janes(at)gmail(dot)com> |
---|---|
To: | Simon Riggs <simon(at)2ndquadrant(dot)com> |
Cc: | Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>, Stephen Frost <sfrost(at)snowman(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Bruce Momjian <bruce(at)momjian(dot)us>, Atri Sharma <atri(dot)jiit(at)gmail(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: A better way than tweaking NTUP_PER_BUCKET |
Date: | 2014-01-28 16:53:37 |
Message-ID: | CAMkU=1y_qp+QUPGk=JBJSTtcYQpW2k=v2LMyTZkO_8ftuuy_fw@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Mon, Jan 27, 2014 at 10:00 AM, Simon Riggs <simon(at)2ndquadrant(dot)com> wrote:
> On 27 January 2014 17:44, Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com> wrote:
>
> > This topic is interesting - we found very bad performance with hashing
> large
> > tables with high work_mem. MergeJoin with quicksort was significantly
> > faster.
>
> I've seen this also.
>
> > I didn't deeper research - there is a possibility of virtualization
> > overhead.
>
> I took measurements and the effect was repeatable and happened for all
> sizes of work_mem, but nothing more to add.
>
I get similar results if I join on integers. But joining on text, the hash
wins by a mile.
I use this as a simple test bed:
alter table pgbench_accounts drop CONSTRAINT pgbench_accounts_pkey;
update pgbench_accounts set filler = md5(aid::text);
set work_mem to whatever keeps the join off of disk for the given scale;
set enable_hashjoin to whatever;
select sum(a1.abalance*a2.abalance) from pgbench_accounts a1 join
pgbench_accounts a2 using (aid);
select sum(a1.abalance*a2.abalance) from pgbench_accounts a1 join
pgbench_accounts a2 using (filler);
hash integer: 1832.695 ms
merge integer: 1462.913 ms
hash text: 2353.115 ms
merge text: 11,218.628 ms
The cost estimates do not depend on the column used in the join despite a 6
fold difference in run time, so the planner is perhaps missing a trick
there.
Cheers,
Jeff
From | Date | Subject | |
---|---|---|---|
Next Message | Robert Haas | 2014-01-28 16:53:41 | Re: Changeset Extraction v7.3 |
Previous Message | Thom Brown | 2014-01-28 16:49:17 | Re: Changeset Extraction v7.3 |