Re: Hash Join node sometimes slow

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Dave Roberge <droberge(at)bluetarp(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: Hash Join node sometimes slow
Date: 2014-07-02 14:11:19
Message-ID: 1558.1404310279@sss.pgh.pa.us
Lists: pgsql-performance

Dave Roberge <droberge(at)bluetarp(dot)com> writes:
> For example, running explain (analyze, buffers) with the query, 4/5 times we
> will see the following:

> -> Hash Join (cost=16385.76..103974.09 rows=523954 width=64) (actual
> time=532.634..4018.678 rows=258648 loops=1)
> Hash Cond: (p.a = c.c)
> Buffers: shared hit=4 read=29147, temp read=12943 written=12923
> -> Seq Scan on p (cost=0.00..38496.88 rows=1503188 width=60) (actual
> time=0.013..1388.205 rows=1503188 loops=1)
> Buffers: shared hit=1 read=23464
> -> Hash (cost=15382.47..15382.47 rows=57703 width=12) (actual
> time=527.237..527.237 rows=57789 loops=1)
> Buckets: 4096 Batches: 4 Memory Usage: 632kB
> Buffers: shared hit=3 read=5683, temp read=617 written=771

> The other times, we will see something like this:

> -> Hash Join (cost=16385.76..103974.09 rows=523954 width=64) (actual
> time=587.277..15208.621 rows=258648 loops=1)
> Hash Cond: (p.a = c.c)
> Buffers: shared hit=26 read=29125, temp read=12943 written=12923
> -> Seq Scan on p (cost=0.00..38496.88 rows=1503188 width=60) (actual
> time=0.013..1525.608 rows=1503188 loops=1)
> Buffers: shared hit=22 read=23443
> -> Hash (cost=15382.47..15382.47 rows=57703 width=12) (actual
> time=581.638..581.638 rows=57789 loops=1)
> Buckets: 4096 Batches: 4 Memory Usage: 632kB
> Buffers: shared hit=4 read=5682, temp read=617 written=771

> Does anyone have ideas on what might be causing the difference in timing for
> the hash join node?

I'd bet on the extra time being in I/O for the per-batch temp files,
since it's hard to see what else would be different if the data were
identical in each run. Maybe the kernel is under memory pressure and
is dropping the file data from in-memory disk cache. Or maybe it's
going to disk all the time but the slow runs face more I/O congestion.
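One way to confirm that (assuming the server is new enough to have
track_io_timing, 9.2 or later) would be to enable I/O timing and compare
the per-node read times between a fast run and a slow one, along these
lines:

    SET track_io_timing = on;
    EXPLAIN (ANALYZE, BUFFERS) SELECT ... ;  -- the original query
    -- each node then reports an extra line such as
    --   I/O Timings: read=1234.567

Setting log_temp_files = 0 would also get you a log entry for every temp
file the hash join spills, including its size, which helps show whether
the slow runs are really hitting the disk harder.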

Personally, for a problem of this size I'd increase work_mem enough
so you don't get multiple batches in the first place.
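As a rough sketch (16MB is just a guess; the Hash node only needed about
632kB per batch across 4 batches, a couple of MB in total, so anything
comfortably above that should collapse it to a single batch):

    SET work_mem = '16MB';  -- hypothetical value; raise until the plan shows Batches: 1
    EXPLAIN (ANALYZE, BUFFERS) SELECT ... ;

Then check that the Hash node reports Batches: 1 and that the join no
longer shows any temp buffers read or written.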

regards, tom lane
