Quick Links

hash join performance question

From:	armand pirvu <armand(dot)pirvu(at)gmail(dot)com>
To:	"pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject:	hash join performance question
Date:	2017-07-19 03:05:24
Message-ID:	8771FBF5-4637-404E-8BF0-6D1EE3525F9D@gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

testdb3=# select count(*) from csischema.dim_company;
count
---------
1786376
(1 row)

testdb3=# select count(*) from woc.dim_company;
count
-------
18980
(1 row)

woc.dim_company is a subset of csischema.dim_company meaning all company_id from woc.dim_company are in csischema.dim_company
Ratio is around 1%

SELECT a.company_id FROM csischema.dim_company a, woc.dim_company b
WHERE a.company_id = b.company_id;

testdb3=# explain analyze SELECT a.company_id FROM csischema.dim_company a, woc.dim_company b
testdb3-# WHERE a.company_id = b.company_id;
QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------
Hash Join (cost=711.05..54938.35 rows=18980 width=4) (actual time=34.067..1118.603 rows=18980 loops=1)
Hash Cond: (a.company_id = b.company_id)
-> Seq Scan on dim_company a (cost=0.00..47097.82 rows=1850582 width=4) (actual time=0.013..523.249 rows=1786376 loops=1)
-> Hash (cost=473.80..473.80 rows=18980 width=4) (actual time=20.203..20.203 rows=18980 loops=1)
Buckets: 32768 Batches: 1 Memory Usage: 924kB
-> Seq Scan on dim_company b (cost=0.00..473.80 rows=18980 width=4) (actual time=0.007..10.076 rows=18980 loops=1)
Planning time: 0.511 ms
Execution time: 1121.068 ms
(8 rows)

I was expecting at least the PK of csischema.dim_company to be used . In another DBMS that was the case. The larger table , csischema.dim_company used the PK.

Any hints, thoughts what am I not seing ?

Thank you
Armand

Responses

Re: hash join performance question at 2017-07-19 03:30:30 from Tom Lane

Browse pgsql-general by date

	From	Date	Subject
Next Message	Tom Lane	2017-07-19 03:30:30	Re: hash join performance question
Previous Message	David G. Johnston	2017-07-19 02:30:15	Re: PG 9.1 - FK + Check constraint