Re: optimizing a query

From: "David G(dot) Johnston" <david(dot)g(dot)johnston(at)gmail(dot)com>
To: Adrian Klaver <adrian(dot)klaver(at)aklaver(dot)com>
Cc: Jonathan Vanasco <postgres(at)2xlp(dot)com>, PostgreSQL mailing lists <pgsql-general(at)postgresql(dot)org>
Subject: Re: optimizing a query
Date: 2016-06-21 22:55:51
Message-ID: CAKFQuwYyNvP3UZvqnGq4axxqHsvReRUbCrNfG3NVuvGcx8hT-Q@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Tue, Jun 21, 2016 at 6:44 PM, Adrian Klaver <adrian(dot)klaver(at)aklaver(dot)com>
wrote:

> On 06/21/2016 03:33 PM, Jonathan Vanasco wrote:
>
>>
>>
>> In effort of simplifying the work, I've created indexes on t_a that have
>> all the related columns.
>>
>> CREATE INDEX test_idx ON t_a(col_1, id) WHERE col_2 IS NOT FALSE;
>> CREATE INDEX test_idx__a ON t_a(col_1, id) WHERE col_2 IS NOT
>> FALSE;
>>
>
​Aside from the name these indexes are identical...​

>> postgres will query test_idx__a first (yay!) but then does a bitmap heap
>> scan on t_a, and uses the raw t_a for the hash join.
>>
>> I don't actually need any information from t_a - it's just there for the
>> filtering, and ideally postgres would just use the index.
>>
>
​This is the description of a semi-join.

WHERE EXISTS (SELECT 1 FROM t_a WHERE t_a.id = ​t_a2b.a_id AND t_a.col_1 =
730 AND t_a.col_2 IS NOT FALSE)

>> I thought this might have been from using a partial index, but the same
>> results happen with a full index. I just can't seem to avoid this hash
>> join against the full table.
>>
>> anyone have a suggestion?
>>
>>
> The below works without including t_a in the FROM?
>
>
>> example query
>>
>> SELECT t_a2b.b_id AS b_id,
>> count(t_a2b.b_id) AS counted
>> FROM t_a2b
>> WHERE
>> t_a2b.col_a = 1
>> AND
>> t_a.col_1 = 730
>> AND
>> t_a.col_2 IS NOT False
>> GROUP BY t_a2b.b_id
>> ORDER BY counted DESC,
>> t_a2b.b_id ASC
>>
>>
​These two items combined reduce the desirability of diagnosing this...it
doesn't seem like you've faithfully recreated the scenario for us to
evaluate.

Your post is also not self-contained and you haven't provided the actual
EXPLAINs you are getting.

David J.

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Tim Smith 2016-06-21 23:03:55 Re: Help needed structuring Postgresql correlation query
Previous Message Adrian Klaver 2016-06-21 22:44:19 Re: optimizing a query