select distinct in a subquery bug/problem

From: "Dan Halbert" <halbert(at)halwitz(dot)org>
To: pgsql-general(at)postgresql(dot)org
Subject: select distinct in a subquery bug/problem
Date: 2012-08-11 03:32:35
Message-ID: 1344655955.224114970@beta.apps.rackspace.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general


In version 9.1.4-0ubuntu12.04:

Hi - I am getting wrong answers from a certain kind of query, and have narrowed it down to a change in the query plan between two similar queries. The two queries below use different query plans, and generate different results, one of which is completely wrong.

1. select count(t1_id) from t1 where t1_id not in (select distinct t1_id from t2 limit 1103) ==> 13357 [CORRECT result]
2. select count(t1_id) from t1 where t1_id not in (select distinct t1_id from t2 limit 1104) ==> 0 [WRONG result; should be close to 13357]

(See the query plans at end of this message.)

Obviously, the exact numbers are dependent on the table sizes and the data. In this case, t1 has about 14k rows, and t2 has about 210k rows. t1_id in t2 is a foreign key to t1.

Originally I was not using the LIMIT above, and was getting 0 from the query, so I tried some limits to figure out if it made a difference, and then noticed it was due to the query plan difference.

I've tried to reproduce this bug using test tables with at least as many rows. I can generate both query plans, but the query results are correct for both.

So there seems to be something odd or corrupted about my particular table t2, or something about it is exercising a bug. T2 is actually quite a large table with 20 columns, 9 foreign keys, and some more indexed columns. T2 has had various columns renamed, deleted and added over its lifetime.

I've tried VACUUM FULL, REINDEX, and I've tried dropping and recreating the foreign key on t2. Nothing has fixed the problem. I have not yet dumped and reloaded the whole database.

Any suggestions or insight on this? I'm pretty disturbed at getting wrong answers. There was some discussion in the past on this list about the efficiency of "NOT IN (SELECT DISTINCT ...)", but I haven't yet found any bug reports about incorrect results. I'm sorry I haven't been able to create reproducible test case yet.

Thanks,
Dan

------------------------------------

1. Query plan:
Aggregate (cost=6621.84..6621.85 rows=1 width=4)
-> Seq Scan on t1 (cost=6346.02..6603.77 rows=7230 width=4)
Filter: (NOT (hashed SubPlan 1))
SubPlan 1
-> Limit (cost=0.00..6343.26 rows=1103 width=4)
-> Unique (cost=0.00..19926.92 rows=3465 width=4)
-> Index Scan using fki_t2_t1_id_fkey on t2 (cost=0.00..19402.46 rows=209786 width=4)

2. Query plan:
Aggregate (cost=6625.95..6625.96 rows=1 width=4)
-> Seq Scan on t1 (cost=6350.13..6607.88 rows=7230 width=4)
Filter: (NOT (hashed SubPlan 1))
SubPlan 1
-> Limit (cost=6336.33..6347.37 rows=1104 width=4)
-> HashAggregate (cost=6336.33..6370.98 rows=3465 width=4)
-> Seq Scan on t2 (cost=0.00..5811.86 rows=209786 width=4)

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Tom Lane 2012-08-11 05:52:38 Re: select distinct in a subquery bug/problem
Previous Message John R Pierce 2012-08-11 01:27:11 Re: partitioning