Quick Links

Re: Improving NOT IN

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Simon Riggs" <simon(at)2ndquadrant(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Improving NOT IN
Date:	2007-01-30 22:34:25
Message-ID:	12336.1170196465@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

"Simon Riggs" <simon(at)2ndquadrant(dot)com> writes:
> First we need to show that the referenced table's PK values are a fully
> continuous sequence of integers with no gaps.

Since that is unlikely to be the case, I can't see that this is worth
implementing...

> I'll describe this using SQL statements, which execute as SeqScans of
> the PK and then FK tables. There is no preparatory step - no building a
> sort table or preparing a hash table, so these SQL statements always
> execute faster than the fastest current plan.

Except that when you fail to prove it, as you usually will, you have
wasted a complete seqscan of the table, and still have to fall back on
a regular plan. If the thing were amenable to falling out fairly
quickly on proof failure, it would be better, but AFAICS you don't know
anything until you've completed the whole scan.

I think the NOT IN optimization that *would* be of use is to
automatically transform the NOT IN representation to an
outer-join-with-null-test type of operation, so as to give us a wider
choice of join methods. However, I'm not sure about correct handling
of NULLs on the RHS in such a scenario. The existing hashed-IN code
has to jump through some really ugly hoops to give spec-compliant
answers with NULLs.

BTW, your sketch fails in the presence of NULLs on the RHS ...

regards, tom lane

In response to

Improving NOT IN at 2007-01-30 22:03:12 from Simon Riggs

Responses

Re: Improving NOT IN at 2007-01-30 22:55:29 from Simon Riggs
Re: Improving NOT IN at 2007-02-01 11:49:55 from Simon Riggs

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	David Fetter	2007-01-30 22:41:12	Re: Talks for OSCON? Only 5 days left!
Previous Message	Peter Eisentraut	2007-01-30 22:26:04	parsenodes vs. primnodes