Quick Links

Re: Planner estimates cost of 'like' a lot lower than '='??

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Mats Lofkvist <mal(at)algonet(dot)se>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: Planner estimates cost of 'like' a lot lower than '='??
Date:	2001-07-22 19:43:52
Message-ID:	2485.995831032@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Actually, now that I look more closely, I bet that the real failure in
this example is not in estimation of the find0 scan, but in estimation
of the find1 scan. Notice that the plan switches from using
datavalueindex for find1 (ie, it's keying off "find1.value like
'test_0'", which means that the indexscan limits are 'test' to 'tesu')
to using dataindex (since this is an inner indexscan, values are
available for all three of key0, key1, key2). Since dataindex is a
unique index, that means only one row will be fetched from the index,
as opposed to however many are selected by "where find1.value >= 'test'
AND find1.value < 'tesu'".

By eyeball, it seems obvious that the unique-index lookup should be
preferred. I am not sure why the planner is selecting the other
instead, but it probably points to bogus estimation of the LIKE range
selectivity. What do you get from both EXPLAIN and actual execution
of

select count(*) from data where value like 'test_0';

select count(*) from data where value >= 'test' and value < 'tesu';

regards, tom lane

In response to

Re: Planner estimates cost of 'like' a lot lower than '='?? at 2001-07-22 18:36:43 from Mats Lofkvist

Responses

Re: Planner estimates cost of 'like' a lot lower than '='?? at 2001-07-22 21:16:54 from Mats Lofkvist

Browse pgsql-general by date

	From	Date	Subject
Next Message	Joel Burton	2001-07-22 20:04:15	Re: SQL Regular Expression Question
Previous Message	Tom Lane	2001-07-22 19:29:02	Re: Planner estimates cost of 'like' a lot lower than '='??