Re: Better estimates of index correlation

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Alvaro Herrera <alvherre(at)commandprompt(dot)com>
Cc: Robert Haas <robertmhaas(at)gmail(dot)com>, jd <jd(at)commandprompt(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Better estimates of index correlation
Date: 2011-03-14 14:38:59
Message-ID: 8159.1300113539@sss.pgh.pa.us
Lists: pgsql-hackers

Alvaro Herrera <alvherre(at)commandprompt(dot)com> writes:
> Excerpts from Robert Haas's message of Mon Mar 14 11:18:24 -0300 2011:
>> Does it really matter? What Tom was describing sounded embarrassingly cheap.

That was my thought exactly. If you could even measure the added cost
of doing that, I'd be astonished. It'd be adding one comparison-and-
possible-assignment to a loop that also has to invoke a binary search
of a TID array --- a very large array, in the cases we're worried about.
I'd put the actual update of pg_statistic somewhere where it only
happened once, but I don't especially care if the stat gets computed on
each index scan.
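
For concreteness, here is a minimal standalone C sketch of that
bookkeeping; the Tid type and the function names below are invented for
illustration and are not PostgreSQL's actual data structures or code:

/*
 * Standalone sketch: while walking index entries in index order, one
 * comparison and a possible assignment per tuple tracks how often the
 * referenced heap TIDs are in ascending order.  The resulting fraction
 * is a cheap proxy for index correlation.
 */
#include <stdio.h>
#include <stdint.h>

typedef struct Tid { uint32_t block; uint16_t offset; } Tid;

static int tid_cmp(Tid a, Tid b)
{
    if (a.block != b.block)
        return a.block < b.block ? -1 : 1;
    if (a.offset != b.offset)
        return a.offset < b.offset ? -1 : 1;
    return 0;
}

/* Returns the fraction of adjacent TID pairs that are in heap order. */
static double ordered_fraction(const Tid *tids, size_t n)
{
    size_t in_order = 0;

    for (size_t i = 1; i < n; i++)
        if (tid_cmp(tids[i - 1], tids[i]) < 0)  /* the one comparison... */
            in_order++;                         /* ...and possible assignment */

    return n > 1 ? (double) in_order / (n - 1) : 1.0;
}

int main(void)
{
    /* Hypothetical TIDs in the order an index scan would visit them. */
    Tid tids[] = { {1, 1}, {1, 2}, {2, 1}, {1, 3}, {3, 1} };

    printf("ordered fraction: %.2f\n",
           ordered_fraction(tids, sizeof(tids) / sizeof(tids[0])));
    return 0;
}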

> As Heikki says, maybe this wouldn't be an issue at all if we can do it
> during ANALYZE instead, but I don't know if that works.

I'm not convinced you can get a sufficiently good estimate from a small
subset of pages.

I actually started with the idea of having ANALYZE try to calculate
correlation for multi-column indexes the same way it now calculates it
for individual data columns, but when this idea occurred to me it just
seemed a whole lot better. Note that we could remove the correlation
calculations from ANALYZE altogether.
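
For comparison, here is an equally rough standalone sketch of the kind
of per-column correlation ANALYZE computes today (the Sample type and
function names are invented; this is not the actual ANALYZE code): the
Pearson correlation between a sampled row's physical position and its
rank in value order, where values near +/-1 mean the column is nearly
in heap order.

/*
 * Standalone sketch of single-column correlation over a row sample.
 * Assumes the values are not all equal (a real implementation would
 * guard against a zero denominator).
 */
#include <stdio.h>
#include <stdlib.h>
#include <math.h>

typedef struct Sample { double value; int heap_pos; } Sample;

static int by_value(const void *a, const void *b)
{
    double va = ((const Sample *) a)->value;
    double vb = ((const Sample *) b)->value;

    return (va > vb) - (va < vb);
}

static double correlation(Sample *s, int n)
{
    double sx = 0, sy = 0, sxx = 0, syy = 0, sxy = 0;

    /* Sort by value so the array index becomes the value-order rank. */
    qsort(s, n, sizeof(Sample), by_value);
    for (int rank = 0; rank < n; rank++)
    {
        double x = rank, y = s[rank].heap_pos;

        sx += x; sy += y; sxx += x * x; syy += y * y; sxy += x * y;
    }
    return (n * sxy - sx * sy) /
           sqrt((n * sxx - sx * sx) * (n * syy - sy * sy));
}

int main(void)
{
    /* Hypothetical sampled values with their physical positions. */
    Sample s[] = { {1.0, 0}, {3.0, 1}, {2.0, 2}, {4.0, 3} };

    printf("correlation: %.2f\n", correlation(s, 4));
    return 0;
}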

regards, tom lane
