| From: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
|---|---|
| To: | "John Surcombe" <John(dot)Surcombe(at)digimap(dot)gg> |
| Cc: | pgsql-performance(at)postgresql(dot)org, "Kieran Senior" <Kieran(dot)Senior(at)digimap(dot)gg> |
| Subject: | Re: Planner wrongly shuns multi-column index for select .. order by col1, col2 limit 1 |
| Date: | 2011-03-14 17:24:33 |
| Message-ID: | 13031.1300123473@sss.pgh.pa.us |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-performance |
"John Surcombe" <John(dot)Surcombe(at)digimap(dot)gg> writes:
>> It'd be
>> useful to see the pg_stats.correlation value for both the userid and
>> receiveddatetime columns.
> Yes, the table is indeed nearly perfectly ordered by receiveddatetime
> (correlation 0.998479). correlation on userid is -0.065556. n_distinct
> on userid is also low: 1097.
Ah-hah.
> Is the problem perhaps something like the following: PostgreSQL is
> thinking that because there are not many userids and there is low
> correlation, that if it just scans the table from the top in date order,
> this will be cheap (because receiveddatetime correlation is high so it
> won't have to seek randomly), and it won't have to scan very far before
> it finds the first row with a matching userid.
There's some of that, but I think the main problem is that there's a
very high discount on the cost estimate for a perfectly-correlated
index, and that makes it end up looking cheaper to use than the
uncorrelated one. (It doesn't help any that we don't do correlation
properly for multicolumn indexes; but given what you say above, the
correlation estimate for the two-column index would be small even if
we'd computed it exactly.)
You might find that reducing random_page_cost would avoid the problem.
That should reduce the advantage conferred on the high-correlation
index, and it probably would represent your actual configuration better
anyway, given the results you're showing here.
regards, tom lane
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Merlin Moncure | 2011-03-14 17:34:31 | Re: Performance regression from 8.3.7 to 9.0.3 |
| Previous Message | Claudio Freire | 2011-03-14 17:24:31 | Bug in the planner? |