From: | Peter Kovacs <peter(dot)kovacs(at)chemaxon(dot)hu> |
---|---|
To: | Greg Stark <gsstark(at)mit(dot)edu> |
Cc: | pgsql-general(at)postgresql(dot)org |
Subject: | Re: Why won't it index scan? |
Date: | 2006-05-18 06:52:23 |
Message-ID: | 446C19A7.5000104@chemaxon.hu |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
Sorry for the naive question, but: is there a problem with analyze doing
full table scans? Analyze will not lock anything, will it?
Peter
Greg Stark wrote:
> Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> writes:
>
>
>> "Ed L." <pgsql(at)bluepolka(dot)net> writes:
>>
>>> So, does this sound like we just happened to get repeatedly
>>> horribly unrepresentative random samples with stats target at
>>> 10? Are we at the mercy of randomness here? Or is there a
>>> better preventive procedure we can follow to systematically
>>> identify this kind of situation?
>>>
>> I think the real issue is that stats target 10 is too small for large
>> tables: the samples are just not large enough to support a decent
>> numdistinct estimate, which is the critical stat for cases such as this
>> (ie, estimating the number of hits on a value that's not in the
>> most-common-values list).
>>
>
> There's been some discussion on -hackers about this area. Sadly the idea of
> using samples to calculate numdistinct estimates is fundamentally on pretty
> shaky ground.
>
> Whereas a fixed sample size works fine for calculating distribution of values,
> in order to generate consistent precision for numdistinct estimates the
> samples will have to be a constant fraction of the table -- and unfortunately
> a pretty large fraction at that.
>
> So sadly I think "at the mercy of randomness" is pretty accurate. You'll have
> to raise the statistics target as the table grows and I expect you'll
> eventually run into some downsides of large stats targets.
>
> Some better algorithms were posted, but they would require full table scans
> during analyze, not just samples.
>
>
From | Date | Subject | |
---|---|---|---|
Next Message | Volkan YAZICI | 2006-05-18 09:06:06 | Re: Where I find examples of pqtrace? |
Previous Message | Martijn van Oosterhout | 2006-05-18 05:39:45 | Re: [GENERAL] PGSQL Database Recovery in Portland Oregon Area needed ASAP |