Quick Links

Re: Thoughts on statistics for continuously advancing columns

From:	"Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To:	"Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	"Josh Berkus" <josh(at)agliodbs(dot)com>, "Nathan Boley" <npboley(at)gmail(dot)com>, <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Thoughts on statistics for continuously advancing columns
Date:	2009-12-30 16:33:29
Message-ID:	4B3B2C79020000250002DAA4@gw.wicourts.gov
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:

> Well, the problem Josh has got is exactly that a constant high
> bound doesn't work.

I thought the problem was that the high bound in the statistics fell
too far below the actual high end in the data. This tends (in my
experience) to be much more painful than an artificially extended
high end in the statistics. (YMMV, of course.)

> What I'm wondering about is why he finds that re-running ANALYZE
> isn't an acceptable solution. It's supposed to be a reasonably
> cheap thing to do.

Good point. We haven't hit this problem in PostgreSQL precisely
because we can run ANALYZE often enough to prevent the skew from
becoming pathological.

> I think the cleanest solution to this would be to make ANALYZE
> cheaper, perhaps by finding some way for it to work incrementally.

Yeah, though as you say above, it'd be good to know why frequent
ANALYZE is a problem as it stands.

-Kevin

In response to

Re: Thoughts on statistics for continuously advancing columns at 2009-12-30 16:16:45 from Tom Lane

Responses

Re: Thoughts on statistics for continuously advancing columns at 2009-12-31 04:17:29 from Craig Ringer

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2009-12-30 16:33:44	Re: test/example does not support win32.
Previous Message	Joshua D. Drake	2009-12-30 16:31:26	Re: Thoughts on statistics for continuously advancing columns