Quick Links

Re: "Bug" in statistics for v7.2?

From:	"Zeugswetter Andreas SB SD" <ZeugswetterA(at)spardat(dot)at>
To:	"Marc G(dot) Fournier" <scrappy(at)hub(dot)org>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	<pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: "Bug" in statistics for v7.2?
Date:	2002-02-15 17:18:35
Message-ID:	46C15C39FEB2C44BA555E356FBCD6FA488780E@m0114.s-mxs.net
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

> That explains it ...
>
> profiles_faith | count
> ----------------+--------
> 0 | 485938
> 1 | 2
> 2 | 6
> 7 | 2
> 8 | 21
> (5 rows)
>
> Cool, another waste of space *sigh*
>
> thanks ...
>
>
> On Wed, 13 Feb 2002, Tom Lane wrote:
>
> > "Marc G. Fournier" <scrappy(at)hub(dot)org> writes:
> > > Okay, if I'm understanding pg_stats at all, which I may not be, n_distinct
> > > should represent # of distinct values in that row, no?
> > > But, I have one field that has 5 distinct values:
> > > But pg_stats is reporting 1:
> >
> > The pg_stats values are only, um, statistical. If 99.9% of the table is
> > the same value and the other four values appear only once or twice, it's
> > certainly possible for ANALYZE's sample to include only the common value
> > and miss the rare ones. AFAIK that will not break anything; if you have
> > an example where the planner seems to be fooled because of this, let's
> > see it.

Hmm ? How about select * from xxx where profiles_faith = 7
would estimate all rows, no ? Instead of 2.
That is why I think a bin for "very uncommon" values could also be
useful sometimes.

Andreas

Responses

Re: "Bug" in statistics for v7.2? at 2002-02-15 17:45:39 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Thomas Swan	2002-02-15 17:31:19	possible pg_dumpall (7.1.3) bug
Previous Message	Zeugswetter Andreas SB SD	2002-02-15 17:13:14	Re: alter table drop column status