From: | Simon Riggs <simon(at)2ndquadrant(dot)com> |
---|---|
To: | Chris Browne <cbbrowne(at)acm(dot)org> |
Cc: | pgsql-patches(at)postgresql(dot)org |
Subject: | Re: Better default_statistics_target |
Date: | 2007-12-06 19:34:42 |
Message-ID: | 1196969682.4255.455.camel@ebony.site |
Lists: | pgsql-hackers pgsql-patches |
On Wed, 2007-12-05 at 15:13 -0500, Chris Browne wrote:
> I have the theory (thus far not borne out by any numbers) that it
> might be a useful approach to try to go through the DB schema and use
> what information is there to try to come up with better numbers on a
> per-column basis.
Yes, agreed.
The difficulty is making this work for generic datatypes.
> - Datestamps tend to imply temporal dispersion, ergo "somewhat fewer
> bins." Similar for floats.
Hmmm, not sure about that one. Some date/time columns can change very
quickly over time, so the stats are frequently out of date.
> Then could come a "second order" perspective, where data would
> actually get sampled from pg_statistics.
>
> - If we look at the number of distinct histogram bins used, for a
> particular column, and find that there are some not used, we might
> drop bins.
The histograms are height-balanced, so all of the bins are always in use.
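
To illustrate the point about height-balanced histograms, here is a minimal sketch in Python. It is not PostgreSQL's ANALYZE code; the function names and the sample data are hypothetical, and it assumes distinct values for simplicity. It compares an equal-width histogram, which can leave bins empty, with a height-balanced (equi-depth) one, where the bin boundaries are taken from the sorted data so every bin holds roughly the same number of values.

```python
import bisect


def equi_depth_bounds(values, num_bins):
    """Return num_bins + 1 boundaries chosen at evenly spaced ranks.

    Hypothetical helper: boundaries come from the sorted data itself,
    so each bin covers roughly the same number of values.
    """
    if num_bins < 1:
        raise ValueError("num_bins must be at least 1")
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    last = len(ordered) - 1
    return [ordered[(i * last) // num_bins] for i in range(num_bins + 1)]


def counts_for_bounds(values, bounds):
    """Count how many values fall into each bin defined by bounds."""
    counts = [0] * (len(bounds) - 1)
    for v in values:
        idx = bisect.bisect_right(bounds, v) - 1
        # Clamp so the maximum value lands in the last bin.
        idx = min(max(idx, 0), len(counts) - 1)
        counts[idx] += 1
    return counts


def equal_width_counts(values, num_bins):
    """Count values per bin when the value range is split into equal widths."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        idx = min(int((v - lo) / width), num_bins - 1)
        counts[idx] += 1
    return counts


# Two clusters of distinct values separated by a large gap (made-up data).
sample = list(range(100)) + [10_000 + i for i in range(100)]

width_counts = equal_width_counts(sample, 4)
depth_counts = counts_for_bounds(sample, equi_depth_bounds(sample, 4))
```

With this sample the equal-width histogram leaves its two middle bins empty, while the height-balanced one has no empty bins, so there are no unused bins to drop. Real column data with many duplicate values would need extra handling that this sketch omits.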
--
Simon Riggs
2ndQuadrant http://www.2ndQuadrant.com