From: | Csaba Nagy <nagy(at)ecircle-ag(dot)com> |
---|---|
To: | postgres performance list <pgsql-performance(at)postgresql(dot)org> |
Cc: | Simon Riggs <simon(at)2ndquadrant(dot)com> |
Subject: | Re: Searching for the cause of a bad plan |
Date: | 2007-09-27 11:53:24 |
Message-ID: | 1190894004.5430.46.camel@PCD12478 |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
On Wed, 2007-09-26 at 11:22 -0400, Tom Lane wrote:
> ... how
> many values of "a" are there really, and what's the true distribution of
> counts?
table_a has 23366 distinct values. Some statistics (using R):
> summary(table_a_histogram)
a count
Min. : 70000857 Min. : 1
1st Qu.:700003628 1st Qu.: 9
Median :700011044 Median : 22
Mean :622429573 Mean : 17640
3rd Qu.:700018020 3rd Qu.: 391
Max. :800003349 Max. :3347707
I'm not sure what you want to see in terms of distribution of counts, so
I created 2 plots: "a" against the counts for each distinct "a" value,
and the histogram of the log of the counts (without the log it's not
really readable). I hope they'll make it through to the list...
> Do the plan estimates get closer to reality if you set a higher
> statistics target?
The results of setting higher statistics targets are attached too. I
can't tell if the stats are closer to reality or not, but the plan
changes in any case...
Cheers,
Csaba.
Attachment | Content-Type | Size |
---|---|---|
image/png | 2.4 KB | |
image/png | 3.0 KB | |
bumped_stats.txt | text/plain | 13.4 KB |
From | Date | Subject | |
---|---|---|---|
Next Message | Heikki Linnakangas | 2007-09-27 12:39:07 | Re: Difference in query plan when using = or > in where clause |
Previous Message | Ow Mun Heng | 2007-09-27 06:59:13 | Re: REPOST: Nested loops row estimates always too high |