From: | Richard Huxton <dev(at)archonet(dot)com> |
---|---|
To: | Mezei Zoltán <mezei(dot)zoltan(at)telefor(dot)hu> |
Cc: | pgsql-performance(at)postgresql(dot)org |
Subject: | Re: Deceiding which index to use |
Date: | 2007-03-09 15:52:23 |
Message-ID: | 45F182B7.30507@archonet.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
Mezei Zoltán wrote:
> Richard Huxton wrote:
>>
>> OK - so the next place to look is the distribution of values for
>> subscriber_id on the output_message_log. Does that have some subscribers
>> with many rows and lots with hardly any?
>>
> Hmm... There are about 1.5k subscribers with 100-200 messages each - all the
> other 19k has an average of 8.9 messages, most of them having only 1 message. I
> think that's exactly the situation you mention...
[snip alter table ... set statistics]
> So if I'm correct: this statistics gathering can be fine tuned, and if i set the
> <num> to 1000 then not only the first 10 subsribers (with most messages) will be
> stored in pg_stats, but the first 1000? Is 1000 a hard-coded
> highest-possible-value? I think it would be best to set that to simething like
> 1800-1900 as I have about that many subscibers with high message count.
There is a cost to increasing the stats values, otherwise it'd already
be set at 1000. In your case I'm not sure if 100-200 vs 8-9 messages is
enough to skew things. Only one way to find out...
--
Richard Huxton
Archonet Ltd
From | Date | Subject | |
---|---|---|---|
Next Message | Alvaro Herrera | 2007-03-09 16:00:09 | Re: Deceiding which index to use |
Previous Message | Mezei Zoltán | 2007-03-09 15:44:18 | Re: Deceiding which index to use |