Re: Deceiding which index to use

From: Richard Huxton <dev(at)archonet(dot)com>
To: Mezei Zoltán <mezei(dot)zoltan(at)telefor(dot)hu>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: Deceiding which index to use
Date: 2007-03-09 15:52:23
Message-ID: 45F182B7.30507@archonet.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

Mezei Zoltán wrote:
> Richard Huxton wrote:
>>
>> OK - so the next place to look is the distribution of values for
>> subscriber_id on the output_message_log. Does that have some subscribers
>> with many rows and lots with hardly any?
>>
> Hmm... There are about 1.5k subscribers with 100-200 messages each - all the
> other 19k has an average of 8.9 messages, most of them having only 1 message. I
> think that's exactly the situation you mention...

[snip alter table ... set statistics]

> So if I'm correct: this statistics gathering can be fine tuned, and if i set the
> <num> to 1000 then not only the first 10 subsribers (with most messages) will be
> stored in pg_stats, but the first 1000? Is 1000 a hard-coded
> highest-possible-value? I think it would be best to set that to simething like
> 1800-1900 as I have about that many subscibers with high message count.

There is a cost to increasing the stats values, otherwise it'd already
be set at 1000. In your case I'm not sure if 100-200 vs 8-9 messages is
enough to skew things. Only one way to find out...

--
Richard Huxton
Archonet Ltd

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Alvaro Herrera 2007-03-09 16:00:09 Re: Deceiding which index to use
Previous Message Mezei Zoltán 2007-03-09 15:44:18 Re: Deceiding which index to use