| From: | Alexander Korotkov <aekorotkov(at)gmail(dot)com> |
|---|---|
| To: | Noah Misch <noah(at)leadboat(dot)com> |
| Cc: | Nathan Boley <npboley(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org> |
| Subject: | Re: Collect frequency statistics for arrays |
| Date: | 2012-01-22 21:21:20 |
| Message-ID: | CAPpHfdvm1z0dQ-v0=_+QF_Ws8LXfE_75xQ-n4dzR6eyffh213Q@mail.gmail.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
Hi!
Updated patch is attached. I've updated comment
of mcelem_array_contained_selec with more detailed description of
probability distribution assumption. Also, I found that "rest" behavious
should be better described by Poisson distribution, relevant changes were
made.
On Tue, Jan 17, 2012 at 2:33 PM, Noah Misch <noah(at)leadboat(dot)com> wrote:
>
> By "summary frequency of elements", do you mean literally P_0 + P_1 ... +
> P_N?
> If so, I can follow the above argument for "column && const" and "column <@
> const", but not for "column @> const". For "column @> const", selectivity
> cannot exceed the smallest frequency among const elements. A number of
> high-frequency elements will drive up the sum of the frequencies without
> changing the true selectivity much at all.
>
Referencing to summary frequency is not really correct. It would be more
correct to reference to number of element in "const". When there are many
elements in "const", "column @> const" selectivity tends to be close to 0
and "column @> const" tends to be close to 1. Surely, it's true when
elements have some kind of middle values of frequencies (not very close to
0 and not very close to 1). I've replaced "summary frequency of elements"
by "number of elements".
------
With best regards,
Alexander Korotkov.
| Attachment | Content-Type | Size |
|---|---|---|
| arrayanalyze-0.12.patch.gz | application/x-gzip | 23.9 KB |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Mikko Tiihonen | 2012-01-22 21:47:06 | Re: Optimize binary serialization format of arrays with fixed size elements |
| Previous Message | Simon Riggs | 2012-01-22 21:06:49 | Re: [PATCH] Support for foreign keys with arrays |