From: | "David G(dot) Johnston" <david(dot)g(dot)johnston(at)gmail(dot)com> |
---|---|
To: | Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp> |
Cc: | "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Typos/Questions in bloom documentation |
Date: | 2016-04-21 16:25:10 |
Message-ID: | CAKFQuwbczigM80D8njmk_vsSk6xKQtt6BH3C6Sad3M+G+oKtWA@mail.gmail.com |
Lists: | pgsql-hackers |
On Wed, Apr 20, 2016 at 9:18 PM, Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp> wrote:
> On 2016/04/21 11:19, David G. Johnston wrote:
> > On Wednesday, April 20, 2016, Amit Langote wrote:
> >> On 2016/04/21 6:51, David G. Johnston wrote:
> >>> http://www.postgresql.org/docs/devel/static/bloom.html
> >>>
> >>> F.4.3 Examples
> >>>
> >>> Claims that the signature length is 80 bits - shouldn't it be 8?
> >>
> >> In F.4.1. Introduction:
> >>
> >> ... The user can specify signature length (in uint16, default is 5)
> >>
> >> So, it seems right to me.
> >>
> > Great. Maybe you can consider re-wording it so others can understand. I
> > have no clue how 80 bits is determined. The phrase you quote is obtuse to
> > the casual user as well. If that means 16x5=80 irrespective of columns, it
> > is not clear.
>
> I agree it's unclear. Does the following make it any better (updated
> patch attached):
>
> - The user can specify signature length (in uint16, default is 5) and the
> - number of bits, which can be set per attribute (1 < colN < 2048).
> + The user can specify signature length in units of 16 bits (default is 5)
> + and the number of bits per indexed attribute.
>
>
Better. The "and" is confusing. Is the signature length the sum of 16x5
+ (bits per indexed attribute)?
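If I am reading the quoted docs right, the 80 comes purely from the length parameter, and the per-column bits do not add to it. A minimal sketch of that arithmetic in Python, assuming that reading; `signature_bits` is a name made up for illustration, not anything in the bloom module:

```python
UINT16_BITS = 16  # width of one uint16 unit


def signature_bits(length_units: int = 5) -> int:
    """Signature length in bits for a `length` given in uint16 units.

    Assumption: per-column bit counts are not added to this total.
    """
    return UINT16_BITS * length_units


print(signature_bits())  # 80
```

If the per-column bits were in fact added on top, the documented 80 would not match the example, which is why the wording needs to say which it is.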

> By the way, now I am slightly confused as well about per-column bits
> assignment thing:
>
> In F.4.1. Introduction:
>
> ... and the number of bits, which can be set per attribute (1 < colN <
> 2048).
>
> And then in F.4.2. Parameters:
>
> bloom indexes accept the following parameters in the WITH clause.
>
> length
> Length of signature in uint16 type values
>
How about: "Number of 16-bit units to use for the signature"
> col1 — col16
> Number of bits for corresponding column
>
> Which is it: col1 - col2048 or col1 - col16? Or are they different things
> altogether?
>
Good question...
David J.