Re: BUG #8354: stripped positions can generate nonzero rank in ts_rank_cd

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: alex(at)hill(dot)net(dot)au
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #8354: stripped positions can generate nonzero rank in ts_rank_cd
Date: 2014-01-31 21:22:51
Message-ID: 20140131212251.GQ19957@momjian.us
Lists: pgsql-bugs


Would someone please comment on this text search bug report? Thanks.

---------------------------------------------------------------------------

On Fri, Aug 2, 2013 at 07:03:42AM +0000, alex(at)hill(dot)net(dot)au wrote:
> The following bug has been logged on the website:
>
> Bug reference: 8354
> Logged by: Alex Hill
> Email address: alex(at)hill(dot)net(dot)au
> PostgreSQL version: 9.2.4
> Operating system: OS X 10.8.4 Mountain Lion
> Description:
>
> Hi all,
>
>
> The docs for ts_rank_cd state:
>
>
> "This function requires positional information in its input. Therefore it
> will not work on "stripped" tsvector values — it will always return zero."
>
>
> However, if a tsvector contains some stripped lexemes and some non-stripped
> ones, ts_rank_cd will rank extents that include the non-stripped values.
>
>
> For example, this evaluates to zero as expected:
>
>
> SELECT ts_rank_cd(strip(to_tsvector('text search')),
>                   plainto_tsquery('text search'));
>
> But this doesn't:
>
>
> SELECT ts_rank_cd(to_tsvector('text') || strip(to_tsvector('search')),
>                   plainto_tsquery('text search'));
>
> I think this is a bug, if not in the code then in the documentation, which
> isn't clear on what happens when stripped and positioned lexemes are mixed
> in one tsvector.
>
>
> I would prefer that stripped lexemes were completely ignored by ts_rank_cd:
> my use case is using this as a fifth pseudo-weight, which matches a @@ query
> but doesn't add to a ts_rank_cd ranking.
>
>
> What do you think?
>
>
> Cheers,
> Alex

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ Everyone has their own god. +
