Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres

From: Atri Sharma <atri(dot)jiit(at)gmail(dot)com>
To: Bruce Momjian <bruce(at)momjian(dot)us>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Bayer, Samuel" <sam(at)mitre(dot)org>, pgsql-general(at)lists(dot)postgresql(dot)org
Subject: Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres
Date: 2022-03-04 16:52:11
Message-ID: CAOeZVic4yXEO3ZW2P3dmV2Zh=x05DbOX3ysfb++_qSLwqnrrWA@mail.gmail.com
Lists: pgsql-general

TF/IDF should be pretty simple to implement IMO.
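
As a rough illustration of why, here is a minimal Python sketch of plain
TF/IDF scoring over an in-memory corpus. The function name tf_idf_scores,
the toy documents, and the log(N/df) weighting are assumptions made for
this example; this is not how ts_rank or ts_rank_cd work, and it ignores
stemming, stop words and normalization.

```python
import math
from collections import Counter

def tf_idf_scores(docs, query_terms):
    """Score each tokenized document against query_terms with plain TF/IDF."""
    n_docs = len(docs)
    # Document frequency: in how many documents does each term appear?
    df = Counter()
    for doc in docs:
        df.update(set(doc))

    scores = []
    for doc in docs:
        tf = Counter(doc)
        score = 0.0
        for term in query_terms:
            if df[term] == 0 or not doc:
                continue  # term absent from the corpus, or empty document
            idf = math.log(n_docs / df[term])  # corpus-wide rarity of the term
            score += (tf[term] / len(doc)) * idf
        scores.append(score)
    return scores

# Hypothetical toy corpus: "recipe" is everywhere, "pizza" is in one document.
docs = [
    ["pizza", "oven", "recipe"],
    ["database", "index", "recipe"],
    ["database", "query", "recipe"],
]
scores = tf_idf_scores(docs, ["pizza", "recipe"])
```

Because "recipe" occurs in every document its IDF is zero, so only the
document containing "pizza" gets a non-zero score. The hard part in
Postgres is not this arithmetic but maintaining the corpus-wide document
frequencies as rows change.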

And no, Solr does not give preference to prior documents.

However, Solr allows you to "boost" specific terms, thus creating the
impression of preference.
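
To show what I mean by boosting, here is a small Python sketch where each
query term carries a weight that multiplies its contribution to the score.
The function boosted_score and the sample data are made up for
illustration; this is not Solr's actual scoring formula, only the general
idea behind a per-term boost.

```python
from collections import Counter

def boosted_score(doc_tokens, query_boosts):
    """Sum term counts in the document, each multiplied by its query boost."""
    tf = Counter(doc_tokens)
    return sum(tf[term] * boost for term, boost in query_boosts.items())

# Two hypothetical documents with the same number of matching terms.
doc_a = ["postgres", "search", "search"]
doc_b = ["postgres", "postgres", "search"]

plain = {"postgres": 1.0, "search": 1.0}    # no boost: both documents tie
boosted = {"postgres": 3.0, "search": 1.0}  # boost "postgres" by a factor of 3

plain_a, plain_b = boosted_score(doc_a, plain), boosted_score(doc_b, plain)
boost_a, boost_b = boosted_score(doc_a, boosted), boosted_score(doc_b, boosted)
```

Without the boost the two documents score the same; with it, doc_b ranks
higher purely because of the weight attached to the term, not because of
anything about the document itself.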

On Fri, 4 Mar 2022, 22:15 Bruce Momjian, <bruce(at)momjian(dot)us> wrote:

> On Fri, Mar 4, 2022 at 11:43:57AM -0500, Tom Lane wrote:
> > "Bayer, Samuel" <sam(at)mitre(dot)org> writes:
> > > One concrete question, I suppose, is: the classic TF/IDF search
> > > strategy relies on inverse document frequency, which looks across the
> > > corpus. I can't tell whether that corpus-wide frequency information is
> > > taken into account in either ranking function.
> >
> > The documentation is pretty clear that they don't, they just consider each
> > document in isolation. Building a structure that would allow more-global
> > info to be taken into account is an interesting project that nobody's
> > tackled.
>
> Oh, you mean like, "pizza" is used only in this document vs it being
> used in every document. Should we add some documentation about this or
> is it already clear?
>
> --
> Bruce Momjian <bruce(at)momjian(dot)us> https://momjian.us
> EDB https://enterprisedb.com
>
> If only the physical world exists, free will is an illusion.
>
>
