Quick Links

Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres

From:	Bruce Momjian <bruce(at)momjian(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	"Bayer, Samuel" <sam(at)mitre(dot)org>, Atri Sharma <atri(dot)jiit(at)gmail(dot)com>, pgsql-general(at)lists(dot)postgresql(dot)org
Subject:	Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres
Date:	2022-03-04 16:45:42
Message-ID:	YiJCNtCZGIL1dzUH@momjian.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

On Fri, Mar 4, 2022 at 11:43:57AM -0500, Tom Lane wrote:
> "Bayer, Samuel" <sam(at)mitre(dot)org> writes:
> > One concrete question, I suppose, is: the classic TF/IDF search strategy relies on inverse document frequency, which looks across the corpus. I can't tell whether that corpus-wide frequency information is taken into account in either ranking function.
>
> The documentation is pretty clear that they don't, they just consider each
> document in isolation. Building a structure that would allow more-global
> info to be taken into account is an interesting project that nobody's
> tackled.

Oh, you mean like, "pizza" is used only in this document vs it being
used in every document. Should we add some documentation about this or
is it already clear?

--
Bruce Momjian <bruce(at)momjian(dot)us> https://momjian.us
EDB https://enterprisedb.com

If only the physical world exists, free will is an illusion.

In response to

Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres at 2022-03-04 16:43:57 from Tom Lane

Responses

Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres at 2022-03-04 16:52:11 from Atri Sharma

Browse pgsql-general by date

	From	Date	Subject
Next Message	Atri Sharma	2022-03-04 16:52:11	Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres
Previous Message	Bruce Momjian	2022-03-04 16:44:26	Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres