Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: "Bayer, Samuel" <sam(at)mitre(dot)org>
Cc: Atri Sharma <atri(dot)jiit(at)gmail(dot)com>, pgsql-general(at)lists(dot)postgresql(dot)org
Subject: Re: [EXT] Re: Looking for tips on improving full-text search quality in Postgres
Date: 2022-03-04 15:54:01
Message-ID: YiI2GWUWvsvOp02z@momjian.us
Lists: pgsql-general

On Fri, Mar 4, 2022 at 10:41:16AM -0500, Bayer, Samuel wrote:
> Example anecdote: the documents I'm searching come with metadata
> (e.g., title), which I'm not indexing specially (not a separate field,
> just part of the raw text of the document). When I search even for
> single terms, and look at the titles of the results, the titles in the
> Solr results more frequently contain that term than the titles in the
> Postgres results. I also FEEL like I've noticed that the problem is
> more apparent in "OR" queries; if I search for a disjunction of terms,
> the documents that contain all the terms are more likely to be high in
> the Solr rankings than in the Postgres rankings.
>
> I apologize for not being able to be more specific.

I know it is hard to quantify. Is it possible that Postgres is treating
all the terms equally, while Solr is prioritizing terms that are earlier
in the document?
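
To make that question concrete, here is a toy sketch in Python. It is
not how Postgres's ts_rank or Solr's scoring actually works; the two
scoring functions, the decay parameter, and the sample documents are
all hypothetical and exist only to show how equal term weighting and
position-sensitive weighting can order the same documents differently.

```python
def equal_weight_score(tokens, terms):
    """Count every occurrence of a query term the same, wherever it appears."""
    wanted = set(terms)
    return float(sum(1 for tok in tokens if tok in wanted))


def position_weighted_score(tokens, terms, decay=0.5):
    """Count occurrences, but let earlier positions contribute more.

    The 1 / (1 + decay * position) falloff is an arbitrary illustrative
    choice, not a formula taken from either search engine.
    """
    wanted = set(terms)
    return sum(
        1.0 / (1.0 + decay * pos)
        for pos, tok in enumerate(tokens)
        if tok in wanted
    )


# Hypothetical documents: the first one opens with a title-like phrase
# containing the term, the second mentions the term twice but late.
docs = {
    "title_hit": "postgres search tips a short note about ranking results".split(),
    "late_hits": "a long note about other things postgres and more postgres".split(),
}

query = ["postgres"]
for name, tokens in docs.items():
    print(
        name,
        equal_weight_score(tokens, query),
        round(position_weighted_score(tokens, query), 3),
    )
```

With equal weighting the document that mentions the term twice scores
higher; with the position-sensitive score the document whose opening
words contain the term comes first. In Postgres, this kind of emphasis
is normally expressed through weights: setweight() labels lexemes A
through D and ts_rank() accepts a weights array, which only helps if
the title is available as a separate piece of text.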

--
Bruce Momjian <bruce(at)momjian(dot)us> https://momjian.us
EDB https://enterprisedb.com

If only the physical world exists, free will is an illusion.
