Re: tsearch2, large data and indexes

From: Ivan Voras <ivoras(at)freebsd(dot)org>
To: Heikki Linnakangas <hlinnakangas(at)vmware(dot)com>
Cc: Sergey Konoplev <gray(dot)ru(at)gmail(dot)com>, Jeff Janes <jeff(dot)janes(at)gmail(dot)com>, postgres performance list <pgsql-performance(at)postgresql(dot)org>
Subject: Re: tsearch2, large data and indexes
Date: 2014-04-24 12:34:04
Message-ID: CAF-QHFWNt-+AKyYQDf=d56b198tRObmfrNwzBj78h2CLUw3T4g@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

On 24 April 2014 13:34, Heikki Linnakangas <hlinnakangas(at)vmware(dot)com> wrote:

> As the docs say, the GIN index does not store the weights. As such, there is
> no need to strip them. A recheck would be necessary if your query needs the
> weights, precisely because the weights are not included in the index.
>
> (In the OP's query, it's the ranking that was causing the detoasting.)

Thanks!

My problem is that I actually need the ranking. My queries can return
a large number of documents (tens of thousands) but I usually need
only the first couple of pages of most relevant results (e.g. 50-100
records). With PostgreSQL and tsearch2, this means that the tens of
thousands of documents found via the index are then detoasted and
ranked.

Does anyone have experience with external search engines which also
have ranking but are more efficient? How about Solr?

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Sergey Konoplev 2014-04-24 19:27:06 Re: tsearch2, large data and indexes
Previous Message Heikki Linnakangas 2014-04-24 11:34:22 Re: tsearch2, large data and indexes