Re: Please comment on the following OpenFTS/tsearch2

From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: Vivek Khera <vivek(at)khera(dot)org>
Cc: pgsql general <pgsql-general(at)postgresql(dot)org>
Subject: Re: Please comment on the following OpenFTS/tsearch2
Date: 2006-04-27 14:53:37
Message-ID: Pine.GSO.4.63.0604271849050.26244@ra.sai.msu.su
Lists: pgsql-general

On Thu, 27 Apr 2006, Vivek Khera wrote:

>
> On Apr 26, 2006, at 3:17 AM, Teodor Sigaev wrote:
>
>> We know of a tsearch2 installation working with 4 million docs.
>>
>
> What are the design goals for the size of the source tables? My engineers
> are telling me of things their friends have tried and have hit limits of
> tsearch2. One was importing a large message board (millions of rows, a few
> sentences of text per row) and ran into problems (which were not detailed).
>
> Our interest is in using it for indexing mailing lists we host. We're
> looking at about 100 or so messages per day right now, with potential growth.
> Short of actually implementing it and loading up sample data, what
> guidelines can you provide as to the limits of tsearch2 source data size?
>
> I can imagine having 10+ million rows of 4k-byte to 10k-byte long messages
> within a couple of years.

It should be no problem with the inverted index we just posted. The search
itself is very fast! The problem is intrinsic to relational databases: reading
data from disk. If you find 100,000 results and want to rank them, you have to
read them from the hard drive, which is slow. That's why we use a caching
search daemon; with 5 million blogs we could serve 1 million searches/day on
a server with 8 GB of RAM.
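
To illustrate the caching idea, here is a minimal sketch in Python. It is not
our daemon: the class name QueryCache, the function fake_ranked_search and the
max_entries parameter are all made up for the example. It only shows the
principle that a repeated query is answered from memory, so the expensive
read-and-rank step runs once per distinct query.

```python
from collections import OrderedDict


class QueryCache:
    """Tiny LRU cache keyed by the normalized query string (illustrative only)."""

    def __init__(self, search_fn, max_entries=10000):
        self.search_fn = search_fn        # the slow function that hits the database
        self.max_entries = max_entries    # hypothetical size limit
        self._cache = OrderedDict()
        self.hits = 0
        self.misses = 0

    def search(self, query):
        # Normalize case and whitespace so trivially different queries share an entry.
        key = " ".join(query.lower().split())
        if key in self._cache:
            self._cache.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self._cache[key]
        self.misses += 1
        result = self.search_fn(key)      # expensive: read rows from disk and rank them
        self._cache[key] = result
        if len(self._cache) > self.max_entries:
            self._cache.popitem(last=False)  # evict the least recently used entry
        return result


def fake_ranked_search(query):
    # Stand-in for the real full-text query plus ranking (assumption, not tsearch2 API).
    return [f"doc-for-{query}-{i}" for i in range(3)]


if __name__ == "__main__":
    cache = QueryCache(fake_ranked_search, max_entries=2)
    cache.search("PostgreSQL  tsearch2")
    cache.search("postgresql tsearch2")   # same query after normalization
    print(cache.hits, cache.misses)

    # 1 million searches/day averages out to roughly 11.6 searches per second.
    print(round(1_000_000 / 86_400, 1))
```

How much this helps depends on how often queries repeat; a workload where
every query is unique gets nothing from such a cache.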


Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru)
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
