Re: Wich hardware suits best for large full-text indexed

From:	Diogo Biazus <diogo(at)ikono(dot)com(dot)br>
To:	Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: Wich hardware suits best for large full-text indexed
Date:	2004-03-31 17:19:06
Message-ID:	406AFD8A.8090305@ikono.com.br
Lists:	pgsql-general

Oleg Bartunov wrote:

>On Tue, 30 Mar 2004, Diogo Biazus wrote:
>
>>Hi folks,
>>
>>I have a database using tsearch2 to index 300 000 documents.
>>I have already optimized the queries, and the database is vacuumed on
>>a daily basis.
>>The stat function tells me that my index has approx. 460 000 unique words
>>(I'm using a stemmer and a nice stopword list).
>
>460 000 unique words is a lot! Have you looked at them? Sometimes it's
>very useful to analyze what you have indexed and whether you want all of it.
>I suggest you use an ispell dictionary and, if you index numbers
>(check the statistics), use the special dictionaries for integer and decimal numbers:
>http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/dicts/README.intdict
>
I'll try the ispell dictionaries and the dictionaries for numbers too ;)
Could the synonym dictionary help me with this (reducing unique words)?
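
To make the idea concrete, here is a rough Python sketch of how dictionary-style normalization can shrink the set of unique words. It is only a toy model, not how tsearch2 is implemented: the synonym entries, the MAX_INT_LEN cap, and the function names are all made up for illustration, and it assumes a number dictionary that simply truncates long integers to a fixed number of digits.

```python
# Toy model of dictionary normalization; NOT tsearch2's actual implementation.
# SYNONYMS and MAX_INT_LEN are hypothetical values chosen for illustration.
SYNONYMS = {"postgres": "postgresql", "pgsql": "postgresql"}
MAX_INT_LEN = 4  # assumed cap on the digits kept for integer tokens


def normalize(token: str) -> str:
    """Map a raw token to the form that would be stored in the index."""
    token = token.lower()
    if token.isdigit():
        # Long integers collapse onto a shared prefix.
        return token[:MAX_INT_LEN]
    # Synonyms collapse onto one canonical word; other words pass through.
    return SYNONYMS.get(token, token)


def unique_words(tokens):
    """Return the set of distinct index entries for a list of tokens."""
    return {normalize(t) for t in tokens}


tokens = ["Postgres", "pgsql", "PostgreSQL", "20040331", "20040330", "index"]
raw = {t.lower() for t in tokens}   # distinct words with lowercasing only
reduced = unique_words(tokens)      # distinct words after normalization
print(len(raw), len(reduced))
```

In this toy input the three spellings fold into one entry and the two date-like integers share a prefix, so the count of distinct entries drops. How much a real index shrinks depends on how many synonym pairs and long numbers the documents actually contain.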

thanks,

--
Diogo Biazus
diogo(at)ikono(dot)com(dot)br
http://www.ikono.com.br

In response to

Re: Wich hardware suits best for large full-text indexed at 2004-03-31 11:18:58 from Oleg Bartunov

Responses

Re: Wich hardware suits best for large full-text indexed at 2004-04-01 10:02:07 from Oleg Bartunov
