From: | Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com> |
---|---|
To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
Cc: | Kenneth Marshall <ktm(at)rice(dot)edu>, Wojciech Knapik <webmaster(at)wolniartysci(dot)pl>, pgsql-performance(at)postgresql(dot)org |
Subject: | Re: FTS performance with the Polish config |
Date: | 2009-11-14 17:24:05 |
Message-ID: | 162867790911140924h7f18ca1aq7722530318e576a@mail.gmail.com |
Lists: | pgsql-performance |
2009/11/14 Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>:
> Kenneth Marshall <ktm(at)rice(dot)edu> writes:
>> On Sat, Nov 14, 2009 at 12:25:05PM +0100, Wojciech Knapik wrote:
>>> I just finished implementing a "search engine" for my site and found
>>> ts_headline extremely slow when used with a Polish tsearch configuration,
>>> while fast with English.
>
>> The documentation for ts_headline() states:
>> ts_headline uses the original document, not a tsvector summary, so it
>> can be slow and should be used with care.
>
> That's true but the argument in the docs would apply just as well to
> english or any other config. So while Wojciech would be well advised
> to try to avoid making a lot of calls to ts_headline, it's still curious
> that it's so much slower in polish than english. Could we see a
> self-contained test case?
Is it dictionary-based or stem-based?
Dictionary-based FTS is very slow on first load. At least Czech FTS is slow.
regards
Pavel Stehule
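
One way to answer the dictionary-or-stemmer question is to ask PostgreSQL which dictionaries the configuration actually uses. The sketch below assumes the configuration is named 'polish'; that name and the sample text are placeholders, not taken from Wojciech's setup. An ispell-type dictionary in the output would mean dictionary-based, a snowball-type one would mean stemmer-based.

```sql
-- Assumption: a text search configuration named 'polish' exists in this
-- database. The name is hypothetical; substitute the one actually in use.

-- ts_debug shows, for each token, which dictionaries were consulted
-- and which one recognized it.
SELECT alias, token, dictionaries, dictionary, lexemes
FROM ts_debug('polish', 'przykładowy tekst do sprawdzenia');

-- To look at the first-load cost: open a fresh psql session, enable
-- \timing, and run the same statement twice. A dictionary is loaded when a
-- session first uses it, so a large gap between the first and second run
-- would point at load time rather than at ts_headline itself.
SELECT to_tsvector('polish', 'przykładowy tekst do sprawdzenia');
SELECT to_tsvector('polish', 'przykładowy tekst do sprawdzenia');
```

If only the first call in each session is slow, reusing connections (for example through a pooler) should hide most of the cost. If every call is slow, the cause is somewhere else.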
>
> regards, tom lane