From: | Emre Hasegeli <emre(at)hasegeli(dot)com> |
---|---|
To: | obartunov(at)gmail(dot)com |
Cc: | Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: new full text search configurations |
Date: | 2015-11-21 09:26:04 |
Message-ID: | CAE2gYzzFT-jxYv2gRaQC-MVNvwpmYLr8NCFxYwjKOojPLP1vLg@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
> I checked new snowball site http://snowballstem.org/ and found several new
> stemmers appeared (as external contributions):
>
> Irish and Czech
> Object Pascal codegenerator for Snowball
> Two stemmers for Romanian
> Hungarian
> Turkish
> Armenian
> Basque (Euskera)
> Catalan
>
> Some of them we don't have in our list of default configurations. Since
> these are external, not official stemmers, it'd be nice if people look and
> test them. If they are fine, we can prepare new configurations for 9.6.
We have configurations for the ones included to the Snowball, namely
Romanian, Hungarian, and Turkish. I don't know why the others are not
included but listed on the page as external contributions. It might
be a good idea to wait for someone to commit them to the upstream.
I have checked the changes on the algorithms [1]. They don't seemed
to be updated much after 2007, but recently a new one for Tamil
language is added. It might be a good candidate for a new
configuration.
[1] https://github.com/snowballstem/snowball/commits/master/algorithms
From | Date | Subject | |
---|---|---|---|
Next Message | Alvaro Herrera | 2015-11-21 11:59:02 | Re: CreateFunction Statement |
Previous Message | Pavel Stehule | 2015-11-21 06:49:33 | custom function for converting human readable sizes to bytes |