Re: Naming of the prefab snowball stemmer dictionaries

From: "A(dot)M(dot)" <agentm(at)themactionfaction(dot)com>
To: PostgreSQL-development hackers <pgsql-hackers(at)postgreSQL(dot)org>
Subject: Re: Naming of the prefab snowball stemmer dictionaries
Date: 2007-08-22 15:23:35
Message-ID: 721CB882-4C3A-4250-B018-A3C53DD658D5@themactionfaction.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers


On Aug 22, 2007, at 11:10 , Tom Lane wrote:

> I notice that the existing tsearch documentation that we've imported
> fairly consistently refers to Snowball dictionaries with names like
> "en_stem", "ru_stem", etc. However, CVS HEAD is set up to create them
> with names "english", "russian", etc. As I've been absorbing more of
> the docs I'm starting to wonder whether this is a good idea. ISTM
> that these names encourage a novice to think that the one dictionary
> is all you could need for a given language; and there are enough
> examples of more-complex setups in the docs to make it clear that
> in fact Snowball is not the be-all and end-all of dictionaries.
>
> I'm thinking that going back to the old naming convention (or
> something
> like it --- maybe "english_stem", "russian_stem", etc) would be
> better.
> It'd help to give the right impression, namely that these dictionaries
> are a component of a solution but not necessarily all you need.

Please use ISO 639 codes plus any qualifiers to reduce confusion.
http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes

-M

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Zeugswetter Andreas ADI SD 2007-08-22 16:13:33 Re: Naming of the prefab snowball stemmer dictionaries
Previous Message Heikki Linnakangas 2007-08-22 15:12:40 Re: Crash with empty dictionary