From: | "Sven R(dot) Kunze" <srkunze(at)tbz-pariv(dot)de> |
---|---|
To: | Postgres General <pgsql-general(at)postgresql(dot)org> |
Subject: | [to_tsvector] German Compound Words |
Date: | 2015-05-28 11:05:16 |
Message-ID: | 5566F66C.3070701@tbz-pariv.de |
Lists: | pgsql-general |
Hi everybody,
what do I need to do in order to enable compound word handling in
PostgreSQL's tsvector implementation?
I run Ubuntu 14.04 with PostgreSQL 9.3, have installed the package
hunspell-de-de, and have already created a new dictionary as described here:
http://www.postgresql.org/docs/9.3/static/textsearch-dictionaries.html#TEXTSEARCH-ISPELL-DICTIONARY
CREATE TEXT SEARCH DICTIONARY german_hunspell (
    TEMPLATE = ispell,
    DictFile = de_de,
    AffFile = de_de,
    StopWords = german
);
Furthermore, I created a new test text search configuration (copied from german) and updated every token-type mapping that used the german_stem dictionary so that it now uses german_hunspell first and then german_stem.
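The configuration step described above might look roughly like the following sketch. The configuration name german_compound is a placeholder that does not come from the original setup, and the token types listed are the ones the stock german configuration is assumed to map to german_stem; running \dF+ german in psql shows the actual list.

```sql
-- Hypothetical configuration name; copies the parser and mappings from "german".
CREATE TEXT SEARCH CONFIGURATION german_compound (COPY = german);

-- Try german_hunspell first, then fall back to the Snowball stemmer.
-- The token types below are an assumption; verify them with \dF+ german.
ALTER TEXT SEARCH CONFIGURATION german_compound
    ALTER MAPPING FOR asciiword, asciihword, hword_asciipart,
                      word, hword, hword_part
    WITH german_hunspell, german_stem;
```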
However, to_tsvector still does not split compound words. I would expect results such as:
wasserkraft -> wasserkraft, kraft
schifffahrt -> schifffahrt, fahrt
blindflansch -> blindflansch, flansch
etc.
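To narrow down where the splitting gets lost, one option is to query the dictionary on its own and then the whole configuration. This is only a diagnostic sketch; german_compound is a hypothetical name standing in for whatever the test configuration is actually called.

```sql
-- Ask the dictionary directly which lexemes it produces for one token.
SELECT ts_lexize('german_hunspell', 'wasserkraft');

-- Show, per token, which dictionary of the configuration handled it.
-- 'german_compound' is a placeholder for the test configuration's name.
SELECT alias, token, dictionaries, dictionary, lexemes
FROM ts_debug('german_compound', 'wasserkraft schifffahrt blindflansch');

-- The final tsvector, for comparison with the expected results.
SELECT to_tsvector('german_compound', 'wasserkraft schifffahrt blindflansch');
```

If ts_lexize already returns only the unsplit word, the problem would seem to lie in the dictionary or affix file rather than in the configuration mapping.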
What have I done wrong here?
--
Sven R. Kunze
TBZ-PARIV GmbH, Bernsdorfer Str. 210-212, 09126 Chemnitz
Tel: +49 (0)371 33714721, Fax: +49 (0)371 5347920
e-mail: srkunze(at)tbz-pariv(dot)de
web: www.tbz-pariv.de
Managing Director: Dr. Reiner Wohlgemuth
Registered office: Chemnitz
Register court: Chemnitz HRB 8543