From: | "Jodok Batlogg" <jodok(at)lovelysystems(dot)com> |
---|---|
To: | pgsql-general(at)postgresql(dot)org |
Subject: | tsearch2 problem |
Date: | 2008-10-31 07:49:07 |
Message-ID: | 47b22fd00810310049u7e8be025xb4f26cb39978efee@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
we're using tsearch2 with the german dictionary
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/dicts/ispell/ispell-german-compound.tar.gz
for fulltext search.
the indexing is configured as follows:
CREATE TEXT SEARCH DICTIONARY public.german (
TEMPLATE = ispell,
DictFile = german,
AffFile = german,
StopWords = german
);
CREATE TEXT SEARCH CONFIGURATION public.default ( COPY = pg_catalog.german );
ALTER TEXT SEARCH CONFIGURATION public.default
ALTER MAPPING FOR asciiword, asciihword, hword_asciipart,
word, hword, hword_part
WITH public.german;
-------------------------
select * from ts_debug('default', 'hundshütte');
works as expected: creates the two lexemes: "{hund,hütte}"
BUT
SELECT to_tsvector('default','lovely und bauarbeiter/in');
looses a lot of stuff:
"'bauarbeiter/in':2"
some more debugging shows:
SELECT * from ts_debug('default','lovely und bauarbeiter/in');
"asciiword";"Word, all ASCII";"lovely";"{german}";"german";""
"blank";"Space symbols";" ";"{}";"";""
"asciiword";"Word, all ASCII";"und";"{german}";"german";"{}"
"blank";"Space symbols";" ";"{}";"";""
"file";"File or path
name";"bauarbeiter/in";"{simple}";"simple";"{bauarbeiter/in}"
a) unknown words are just beeing dropped
b) words with slashes are interpreted as file paths and the first path
is beeing dropped.
any idea how we can fix this?
jodok
--
Jodok Batlogg, Vorstand
Lovely Systems AG
Telefon +43 5572 908060, Fax +43 5572 908060-77, Mobil +43 664 9636963
Schmelzhütterstraße 26a, 6850 Dornbirn, Austria
Sitz: Dornbirn, FB: Landesgericht Feldkirch, FN: 208859x, UID: ATU51736705
Aufsichtsratsvorsitzender: Christian Lutz
Vorstand: Jodok Batlogg, Manfred Schwendinger
From | Date | Subject | |
---|---|---|---|
Next Message | Craig Ringer | 2008-10-31 08:00:08 | Re: Decreasing WAL size effects |
Previous Message | Craig Ringer | 2008-10-31 07:06:27 | Re: Decreasing WAL size effects |