From: | "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com> |
---|---|
To: | "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su>, "Teodor Sigaev" <teodor(at)sigaev(dot)ru>, "Heikki Linnakangas" <heikki(at)enterprisedb(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org> |
Subject: | integrated tsearch doesn't work with non utf8 database |
Date: | 2007-09-07 19:13:52 |
Message-ID: | 162867790709071213o898ce7av333a570bd9004f27@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Hello
last time I checked utf8 database. Now I checked latin2 encoding
database. I used dictionaries from last test.
client_encoding | utf8
lc_collate | cs_CZ.iso-8859-2
lc_ctype | cs_CZ.iso-8859-2
lc_messages | cs_CZ.iso-8859-2
lc_monetary | cs_CZ.iso-8859-2
lc_numeric | cs_CZ.iso-8859-2
lc_time | cs_CZ.iso-8859-2
postgres=# create Text search dictionary cspell2(template=ispell,
afffile=czech, dictfile=czech);
CREATE TEXT SEARCH DICTIONARY
postgres=# alter text search configuration cs alter mapping for word,
lword with cspell2, simple;
ALTER TEXT SEARCH CONFIGURATION
postgres=# select ts_debug('cs','Příliš žluťoučký kůň se napil žluté vody');
ERROR: character 0xc3a5 of encoding "UTF8" has no equivalent in "LATIN2"
CONTEXT: SQL function "ts_debug" statement 1
postgres=#
database was initialised
/usr/local/pgsql/bin/initdb --encoding=latin2
--locale=cs_CZ.iso-8859-2 -D /usr/local/pgsql/data/
locales is correct
postgres=# set client_encoding to utf8;
SET
postgres=# select upper('Příliš žluťoučký kůň se napil žluté vody');
upper
------------------------------------------
PŘÍLIŠ ŽLUŤOUČKÝ KŮŇ SE NAPIL ŽLUTÉ VODY
(1 row)
Regards
Pavel Stehule
From | Date | Subject | |
---|---|---|---|
Next Message | Florian G. Pflug | 2007-09-07 19:25:48 | Re: Low hanging fruit in lazy-XID-assignment patch? |
Previous Message | Tom Lane | 2007-09-07 18:35:32 | Re: Sorting the Stop word lists |