From: | Teodor Sigaev <teodor(at)sigaev(dot)ru> |
---|---|
To: | Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | regular expressions stranges |
Date: | 2007-01-23 12:53:33 |
Message-ID: | 45B6054D.2060009@sigaev.ru |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Regexp works differently with no-ascii characters depending on server encoding
(bug.sql contains non-ascii char):
% initdb -E KOI8-R --locale ru_RU.KOI8-R
% psql postgres < bug.sql
true
------
t
(1 row)
true | true
------+------
t | t
(1 row)
% initdb -E UTF8 --locale ru_RU.UTF-8
% psql postgres < bug.sql
true
------
f
(1 row)
true | true
------+------
f | t
(1 row)
As I can see, that is because of using isalpha (and other is*), tolower &
toupper instead of isw* and tow* functions. Is any reason to use them? If not, I
can modify regc_locale.c similarly to tsearch2 locale part.
--
Teodor Sigaev E-mail: teodor(at)sigaev(dot)ru
WWW: http://www.sigaev.ru/
Attachment | Content-Type | Size |
---|---|---|
bug.sql | text/plain | 145 bytes |
From | Date | Subject | |
---|---|---|---|
Next Message | Heikki Linnakangas | 2007-01-23 13:18:55 | Re: Free space management within heap page |
Previous Message | Pavel Stehule | 2007-01-23 12:39:51 | 10 weeks to feature freeze (Pending Work) |