Concerning about Unicode-aware string handling

From: Vincas Dargis <vindrg(at)gmail(dot)com>
To: pgsql-general(at)postgresql(dot)org
Subject: Concerning about Unicode-aware string handling
Date: 2012-05-21 09:09:42
Message-ID: CAPNCXk0q_hUN_gsKHNCYNWiJ=d+6RaStNknf-yfG8kvU9m196A@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Hello,

We have problems (currently using 8.4, but also in latest 9.1.3) in
our application with Unicode word symbols in Lithuanian ('ąčęėįšųūž'),
Russian and of course potentially other languages.

For example, regex_replace('acząčž', E'\\W', '', 'g') removes ąčž.

lower() and ~* comparison works only with locale that is set (no
internationalization).

Could we expect Unciode support in near future? Or should we do quick
hacks by reimplementing regexp_replace(), lower(), upper() and other
string SQL functions using, for example, Qt libraries..? Or maybe are
there some kind simpler workarounds?

Thank you,
Vincas.

Responses

Browse pgsql-general by date

  From Date Subject
Next Message John R Pierce 2012-05-21 09:44:45 Re: Concerning about Unicode-aware string handling
Previous Message Craig Ringer 2012-05-21 08:28:09 Re: Postgresql segmentation fault at slot_deform_tuple