Re: Unicode upper() bug still present

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Oliver Elphick <olly(at)lfix(dot)co(dot)uk>
Cc: PostgreSQL hackers list <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Unicode upper() bug still present
Date: 2003-10-20 00:35:50
Message-ID: 4272.1066610150@sss.pgh.pa.us
Lists: pgsql-hackers

Oliver Elphick <olly(at)lfix(dot)co(dot)uk> writes:
> There is a bug in Unicode upper() which has been present since 7.2:

We don't support upper/lower in multibyte character sets, and can't as
long as the functionality is dependent on <ctype.h>'s toupper()/tolower().
It's been suggested that we could use <wctype.h> where available.
However, there are a bunch of issues that would have to be solved to make
that happen. (How do we convert between the database character encoding
and the wctype representation? How do we even find out what
representation the current locale setting expects to use?)
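
To make the limitation concrete, here is a minimal C sketch; it is not PostgreSQL source. The names upper_bytewise and upper_wide are hypothetical, as is the fixed 256-element buffer. The first function uppercases one byte at a time with toupper(), which cannot recognize a multibyte UTF-8 character. The second shows roughly what a <wctype.h> path could look like. It assumes that the process locale set through setlocale() uses the same encoding as the input string, which is exactly the open question raised above.

```c
#include <assert.h>
#include <ctype.h>
#include <stdlib.h>
#include <string.h>
#include <wchar.h>
#include <wctype.h>

/* Byte-at-a-time uppercasing with <ctype.h>. Each byte is treated as a
 * complete character, so the bytes of a multibyte UTF-8 sequence are never
 * seen as one letter. */
void upper_bytewise(char *s)
{
    assert(s != NULL);
    for (; *s != '\0'; s++)
        *s = (char) toupper((unsigned char) *s);
}

/* Hypothetical wide-character path: multibyte -> wchar_t -> towupper() ->
 * multibyte. ASSUMPTION: the current LC_CTYPE locale decodes src correctly.
 * Returns 0 on success, -1 if conversion fails or a buffer is too small. */
int upper_wide(const char *src, char *dst, size_t dstlen)
{
    wchar_t wbuf[256];
    size_t  n, m, i;

    assert(src != NULL && dst != NULL);

    n = mbstowcs(wbuf, src, 256);
    if (n == (size_t) -1 || n >= 256)
        return -1;              /* invalid sequence or input too long */

    for (i = 0; i < n; i++)
        wbuf[i] = (wchar_t) towupper((wint_t) wbuf[i]);

    m = wcstombs(dst, wbuf, dstlen);
    if (m == (size_t) -1 || m >= dstlen)
        return -1;              /* unrepresentable or output too small */
    return 0;
}
```

In the default "C" locale, upper_bytewise leaves the two bytes of a UTF-8 "é" (0xC3 0xA9) unchanged while still uppercasing plain ASCII. Whether upper_wide handles such input depends entirely on the locale in effect when it is called.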

In short, don't hold your breath ...

regards, tom lane
