Re: once again, sorting with Unicode

From: Antti Haapala <antti(dot)haapala(at)iki(dot)fi>
To: "Troy K(dot)" <tjk(at)tksoft(dot)com>, <tjk(at)tksoft(dot)com>
Cc: JBJ <postgre(at)totw(dot)org>, <pgsql-sql(at)postgresql(dot)org>
Subject: Re: once again, sorting with Unicode
Date: 2003-02-19 12:37:27
Message-ID: Pine.GSO.4.44.0302191413590.21258-100000@paju.oulu.fi
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-sql


On Wed, 19 Feb 2003, Troy wrote:

> > I have a multi-lingual database (currently 11 languages) which sorts
> > fine in MySQL (8859-1 character set) I have now converted the data to
> > Unicode and compiled Postgre with unicode support.
> >
> > I can select and insert unicode and so was rather pleased about that.
> > Until I saw that it wasn't working properly when ordering!
>
> The cause for the different values is the fact that unicode characters
> have different numeric values from ISO8859-1 and other encodings. Only
> ascii values are in sync with unicode numeric values. This I am sure you
> knew.

No, ISO8859-1 maps directly to unicode up to U+00FF. So the actual
_numeric_ values are the same. But actual byte patterns are encoding
dependent.

Have you set database encoding to UTF-8? Are you using proper UTF-8
locales? POSIX compiled locales are often charset dependent.

--
Antti Haapala

In response to

Responses

Browse pgsql-sql by date

  From Date Subject
Next Message Richard Huxton 2003-02-19 12:49:03 Re: select from update from select?
Previous Message Troy 2003-02-19 10:49:28 Re: once again, sorting with Unicode