From: | Antti Haapala <antti(dot)haapala(at)iki(dot)fi> |
---|---|
To: | "Troy K(dot)" <tjk(at)tksoft(dot)com>, <tjk(at)tksoft(dot)com> |
Cc: | JBJ <postgre(at)totw(dot)org>, <pgsql-sql(at)postgresql(dot)org> |
Subject: | Re: once again, sorting with Unicode |
Date: | 2003-02-19 12:37:27 |
Message-ID: | Pine.GSO.4.44.0302191413590.21258-100000@paju.oulu.fi |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-sql |
On Wed, 19 Feb 2003, Troy wrote:
> > I have a multi-lingual database (currently 11 languages) which sorts
> > fine in MySQL (8859-1 character set) I have now converted the data to
> > Unicode and compiled Postgre with unicode support.
> >
> > I can select and insert unicode and so was rather pleased about that.
> > Until I saw that it wasn't working properly when ordering!
>
> The cause for the different values is the fact that unicode characters
> have different numeric values from ISO8859-1 and other encodings. Only
> ascii values are in sync with unicode numeric values. This I am sure you
> knew.
No, ISO8859-1 maps directly to unicode up to U+00FF. So the actual
_numeric_ values are the same. But actual byte patterns are encoding
dependent.
Have you set database encoding to UTF-8? Are you using proper UTF-8
locales? POSIX compiled locales are often charset dependent.
--
Antti Haapala
From | Date | Subject | |
---|---|---|---|
Next Message | Richard Huxton | 2003-02-19 12:49:03 | Re: select from update from select? |
Previous Message | Troy | 2003-02-19 10:49:28 | Re: once again, sorting with Unicode |