Re: once again, sorting with Unicode

From:	Antti Haapala <antti(dot)haapala(at)iki(dot)fi>
To:	"Troy K(dot)" <tjk(at)tksoft(dot)com>, <tjk(at)tksoft(dot)com>
Cc:	JBJ <postgre(at)totw(dot)org>, <pgsql-sql(at)postgresql(dot)org>
Subject:	Re: once again, sorting with Unicode
Date:	2003-02-19 12:37:27
Message-ID:	Pine.GSO.4.44.0302191413590.21258-100000@paju.oulu.fi
Lists:	pgsql-sql

On Wed, 19 Feb 2003, Troy wrote:

> > I have a multi-lingual database (currently 11 languages) which sorts
> > fine in MySQL (8859-1 character set) I have now converted the data to
> > Unicode and compiled Postgre with unicode support.
> >
> > I can select and insert unicode and so was rather pleased about that.
> > Until I saw that it wasn't working properly when ordering!
>
> The cause for the different values is the fact that unicode characters
> have different numeric values from ISO8859-1 and other encodings. Only
> ascii values are in sync with unicode numeric values. This I am sure you
> knew.

No, ISO 8859-1 maps directly onto Unicode code points U+0000 through
U+00FF, so the actual _numeric_ values are the same. The byte patterns,
however, depend on the encoding.
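
To illustrate the point above, here is a minimal Python sketch (not part of the original message; the sample character is an arbitrary choice). It shows that a character in the ISO 8859-1 range keeps the same numeric value in Unicode, while its bytes differ between ISO 8859-1 and UTF-8.

```python
# Illustrative only: compare the code point and the encoded bytes of one character.
ch = "\u00e4"  # LATIN SMALL LETTER A WITH DIAERESIS

code_point = ord(ch)                     # Unicode code point of the character
latin1_bytes = ch.encode("iso-8859-1")   # single byte, numerically equal to the code point
utf8_bytes = ch.encode("utf-8")          # two bytes in UTF-8

print(hex(code_point))   # 0xe4
print(latin1_bytes)      # b'\xe4'
print(utf8_bytes)        # b'\xc3\xa4'
```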

Have you set the database encoding to UTF-8? Are you using proper UTF-8
locales? Compiled POSIX locales are often charset dependent.
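
As a rough sketch of why the locale matters for ordering (again not from the original message), the Python snippet below compares a plain code point sort with a sort that uses the collation rules of a named locale. The locale name `de_DE.UTF-8`, the helper `sort_with_locale`, and the word list are assumptions chosen for illustration; the locale may not be installed on a given system, in which case the helper returns `None`.

```python
import locale

# Hypothetical sample data for illustration.
words = ["zebra", "ärger", "apfel"]

# A plain sort compares code points, so "ä" (U+00E4) lands after "z" (U+007A).
by_code_point = sorted(words)

def sort_with_locale(items, locale_name):
    """Sort items using the collation of locale_name; return None if it is unavailable."""
    previous = locale.setlocale(locale.LC_COLLATE)  # query the current setting
    try:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    except locale.Error:
        return None  # the locale is not installed on this system
    try:
        return sorted(items, key=locale.strxfrm)
    finally:
        locale.setlocale(locale.LC_COLLATE, previous)  # restore the earlier setting

print(by_code_point)                            # ['apfel', 'zebra', 'ärger']
print(sort_with_locale(words, "de_DE.UTF-8"))   # depends on the installed locales
```

The locale-aware result depends on which locales the operating system provides and on the charset they were compiled for, which is the dependency the questions above are probing.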

--
Antti Haapala

In response to

Re: once again, sorting with Unicode at 2003-02-19 10:49:28 from Troy

Responses

Re: once again, sorting with Unicode at 2003-02-20 10:51:28 from Troy
