Re: Tsearch2 and Unicode?

From: dom(at)happygiraffe(dot)net (Dominic Mitchell)
To: Dawid Kuroczko <qnex42(at)gmail(dot)com>
Cc: Pgsql General <pgsql-general(at)postgresql(dot)org>
Subject: Re: Tsearch2 and Unicode?
Date: 2004-11-18 13:34:23
Message-ID: 20041118133423.GA54846@ppe.happygiraffe.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, Nov 18, 2004 at 12:17:01PM +0100, Dawid Kuroczko wrote:
> Correct me if I am wrong, but I think that UTF-8 is almost identical
> to ISO-8859-1 in binary form to ISO-8859-1. I mean, UTF-8 is
> ISO-8859-1 plus multibyte characters from other charsets.

No, UTF-8 and ISO-8859-1 are different from characters 128 and up. Even
though code point 160 represents "non breaking space" in both, they are
represented differently in terms of bytes. One is 0xA0 and the other is
0xC2 0xA0.

-Dom

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Richard_D_Levine 2004-11-18 13:44:36 Re: How to know a record has been updated, then reset the flag?
Previous Message Dawid Kuroczko 2004-11-18 11:17:01 Re: Tsearch2 and Unicode?