Re: Encoding Issue with UNICODE

From: Stephane Bortzmeyer <bortzmeyer(at)nic(dot)fr>
To: Fritz Bayer <fritz-bayer(at)web(dot)de>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Encoding Issue with UNICODE
Date: 2005-04-14 07:56:28
Message-ID: 20050414075628.GA26171@nic.fr
Lists: pgsql-general

On Tue, Apr 12, 2005 at 03:39:45AM -0700,
Fritz Bayer <fritz-bayer(at)web(dot)de> wrote
a message of 53 lines which said:

> I mean unicode itself is 16 bit long.

This is completely false. Unicode itself is just a table that assigns
a number (a code point) to each character and, since it contains more
than 100,000 characters, you cannot index them all with 16 bits, which
give only 65,536 distinct values.

Unicode has various encodings, some fixed-size, like UTF-32, some
variable-size, like UTF-8 and UTF-16.
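
As a rough illustration (a Python 3 sketch, not part of the original
exchange; the sample characters and the helper name encoded_sizes are
picked here for the example), code points can exceed 16 bits, and only
UTF-32 spends the same number of bytes on each of them:

```python
# Sample characters chosen for illustration only:
# "a" (U+0061), "ü" (U+00FC), "€" (U+20AC), musical G clef (U+1D11E).
samples = ["a", "\u00fc", "\u20ac", "\U0001d11e"]

def encoded_sizes(ch):
    """Return the byte length of one character in UTF-8, UTF-16 and UTF-32.

    The "-be" codec variants are used so that no byte order mark is counted.
    """
    return tuple(len(ch.encode(enc)) for enc in ("utf-8", "utf-16-be", "utf-32-be"))

for ch in samples:
    # Print the code point in hex, then the (UTF-8, UTF-16, UTF-32) sizes.
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))

# The last sample has a code point that does not fit in 16 bits.
print(ord("\U0001d11e") > 0xFFFF)
```

Only the UTF-32 column stays constant across the samples; the UTF-8 and
UTF-16 columns vary with the character.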

> So "münchen" should expand to 14 characters. But only ü expands to
> two characters.

Perfectly normal with UTF-8, where the size of a Unicode character
is not fixed.
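
To make this concrete, here is a small Python 3 sketch (added for
illustration; it assumes the ü is stored as the single precomposed
character U+00FC, and utf8_lengths is a hypothetical helper name):

```python
word = "m\u00fcnchen"  # "münchen" with a precomposed ü (U+00FC)

def utf8_lengths(text):
    """Return the UTF-8 byte length of each character in text."""
    return [len(ch.encode("utf-8")) for ch in text]

print(len(word))                      # number of characters
print(utf8_lengths(word))             # bytes per character in UTF-8
print(len(word.encode("utf-8")))      # total bytes in UTF-8
print(len(word.encode("utf-16-le")))  # total bytes in UTF-16, no byte order mark
```

The word has 7 characters and takes 8 bytes in UTF-8, because only ü
needs two bytes. The figure of 14 corresponds to UTF-16, where each of
these seven characters takes two bytes.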
