Quick Links

Re: [HACKERS] Unicode combining characters

From:	Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
To:	phede-ml(at)islande(dot)org
Cc:	pgsql-patches(at)postgresql(dot)org
Subject:	Re: [HACKERS] Unicode combining characters
Date:	2001-10-09 14:16:56
Message-ID:	20011009231656N.t-ishii@sra.co.jp
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers pgsql-patches

> - corrects a bit the UTF-8 code from Tatsuo to allow Unicode 3.1
> characters (characters with values >= 0x10000, which are encoded on
> four bytes).

After applying your patches, do the 4-bytes UTF-8 convert to UCS-2 (2
bytes) or UCS-4 (4 bytes) in pg_utf2wchar_with_len()? If it were 4
bytes, we are in trouble. Current regex implementaion does not handle
4 byte width charsets.
--
Tatsuo Ishii

In response to

Re: [HACKERS] Unicode combining characters at 2001-10-08 19:35:44 from Patrice Hédé

Responses

Re: [HACKERS] Unicode combining characters at 2001-10-09 17:07:38 from Patrice Hédé

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Patrice Hédé	2001-10-09 17:07:38	Re: [HACKERS] Unicode combining characters
Previous Message	Haller Christoph	2001-10-09 11:01:26	Re: System usage and statistics

Browse pgsql-patches by date

	From	Date	Subject
Next Message	Bruce Momjian	2001-10-09 15:37:19	Re: updated patch for Chinese NLS support (simplified)
Previous Message	Tom Lane	2001-10-09 14:01:18	Re: typo or C&P error