Quick Links

Re: FW: Character set equivalent for AL32UTF8

From:	"Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To:	"Mridul Mathew" <mridulmathew(at)gmail(dot)com>,<ringerc(at)ringerc(dot)id(dot)au>
Cc:	<rajeshwarbharathi(at)gmail(dot)com>,<pgsql-admin(at)postgresql(dot)org>
Subject:	Re: FW: Character set equivalent for AL32UTF8
Date:	2011-08-10 18:42:25
Message-ID:	4E428AC1020000250003FD63@gw.wicourts.gov
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-admin

Mridul Mathew <mridulmathew(at)gmail(dot)com> wrote:

> From: *Craig Ringer* <ringerc(at)ringerc(dot)id(dot)au>

>> A 30-second Google search turned up this:
>>
>>
http://decipherinfosys.wordpress.com/2007/01/28/difference-between-utf8-and-al32utf8-character-sets-in-oracle/

> If supplementary characters are inserted in a UTF8 database, they
> will be treated as 2 separate undefined characters, occupying 6
> bytes in storage. Oracle recommends using al32utf8 for any newly
> defined supplementary characters.
>
> Does PostgreSQL make a distinction within Unicode in a similar
> fashion?

It sounds as though Oracle initially failed to properly implement
the UTF-8 character encoding scheme, but rather than fix the broken
scheme they created an alternative. So far as I know, PostgreSQL
should be using proper UTF-8 encoding if you ask for it, without any
special gyrations.

-Kevin

In response to

Re: FW: Character set equivalent for AL32UTF8 at 2011-08-10 08:07:25 from Mridul Mathew

Browse pgsql-admin by date

	From	Date	Subject
Next Message	CS DBA	2011-08-10 19:08:21	initdb fails on AIX
Previous Message	Simon Riggs	2011-08-10 18:01:18	Re: Frequency of archive_cleanup_command