| From: | Abhijit Menon-Sen <ams(at)oryx(dot)com> |
|---|---|
| To: | Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> |
| Cc: | PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org> |
| Subject: | Re: UTF8 or Unicode |
| Date: | 2005-02-15 02:27:32 |
| Message-ID: | 20050215022732.GB24807@penne.toroid.org |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
At 2005-02-14 21:14:54 -0500, pgman(at)candle(dot)pha(dot)pa(dot)us wrote:
>
> Should our multi-byte encoding be referred to as UTF8 or Unicode?
The *encoding* should certainly be referred to as UTF-8. Unicode is a
character set, not an encoding; Unicode characters may be encoded with
UTF-8, among other things.
(One might think of a charset as being a set of integers representing
characters, and an encoding as specifying how those integers may be
converted to bytes.)
> I know UTF8 is a type of unicode but do we need to rename anything
> from Unicode to UTF8?
I don't know. I'll go through the documentation to see if I can find
anything that needs changing.
-- ams
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Joshua D. Drake | 2005-02-15 02:56:49 | Re: 8.0.X and the ARC patent |
| Previous Message | pgsql | 2005-02-15 02:21:01 | Re: 8.0.X and the ARC patent |