| From: | Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp> | 
|---|---|
| To: | pgman(at)candle(dot)pha(dot)pa(dot)us | 
| Cc: | tgl(at)sss(dot)pgh(dot)pa(dot)us, dpage(at)vale-housing(dot)co(dot)uk, oliver(at)opencloud(dot)com, zakkr(at)zf(dot)jcu(dot)cz, pgsql-hackers(at)postgresql(dot)org | 
| Subject: | Re: UTF8 or Unicode | 
| Date: | 2005-02-23 02:44:29 | 
| Message-ID: | 20050223.114429.71086134.t-ishii@sra.co.jp | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-hackers pgsql-patches | 
I do not object the changing UNICODE->UTF-8, but all these discussions
sound a little bit funny to me.
If you want to blame UNICODE, you should blame LATIN1 etc. as
well. LATIN1(ISO-8859-1) is actually a character set name, not an
encoding name. ISO-8859-1 can be encoded in 8-bit single byte
stream. But it can be encoded in 7-bit too. So when we refer to
LATIN1(ISO-8859-1), it's not clear if it's encoded in 7/8-bit.
--
Tatsuo Ishii
From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Subject: Re: [HACKERS] UTF8 or Unicode
Date: Mon, 21 Feb 2005 22:08:25 -0500 (EST)
Message-ID: <200502220308(dot)j1M38PV03238(at)candle(dot)pha(dot)pa(dot)us>
> Tom Lane wrote:
> > Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> > > I think we just need to _favor_ UTF8.
> > 
> > I agree.
> > 
> > > The question is where are we
> > > favoring Unicode rather than UTF8?
> > 
> > It's the canonical name of the encoding, both in the code and the docs.
> > 
> > regression=# create database e encoding 'utf-8';
> > CREATE DATABASE
> > regression=# \l
> >          List of databases
> >     Name    |  Owner   | Encoding  
> > ------------+----------+-----------
> >  e          | postgres | UNICODE
> >  regression | postgres | SQL_ASCII
> >  template0  | postgres | SQL_ASCII
> >  template1  | postgres | SQL_ASCII
> > (5 rows)
> > 
> > As soon as we decide whether the canonical name is "UTF8" or "UTF-8"
> > ;-) we can fix it.
> 
> I checked and it looks like "UTF-8" is the correct usage:
> 
> 	http://www.unicode.org/glossary/
> 
> -- 
>   Bruce Momjian                        |  http://candle.pha.pa.us
>   pgman(at)candle(dot)pha(dot)pa(dot)us               |  (610) 359-1001
>   +  If your life is a hard drive,     |  13 Roberts Road
>   +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073
> 
> ---------------------------(end of broadcast)---------------------------
> TIP 6: Have you searched our list archives?
> 
>                http://archives.postgresql.org
> 
| From | Date | Subject | |
|---|---|---|---|
| Next Message | anupam.chaudhury | 2005-02-23 04:21:45 | Postgres on VXworks+SH4 | 
| Previous Message | Neil Conway | 2005-02-22 23:02:22 | Re: left-deep plans? | 
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Neil Conway | 2005-02-23 07:14:27 | optimize md5_text | 
| Previous Message | Nicolai Tufar | 2005-02-22 21:39:54 | Re: Repleacement for src/port/snprintf.c |