From: | Nicolas Barbier <nicolas(dot)barbier(at)gmail(dot)com> |
---|---|
To: | Martijn van Oosterhout <kleptog(at)svana(dot)org> |
Cc: | Itagaki Takahiro <itagaki(dot)takahiro(at)gmail(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Robert Haas <robertmhaas(at)gmail(dot)com>, Dimitri Fontaine <dimitri(at)2ndquadrant(dot)fr>, PostgreSQL-Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Extensions, patch v20 (bitrot fixes) |
Date: | 2010-12-20 21:15:56 |
Message-ID: | AANLkTinZVgKk+2Eeq5OTn8u1UajY8tjdytFJziDAstiC@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
2010/12/20 Martijn van Oosterhout <kleptog(at)svana(dot)org>:
> On Mon, Dec 20, 2010 at 09:03:56AM +0900, Itagaki Takahiro wrote:
>
>> UTF-8 is not a superset of all encodings.
>
> I think you mean Unicode is not a superset of all character sets. I've
> heard this before but never found what's missing. [citation needed]?
>From <URL:http://en.wikipedia.org/wiki/Japanese_language_and_computers#Character_encodings>:
"Unicode is supposed to solve all encoding problems in all languages
of the world. [..] There are still controversies. For Japanese, the
kanji characters have been unified with Chinese, that is a character
considered to be the same in both Japanese and Chinese have been given
one and the same code number in Unicode, even if they look a little
different. This process, called Han unification, has caused
controversy."
For examples (my browser doesn't show any differences though, probably
because I don't have the corresponding fonts):
<URL:http://en.wikipedia.org/wiki/Han_unification#Examples_of_language_dependent_characters>
Nicolas
From | Date | Subject | |
---|---|---|---|
Next Message | Dimitri Fontaine | 2010-12-20 21:35:44 | Extensions, patch 22 (cleanup, review, cleanup) |
Previous Message | Robert Haas | 2010-12-20 20:40:27 | Re: bug in SignalSomeChildren |