Quick Links

Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?

From:	Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp>
To:	parker(dot)han(at)outlook(dot)com
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?
Date:	2020-10-05 12:17:48
Message-ID:	20201005.211748.674586579649892338.t-ishii@sraoss.co.jp
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

>> 1. In this big data and mobile era, in the country with most population, 50% more disk energy consuming for Chinese characters (UTF-8 usually 3 bytes for a Chinese character, while GB180830 only 2 bytes) is indeed a harm to "Carbon Neutral", along with Polar ice melting.
>
> Really? I thought GB18030 uses up to 4 bytes.
> https://en.wikipedia.org/wiki/GB_18030#Encoding
>
> --Parker:
> More preciously description should be GB18030 use 2 or 4 bytes for Chinese characters.
> It's a bit complicated to explain with only words but easy with help of the following graph.
>
> Most frequently used 20902 Chinese characters and 984 symbols in GBK is encoded with 2 bytes, which is a subset of GB18030.

It does not sound fair argument unless you are going to implement only
GBK compatible part of GB18030.

Best regards,
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese:http://www.sraoss.co.jp

In response to

回复: May "PostgreSQL server side GB18030 character set support" reconsidered? at 2020-10-05 10:08:28 from Han Parker

Browse pgsql-general by date

	From	Date	Subject
Next Message	James B. Byrne	2020-10-05 13:18:30	Re: UUID generation problem
Previous Message	Mario Emmenlauer	2020-10-05 11:22:18	dup(0) fails on Ubuntu 20.04 and macOS 10.15 with 13.0