Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp>
Cc: parker(dot)han(at)outlook(dot)com, pgsql-general(at)postgresql(dot)org
Subject: Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?
Date: 2020-10-06 00:58:42
Message-ID: 1831165.1601945922@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp> writes:
> One of ideas to avoid the concern could be "shifting" GB18030 code
> points into "ASCII safe" code range with some calculations so that
> backend can handle them without worrying about the concern above. This
> way, we could avoid a table lookup overhead which is necessary in
> conversion between GB18030 and UTF8 and so on.

Hmm ... interesting idea, basically invent our own modified version
of GB18030 (or SJIS?) for backend-internal storage. But I'm not
sure how to make it work without enlarging the string, which'd defeat
the OP's argument. It looks to me like the second-byte code space is
already pretty full in both encodings.

regards, tom lane

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Han Parker 2020-10-06 01:46:25 回复: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?
Previous Message Tatsuo Ishii 2020-10-06 00:04:10 Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?