Quick Links

Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp>
Cc:	parker(dot)han(at)outlook(dot)com, pgsql-general(at)postgresql(dot)org
Subject:	Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?
Date:	2020-10-06 00:58:42
Message-ID:	1831165.1601945922@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp> writes:
> One of ideas to avoid the concern could be "shifting" GB18030 code
> points into "ASCII safe" code range with some calculations so that
> backend can handle them without worrying about the concern above. This
> way, we could avoid a table lookup overhead which is necessary in
> conversion between GB18030 and UTF8 and so on.

Hmm ... interesting idea, basically invent our own modified version
of GB18030 (or SJIS?) for backend-internal storage. But I'm not
sure how to make it work without enlarging the string, which'd defeat
the OP's argument. It looks to me like the second-byte code space is
already pretty full in both encodings.

regards, tom lane

In response to

Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered? at 2020-10-06 00:04:10 from Tatsuo Ishii

Responses

Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered? at 2020-10-06 02:15:35 from Tatsuo Ishii

Browse pgsql-general by date

	From	Date	Subject
Next Message	Han Parker	2020-10-06 01:46:25	回复: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?
Previous Message	Tatsuo Ishii	2020-10-06 00:04:10	Re: 回复: May "PostgreSQL server side GB18030 character set support" reconsidered?