Re: Pre-proposal: unicode normalized text

From: Robert Haas <robertmhaas(at)gmail(dot)com>
To: Isaac Morland <isaac(dot)morland(at)gmail(dot)com>
Cc: Daniel Verite <daniel(at)manitou-mail(dot)org>, Jeff Davis <pgsql(at)j-davis(dot)com>, Peter Eisentraut <peter(at)eisentraut(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Pre-proposal: unicode normalized text
Date: 2023-10-17 15:43:18
Message-ID: CA+TgmobgMg8x7xjq42_tiBT_Yxr+SX5qkp6MGZ3OtvdJ92b9jw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Tue, Oct 17, 2023 at 11:38 AM Isaac Morland <isaac(dot)morland(at)gmail(dot)com> wrote:
> On Tue, 17 Oct 2023 at 11:15, Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
>> Are code points assigned from a gapless sequence? That is, is the
>> implementation of codepoint_is_assigned(char) just 'codepoint <
>> SOME_VALUE' and SOME_VALUE increases over time?
>
> Not even close. Code points are organized in blocks, e.g. for mathematical symbols or Ethiopic script. Sometimes new blocks are added, sometimes new characters are added to existing blocks. Where they go is a combination of convenience, history, and planning.

Ah. Good to know.

--
Robert Haas
EDB: http://www.enterprisedb.com

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Nathan Bossart 2023-10-17 15:46:47 Re: stopgap fix for signal handling during restore_command
Previous Message Robert Haas 2023-10-17 15:42:27 Re: run pgindent on a regular basis / scripted manner