pgsql: Treat Unicode codepoints of category "Format" as non-spacing

From: John Naylor <john(dot)naylor(at)postgresql(dot)org>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Treat Unicode codepoints of category "Format" as non-spacing
Date: 2022-09-13 09:22:09
Message-ID: E1oY27N-0008aG-AS@gemulon.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Treat Unicode codepoints of category "Format" as non-spacing

Commit d8594d123 updated the list of non-spacing codepoints used
for calculating display width, but in doing so inadvertently removed
some, since the script used for that commit only considered combining
characters.

For complete coverage for zero-width characters, include codepoints in
the category Cf (Format). To reflect the wider purpose, also rename files
and update comments that referred specifically to combining characters.

Some of these ranges have been missing since v12, but due to lack of
field complaints it was determined not important enough to justify adding
special-case logic the backbranches.

Kyotaro Horiguchi

Report by Pavel Stehule
Discussion: https://www.postgresql.org/message-id/flat/CAFj8pRBE8yvpQ0FSkPCoe0Ny1jAAsAQ6j3qMgVwWvkqAoaaNmQ%40mail.gmail.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/0bd9c629732375e21d3ca6fba16c4a6a2808411a

Modified Files
--------------
src/common/unicode/Makefile | 4 +--
...ble.pl => generate-unicode_nonspacing_table.pl} | 12 ++++----
src/common/wchar.c | 8 +++---
...ombining_table.h => unicode_nonspacing_table.h} | 33 ++++++++++++++--------
4 files changed, 34 insertions(+), 23 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message John Naylor 2022-09-13 09:25:18 Re: pgsql: Fix perltidy breaking perlcritic
Previous Message Andres Freund 2022-09-13 07:51:17 Re: pgsql: aix: No need to use mkldexport when we want to export all symbol