Re: Unicode full case mapping: PG_UNICODE_FAST, and standard-compliant UCS_BASIC

From: Noah Misch <noah(at)leadboat(dot)com>
To: Jeff Davis <pgsql(at)j-davis(dot)com>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: Unicode full case mapping: PG_UNICODE_FAST, and standard-compliant UCS_BASIC
Date: 2025-04-20 12:53:22
Message-ID: 20250420125322.0b.nmisch@google.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Sat, Apr 19, 2025 at 12:30:57PM -0700, Jeff Davis wrote:
> On Thu, 2025-04-17 at 06:58 -0700, Noah Misch wrote:
> > Should initcap_wbnext() pass in a locale-dependent "bool posix"
> > argument like
> > the others calls the commit changed?
>
> Yes, I believe you are correct. Patch and tests attached.

That patch is ready for commit.

> > Long-term, pg_u_isword() should have a "bool posix" argument. 
> > Currently, only
> > tests call that function.  If it got a non-test caller,
> > https://www.unicode.org/reports/tr18/#word would have pg_u_isword()
> > follow the
> > choice of posix compatibility like pg_u_isalnum() does.
>
> I based those functions on:
>
> https://www.unicode.org/reports/tr18/#Compatibility_Properties
>
> and the "word" class does not have a POSIX variant.

I missed that distinction. I withdraw this part.

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Alexander Lakhin 2025-04-20 15:00:00 Re: AIO v2.5
Previous Message Guillaume Lelarge 2025-04-20 07:37:27 Re: [BUG] temporary file usage report with extended protocol and unnamed portals