Re: Support regular expressions with nondeterministic collations

From: Peter Eisentraut <peter(at)eisentraut(dot)org>
To: pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Subject: Re: Support regular expressions with nondeterministic collations
Date: 2024-12-30 12:29:36
Message-ID: e7f9a516-5cd0-48dc-aaa3-435adf5ac478@eisentraut.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 29.10.24 09:47, Peter Eisentraut wrote:
>> I kind of wonder if we really want to do this.  It adds no
>> functionality, and it forecloses the possibility of changing
>> the definition later.  I understand and agree with your conclusion
>> that it's pretty much impossible to do what the SQL standard suggests
>> should happen --- but maybe we're both missing something that would
>> make it feasible.  (Have you asked your committee colleagues if
>> anyone's actually implemented what they wrote about SIMILAR TO?
>> If they've written something unimplementable, it seems like there
>> is work for them to do in any case.)
>
> Good idea; I'll go ask there too.

So the result from that was that no one there knew what to do either.
There was general interest in the various arguments and options, but
there was no consensus about what the right solution should be.

For everyone's amusement, attached is the discussion paper I submitted,
which contains some of my arguments from this thread as well as other
information and examples.

I think a way forward would be to define more special purpose collations
that are just "normal but case insensitive" or "normal but accent
insensitive", like was discussed later in this thread, and what other
implementations apparently also do (see BINARY_CI in the paper).

For now, I'm withdrawing this patch, but I (and I suspect others) will
keep thinking about this.

Attachment Content-Type Size
w28015-Pattern matching versus collations.pdf application/pdf 68.5 KB

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Nazir Bilal Yavuz 2024-12-30 12:39:18 Re: add support for the old naming libs convention on windows (ssleay32.lib and libeay32.lib)
Previous Message Jelte Fennema-Nio 2024-12-30 12:22:24 Re: IANA timezone abbreviations versus timezone_abbreviations