Re: Hostnames, IDNs, Punycode and Unicode Case Folding

From: Andrew Sullivan <ajs(at)crankycanuck(dot)ca>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: Hostnames, IDNs, Punycode and Unicode Case Folding
Date: 2014-12-30 00:25:59
Message-ID: 20141230002559.GH54847@crankycanuck.ca
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Tue, Dec 30, 2014 at 12:18:58AM +0000, Mike Cardwell wrote:
>
> This is exactly the same method that we commonly use for performing case
> insensitive text searches using lower() indexes.

Hmm. How did you get the original, then? If you have the original
Unicode version, why don't you switch to IDNA2008 publication rules,
which are way more reliable? In that case, you do have a 1:1 lookup
and you shouldn't have a problem.

If you need variants, then you have a different problem, but that
actually can be specified for the much narrower range of UTF-8
permissible under IDNA2008.

A

--
Andrew Sullivan
ajs(at)crankycanuck(dot)ca

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Mike Cardwell 2014-12-30 00:26:11 Re: Hostnames, IDNs, Punycode and Unicode Case Folding
Previous Message Andrew Sullivan 2014-12-30 00:22:21 Re: Hostnames, IDNs, Punycode and Unicode Case Folding