Re: unicode match normal forms

From: Matthias Apitz <guru(at)unixarea(dot)de>
To: pgsql-general(at)lists(dot)postgresql(dot)org
Subject: Re: unicode match normal forms
Date: 2021-05-17 13:45:00
Message-ID: YKJzXK5X/NXN5h/Z@c720-r368166.fritz.box
Lists: pgsql-general

On Monday, May 17, 2021 at 01:27:40 PM -0000, hamann(dot)w(at)t-online(dot)de wrote:

> Hi,
>
> in unicode letter ä exists in two versions - linux and windows use a composite whereas macos prefers
> the decomposed form. Is there any way to make a semi-exact match that accepts both variants?
> This question is not about fulltext but about matching filenames across a network - I wish to avoid two equally-looking
> filenames.

There is only *one* codepoint for the German letter a Umlaut:
LATIN SMALL LETTER A WITH DIAERESIS, U+00E4
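
For reference, U+00E4 is the single precomposed codepoint. The decomposed
form mentioned in the question is the two-codepoint sequence U+0061 U+0308
(a followed by COMBINING DIAERESIS), which Unicode normalization form NFC
maps to U+00E4. A minimal sketch of comparing names regardless of form,
assuming Python and its standard unicodedata module; the helper name
same_name is hypothetical and only for illustration:

```python
import unicodedata

composed = "\u00e4"      # LATIN SMALL LETTER A WITH DIAERESIS (one codepoint)
decomposed = "a\u0308"   # "a" + COMBINING DIAERESIS (two codepoints)


def same_name(a: str, b: str) -> bool:
    """Hypothetical helper: compare two names after normalizing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# A plain comparison treats the two forms as different strings.
print(composed == decomposed)

# After normalization the two forms compare equal.
print(same_name(composed, decomposed))
```

Normalizing both sides to the same form (NFC here, NFD would work as well)
before comparing is what makes the match accept both variants.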

That said, I consider having such (non-ASCII) characters in file names a bad
idea.

matthias

--
Matthias Apitz, ✉ guru(at)unixarea(dot)de, http://www.unixarea.de/ +49-176-38902045
Public GnuPG key: http://www.unixarea.de/key.pub
¡Con Cuba no te metas! «» Don't mess with Cuba! «» Leg Dich nicht mit Kuba an!
http://www.cubadebate.cu/noticias/2020/12/25/en-video-con-cuba-no-te-metas/
