From: | Przemysław Sztoch <przemyslaw(at)sztoch(dot)pl> |
---|---|
To: | Michael Paquier <michael(at)paquier(dot)xyz> |
Cc: | Thomas Munro <thomas(dot)munro(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Peter Eisentraut <peter(dot)eisentraut(at)enterprisedb(dot)com> |
Subject: | Re: [PATCH] Completed unaccent dictionary with many missing characters |
Date: | 2022-06-23 12:10:42 |
Message-ID: | 9bb4b652-d830-7760-7ce4-17952fb7f023@sztoch.pl |
Lists: | pgsql-hackers |
Michael Paquier wrote on 23.06.2022 06:39:
>>> That'd leave just DEGREE CELSIUS and DEGREE FAHRENHEIT. Not sure how
>>> to kill those last two special cases -- they should be directly
>>> replaced by their decomposition.
>>>
>>> [1] https://unicode-org.atlassian.net/browse/CLDR-11383
>> In patch v3, support for Cyrillic is added.
>> The special-character function has been purged.
>> Added support for the category So (Other Symbol). This category includes
>> the characters from special_cases().
> I think that we'd better split v3 into more patches to keep each
> improvement isolated. The addition of cyrillic characters in the
> range of letters and the removal of the sound copyright from the
> special cases can be done on their own, before considering the
> original case tackled by this thread.
> --
> Michael
The only split that seems feasible is the one attached.
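
For reference, here is a minimal sketch of the idea quoted above, that DEGREE CELSIUS and DEGREE FAHRENHEIT could be replaced directly by their decomposition. It uses only Python's standard `unicodedata` module; the helper name `compat_decomposition` is hypothetical and is not taken from the attached patches or from the existing rule generator.

```python
import unicodedata


def compat_decomposition(ch):
    """Return the decomposition of ch as a string, or ch itself if it has none.

    Hypothetical helper for illustration only.
    """
    fields = unicodedata.decomposition(ch).split()
    # Drop a leading formatting tag such as "<compat>" and keep the code points.
    codepoints = [f for f in fields if not f.startswith("<")]
    if not codepoints:
        return ch
    return "".join(chr(int(cp, 16)) for cp in codepoints)


# U+2103 DEGREE CELSIUS and U+2109 DEGREE FAHRENHEIT are both in category So.
for ch in ("\u2103", "\u2109"):
    print(unicodedata.name(ch), unicodedata.category(ch),
          ascii(compat_decomposition(ch)))
```

Both characters decompose to DEGREE SIGN followed by a plain letter, so a rule generator that accepts category So would still need to decide what to do with the leading degree sign.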
--
Przemysław Sztoch | Mobile +48 509 99 00 66
Attachment | Content-Type | Size |
---|---|---|
unaccent-v4-cirilic-digits-dict.patch | text/plain | 16.5 KB |
unaccent-v4-more-char-category.patch | text/plain | 4.4 KB |