pgsql: Extend the default rules file for contrib/unaccent with Vietname

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: pgsql-committers(at)postgresql(dot)org
Subject: pgsql: Extend the default rules file for contrib/unaccent with Vietname
Date: 2017-08-16 20:52:01
Message-ID: E1di5IH-0002ii-Pa@gemulon.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Extend the default rules file for contrib/unaccent with Vietnamese letters.

Improve generate_unaccent_rules.py to handle composed characters whose base
is another composed character rather than a plain letter. The net effect
of this is to add a bunch of multi-accented Vietnamese characters to
unaccent.rules.

Original complaint from Kha Nguyen, diagnosis of the script's shortcoming
by Thomas Munro.

Dang Minh Huong and Michael Paquier

Discussion: https://postgr.es/m/CALo3sF6EC8cy1F2JUz=GRf5h4LMUJTaG3qpdoiLrNbWEXL-tRg@mail.gmail.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/ec0a69e49bf41a37b5c2d6f6be66d8abae00ee05

Modified Files
--------------
contrib/unaccent/generate_unaccent_rules.py | 39 ++++++++--
contrib/unaccent/unaccent.rules | 114 ++++++++++++++++++++++++++++
2 files changed, 145 insertions(+), 8 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Heikki Linnakangas 2017-08-16 21:49:32 pgsql: Fix pg_atomic_u64 initialization.
Previous Message Tom Lane 2017-08-16 19:37:31 pgsql: Make the planner assume that the entries in a VALUES list are di