Re: BUG #13440: unaccent does not remove all diacritics

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>
Cc: Michael Gradek <mike(at)busbud(dot)com>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #13440: unaccent does not remove all diacritics
Date: 2015-06-15 12:55:33
Message-ID: 38161.1434372933@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Alvaro Herrera <alvherre(at)2ndquadrant(dot)com> writes:
> My terminal shows these characters to be different. One is
> http://graphemica.com/%C8%9B
> latin small letter t with comma below (U+021B)

> The other is
> http://graphemica.com/%C5%A3
> latin small letter t with cedilla (U+0163)

Ah-hah -- I did not look closely enough. So the immediate answer for
Michael is to add another entry to his unaccent.rules file.

Should we add the missing character to the standard unaccent.rules file?
I should think so in HEAD at least, but what about back-patching?

regards, tom lane

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Soule, Cathi (HQP) 2015-06-15 14:37:09 Re: BUG #13438: Restore using GUI client - Data Not Loading
Previous Message Michael Meskes 2015-06-15 12:34:21 Re: Lack of Sanity Checking in file 'misc.c' for PostgreSQL 9.4.x