| From: | Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com> | 
|---|---|
| To: | Tasos Maschalidis <tas(dot)o(dot)s(at)hotmail(dot)com> | 
| Cc: | PostgreSQL mailing lists <pgsql-bugs(at)lists(dot)postgresql(dot)org> | 
| Subject: | Re: BUG #15347: Unaccent for greek characters does not work | 
| Date: | 2018-08-23 23:22:41 | 
| Message-ID: | CAEepm=0F3pv9A3_pe=jQMCS9b-iUPjEQjzoftNJjN8FHwXHeKA@mail.gmail.com | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-bugs | 
On Fri, Aug 24, 2018 at 10:47 AM, Tasos Maschalidis <tas(dot)o(dot)s(at)hotmail(dot)com> wrote:
> The results are legit for all vowels.
Cool.
> There is only one thing missing which
> I guess does fall into unaccent functionality. When an "σ" is used as the
> last letter of any word, it changes to "s" grammatically, unless the whole
> word is capitals, then it stays the same ("Σ"), even at the end of the word.
> In searches it s useful to convert any "ς" to "σ". I had included it to a
> custom unaccent.rules file I was using and brought desired results. For
> example searching for "Θωμάς" would not match "ΘΩΜΑΣ", unless such a
> convertion exists. Not sure if that should be taken care of somewhere else,
> but in my case (and also in the gist I sent you, check the last comments) it
> proved useful and made sense.
Hmm, I see. Also described here:
https://en.wikipedia.org/wiki/Sigma
I take it you are making searches case insensitive by converting
everything to lower case.  Since you have a distinction that exists in
lower case but not in upper case, wouldn't it make more sense to
converting everything to upper case?
postgres=# select upper('Θωμάς'), upper('Θωμάσ'), upper('Θωμάσ') =
upper('Θωμάς');
 upper | upper | ?column?
-------+-------+----------
 ΘΩΜΆΣ | ΘΩΜΆΣ | t
(1 row)
PS On PostgreSQL mailing lists, we try to avoid "top posting" (=
leaving the message we're replying to below our reply), because it
makes the archive of email threads harder to read.
-- 
Thomas Munro
http://www.enterprisedb.com
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Michael Paquier | 2018-08-24 00:12:39 | Re: BUG #15347: Unaccent for greek characters does not work | 
| Previous Message | Tasos Maschalidis | 2018-08-23 22:47:59 | Re: BUG #15347: Unaccent for greek characters does not work |