Re: BUG #10589: hungarian.stop file spelling error

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>
Cc: Gavin Flower <GavinFlower(at)archidevsys(dot)co(dot)nz>, Kevin Grittner <kgrittn(at)ymail(dot)com>, "zsoros(at)gmail(dot)com" <zsoros(at)gmail(dot)com>, "pgsql-bugs(at)postgresql(dot)org" <pgsql-bugs(at)postgresql(dot)org>
Subject: Re: BUG #10589: hungarian.stop file spelling error
Date: 2014-06-11 03:43:47
Message-ID: 6839.1402458227@sss.pgh.pa.us
Lists: pgsql-bugs

Alvaro Herrera <alvherre(at)2ndquadrant(dot)com> writes:
> Gavin Flower wrote:
>> Not saying there is any problem, but you might like to check how the
>> EUR currency symbol is handled (it is in LATIN2, but not in LATIN1):

> Latin1 doesn't have euro, which is why Latin9 (iso-8859-15) was invented
> IIUC.

Yeah, I doubt there's much to be learned from the euro-sign case.
The Snowball stemmers certainly don't care about euro --- they
only work with alphabetic characters.

Actually, an interesting point is that we could probably use one of the
single-byte-encoding LATIN1 stemmers when the database encoding is LATIN9,
and thereby save a translation to UTF8 and back, since the stemmer logic
isn't going to care about euro signs. Likewise for LATIN2 vs LATIN10.
Not sure it's worth the trouble though.

regards, tom lane
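
As a rough illustration of the LATIN1 vs LATIN9 point above, the sketch below lists the byte values that decode differently under the two encodings, using Python's standard codecs. The function name differing_bytes is made up for this example, and nothing here touches the Snowball stemmers or PostgreSQL's own encoding handling; it only shows which code points a LATIN1 stemmer would read differently if handed LATIN9 text, so treat it as a way to check the assumption rather than a statement about how the stemmers behave.

```python
def differing_bytes(enc_a, enc_b):
    """Return (byte, char_in_a, char_in_b) for every byte value that
    decodes to a different character under the two single-byte encodings.

    Hypothetical helper for illustration only.
    """
    diffs = []
    for b in range(256):
        raw = bytes([b])
        # errors="replace" keeps the loop going if an encoding leaves
        # a byte value undefined (not the case for these two codecs).
        char_a = raw.decode(enc_a, errors="replace")
        char_b = raw.decode(enc_b, errors="replace")
        if char_a != char_b:
            diffs.append((b, char_a, char_b))
    return diffs


if __name__ == "__main__":
    # ISO-8859-1 is LATIN1, ISO-8859-15 is LATIN9.
    for byte, latin1, latin9 in differing_bytes("iso8859-1", "iso8859-15"):
        print(f"0x{byte:02X}: LATIN1 {latin1!r} (alpha={latin1.isalpha()})"
              f"  LATIN9 {latin9!r} (alpha={latin9.isalpha()})")
```

The euro sign is one of the differing positions; the printed alpha flags show whether any of the others are letters in LATIN9, which is the part that would matter to a stemmer.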
