Re: BUG #14885: mistake in sorting win1251 chars

From: Francisco Olarte <folarte(at)peoplecall(dot)com>
To: Kalin Daskalov <k(dot)daskalov(dot)911(at)gmail(dot)com>
Cc: "pgsql-bugs(at)postgresql(dot)org" <pgsql-bugs(at)postgresql(dot)org>, Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>
Subject: Re: BUG #14885: mistake in sorting win1251 chars
Date: 2017-11-03 16:44:05
Message-ID: CA+bJJbzs_CpX7HKXZbLWZVHhmX_2sDU1aBLrPpmcg6AQme4ZYA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Kalin:

1st, if you follow on top-posting the flow is going to become dificult
to understand.

2nd:
On Fri, Nov 3, 2017 at 12:18 PM, Kalin Daskalov
<k(dot)daskalov(dot)911(at)gmail(dot)com> wrote:
> It's the proper locale. It's set to Bulgarian everywhere.

> - Windows settings:
> format: Bulgarian (Bulgaria)
> Location: Bulgaria
> Current language for non-Unicode programs: Bulgarian (Bulgaria)

I assume this is correct, I don't do windows.

> - PostgreSQL Database settings:
> ENCODING = 'UTF8'
> LC_COLLATE = 'Bulgarian_Bulgaria.1251'
> LC_CTYPE = 'Bulgarian_Bulgaria.1251'

This sounds fishy. Specially the .1251, which hints at a collation for
win1251 encoded bulgarian.

In my computers ( linux, debian and redhat derived most of them ) I
have LC_COLLATE set to en_US.UTF-8, and they seem to sort utf-8
properly. Maybe your .1251 does some strange character mapping utf-8
=> 1251 before sorting. The collations are, IIRC, derived from the c
library collaiton routines anyway.

Francisco Olarte.

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message Jeff Janes 2017-11-03 19:42:42 permissions denial to superuser with foreign keys
Previous Message David G. Johnston 2017-11-03 14:12:12 Re: BUG #14883: Syntax SQL error (42601), but should be a different error no