Re: Is it possible to sort strings in EBCDIC order in PostgreSQL server?

From: John McKown <john(dot)archie(dot)mckown(at)gmail(dot)com>
To: "Tsunakawa, Takayuki" <tsunakawa(dot)takay(at)jp(dot)fujitsu(dot)com>
Cc: "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject: Re: Is it possible to sort strings in EBCDIC order in PostgreSQL server?
Date: 2017-12-12 13:18:10
Message-ID: CAAJSdjhQf1ajbsZhwtBVhD8sTRKfPQmao-cFL4P2wT1-oncTDw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Tue, Dec 12, 2017 at 2:17 AM, Tsunakawa, Takayuki <
tsunakawa(dot)takay(at)jp(dot)fujitsu(dot)com> wrote:

> Hi Laurenz, Tom, Peter,
>
> Thanks for your suggestions. The practical solution seems to be to
> override comparison operators of char, varchar and text data types with
> UDFs that behave as Tom mentioned.
>
> From: Peter Geoghegan [mailto:pg(at)bowt(dot)ie]
> > That said, the idea of an "EBCDIC collation" seems limiting. Why
> > should a system like DB2 for the mainframe (that happens to use EBCDIC
> > as its encoding) not have a more natural, human-orientated collation
> > even while using EBCDIC? ISTM that the point of using the "C" locale
> > (with EBDIC or with UTF-8 or with any other encoding) is to get a
> > performance benefit where the actual collation's behavior doesn't
> > matter much to users. Are you sure it's really important to be
> > *exactly* compatible with EBCDIC order? As long as you're paying for a
> > custom collation, why not just use a collation that is helpful to
> > humans?
>
> You are right. I'd like to ask the customer whether and why they need
> EBCDIC ordering.
>

​This is a guess on my part, based on many years on an EBCDIC system. But
I'll bet that they are doing a conversion off of the EBCDIC system (maybe
Db2 on z/OS) to an ASCII system (Linux or Windows) running PostgreSQL. They
want to be able to compare the output from the existing system to the
output on the new system. EBCDIC orders "lower case", "upper case", then
"digits". The default C locale on Linux (I don't know Windows) will sort
"digits", then alphabetic with the lower then upper case of each letter in
order like: "aAbB...zZ". Comparing identical data which is not presented in
exactly the same order would be very difficult. ​

>
> Regards
> Takayuki Tsunakawa
>
>

--
I have a theory that it's impossible to prove anything, but I can't prove
it.

Maranatha! <><
John McKown

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message James Keener 2017-12-12 15:11:10 Re: Is it possible to sort strings in EBCDIC order in PostgreSQL server?
Previous Message Edson Carlos Ericksson Richter 2017-12-12 12:44:08 Re: How to know if a database has changed