Quick Links

Re: Mixing different LC_COLLATE and database encodings

From:	Bill Moseley <moseley(at)hank(dot)org>
To:	pgsql-general(at)postgresql(dot)org
Subject:	Re: Mixing different LC_COLLATE and database encodings
Date:	2006-02-19 04:16:07
Message-ID:	20060219041606.GB11754@hank.org
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

On Sat, Feb 18, 2006 at 09:31:27PM -0500, Greg Stark wrote:
> Anything else and the collation just won't work properly. It will be
> expecting UTF-8 and be fed ISO-8859-1 strings, resulting in weird
> and sometimes inconsistent sort orders.

So if I have utf8 encoded text and the lc_collate is anything but
utf8 then sorting will be all wrong for any chars that don't map to
ASCII (>127). Kind of a mess.

> There's a certain amount of feeling that using any locale other than C is
> probably not ever the right thing given the current functionality. Just about
> any database has some strings in it that are really just ascii strings like
> char(1) primary keys and other internal database strings. You may not want
> them being subject to the locale's collation for comparison purposes and you
> may not want the overhead of variable width character encodings.

Is the Holy Grail encoding and lc_collate settings per column?

Changing topics, but I'm going to play with different cluster
settings for collate. If I create a cluster in given directory
is there any problems with moving that cluster (renaming the
directory)?

Thanks for your comments, Greg.

--
Bill Moseley
moseley(at)hank(dot)org

In response to

Re: Mixing different LC_COLLATE and database encodings at 2006-02-19 02:31:27 from Greg Stark

Responses

Re: Mixing different LC_COLLATE and database encodings at 2006-02-19 12:45:38 from Martijn van Oosterhout

Browse pgsql-general by date

	From	Date	Subject
Next Message	Brendan Duddridge	2006-02-19 05:51:58	Same data, different results in Postgres vs. FrontBase
Previous Message	Greg Stark	2006-02-19 02:31:27	Re: Mixing different LC_COLLATE and database encodings