Quick Links

Re: C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL)

From:	Bruce Momjian <bruce(at)momjian(dot)us>
To:	Aleksey Tsalolikhin <atsaloli(dot)tech(at)gmail(dot)com>
Cc:	Merlin Moncure <mmoncure(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org
Subject:	Re: C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL)
Date:	2012-08-29 17:43:34
Message-ID:	20120829174334.GA8748@momjian.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

On Wed, Aug 29, 2012 at 10:31:21AM -0700, Aleksey Tsalolikhin wrote:
> On Wed, Aug 29, 2012 at 9:45 AM, Merlin Moncure <mmoncure(at)gmail(dot)com> wrote:
> > citext unfortunately doesn't allow for index optimization of LIKE
> > queries, which IMNSHO defeats the whole purpose. to the best way
> > remains to use lower() ...
> > this will be index optimized and fast as long as you specified C
> > locale for your database.
>
> What is the difference between C and en_US.UTF8, please? We see that
> the same query (that invokes a sort) runs 15% faster under the C
> locale. The output between C and en_US.UTF8 is identical. We're
> considering moving our database from en_US.UTF8 to C, but we do deal
> with internationalized text.

Well, C has reduced overhead for string comparisons, but obviously
doesn't work well for international characters. The single-byte
encodings have somewhat less overhead than UTF8. You can try using C
locales for databases that don't require non-ASCII characters.

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ It's impossible for everything to be true. +

In response to

C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL) at 2012-08-29 17:31:21 from Aleksey Tsalolikhin

Responses

Re: C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL) at 2012-08-29 18:45:20 from Merlin Moncure
Re: C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL) at 2012-08-29 18:52:50 from Scott Marlowe

Browse pgsql-general by date

	From	Date	Subject
Next Message	Merlin Moncure	2012-08-29 18:45:20	Re: C locale versus en_US.UTF8. (Was: String comparision in PostgreSQL)
Previous Message	Ryan Kelly	2012-08-29 17:40:40	Re: psql & unix env variables