Re: case insensitive regex clause with some latin1 characters

From: Emi Lu <emilu(at)encs(dot)concordia(dot)ca>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Ragnar Österlund <ragoster(at)gmail(dot)com>, pgsql-sql(at)postgresql(dot)org
Subject: Re: case insensitive regex clause with some latin1 characters
Date: 2006-09-11 20:14:04
Message-ID: 4505C38C.9080108@encs.concordia.ca
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-sql

My environment setup as:

show lc_ctype;
lc_ctype
-------------
fr_CA.UTF-8
(1 row)

fis=> SELECT 'Ä' ~* 'ä';
?column?
----------
f
(1 row)

fis=> SELECT 'Ä' ilike 'ä';
?column?
----------
f
(1 row)

I got the same result: false

> "=?ISO-8859-1?Q?Ragnar_=D6sterlund?=" <ragoster(at)gmail(dot)com> writes:
>> I'm not sure if this is a bug or if I'm doing something wrong. I have
>> a database encoded with ISO-8859-1, aka LATIN1. When I do something
>> like:
>
>> SELECT 'Ä' ~* 'ä';
>
>> it returns false.
>
> Check the database's locale setting (LC_CTYPE). It has to be one that
> expects LATIN1 encoding.
>
> The current regex code is generally not able to deal with locale-specific
> behaviors in UTF8 encoding, but it should work for single-byte encodings
> as long as you've got the locale setting right.
>
> regards, tom lane
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: don't forget to increase your free space map settings

In response to

Browse pgsql-sql by date

  From Date Subject
Next Message gabor 2006-09-11 23:34:38 text+number, find largest entry
Previous Message Tom Lane 2006-09-11 19:41:16 Re: case insensitive regex clause with some latin1 characters fails