Re: UTF-8 and LIKE vs =

From: David Wheeler <david(at)kineticode(dot)com>
To: Ian Barwick <barwick(at)gmail(dot)com>
Cc: Markus Bertheau <twanger(at)bluetwanger(dot)de>, pgsql-general(at)postgresql(dot)org
Subject: Re: UTF-8 and LIKE vs =
Date: 2004-08-23 23:50:04
Message-ID: 282C6158-F55F-11D8-990D-000A95972D84@kineticode.com
Lists: pgsql-general

On Aug 23, 2004, at 4:34 PM, Ian Barwick wrote:

> wild speculation in need of a Korean speaker, but:
>
> ian(at)linux:~/tmp> cat j.txt
> テスト
> 환경설
> 전검색
> 웹문서
> 국방비
> 북한의
> てすと
> ian(at)linux:~/tmp> uniq j.txt
> テスト
> 환경설
> てすと
>
> All but the first and last lines are random Korean (Hangul)
> characters. Evidently our respective locales think all Hangul strings
> of the same length are identical, which is very probably not the
> case...

Does this go away if you change your locale to C?
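
A minimal sketch of that check, assuming a POSIX shell, a `uniq` that honors `LC_ALL`, and a UTF-8 copy of the sample file from the quoted message. The temporary directory and the `uniq_c.txt` name are illustrative only and not part of the original report:

```shell
# Work in a scratch directory so nothing in the current directory is touched.
tmpdir=$(mktemp -d)

# Recreate the sample file from the quoted message (assumed UTF-8).
printf '%s\n' 'テスト' '환경설' '전검색' '웹문서' '국방비' '북한의' 'てすと' > "$tmpdir/j.txt"

# Override the locale for this one command so lines are compared byte by byte.
LC_ALL=C uniq "$tmpdir/j.txt" > "$tmpdir/uniq_c.txt"

# If the locale's collation was collapsing the Hangul lines, the C-locale
# output should now be identical to the input.
if cmp -s "$tmpdir/j.txt" "$tmpdir/uniq_c.txt"; then
    echo "C locale: no lines collapsed"
else
    echo "C locale: lines still collapsed"
fi
```

Setting `LC_ALL` on the command line only affects that one invocation, so the rest of the session keeps its usual locale.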

Regards,

David
