Re: another seemingly simple encoding question

From: "John D(dot) Burger" <john(at)mitre(dot)org>
To: PostgreSQL-general general <pgsql-general(at)postgresql(dot)org>
Subject: Re: another seemingly simple encoding question
Date: 2006-03-24 13:56:18
Message-ID: 8c7b7e16a736c419961bd75a75c04374@mitre.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

> i have a problem matching a utf8 string with a field in a database
> encoded in utf8.

You seem to give all the details of your configuration, but unless I
misread your message, you don't say what the actual problem is. Can
you provide more details? What exactly doesn't work?

This may not be the issue, but many people don't realize that there are
sometimes multiple ways to encode what is conceptually the same string
in UTF8 (or any of the Unicode encodings). If you do not canonicalize
your strings using one of the Unicode normalization forms, then
seemingly identical strings may not match, because they are not
byte-for-byte identical.

- John D. Burger
MITRE

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Jim C. Nasby 2006-03-24 14:05:52 Re: URGENT! could not access status of transaction
Previous Message Reimer 2006-03-24 13:51:25 Re: URGENT! could not access status of transaction