Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2"

From: Alvaro Herrera <alvherre(at)commandprompt(dot)com>
To: Andreas Kalsch <andreaskalsch(at)gmx(dot)de>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2"
Date: 2009-08-03 23:23:05
Message-ID: 20090803232305.GH19615@alvh.no-ip.org
Lists: pgsql-general

Andreas Kalsch wrote:

> My question again: Is there a native Postgres solution to simplify
> characters consistently? It means to completely remove all
> diacriticals from Unicode characters.

There's a to_ascii() function, but it supports only a subset of charsets,
and IIRC UTF8 is not one of them. Patches welcome.
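
Since the input is validated on the client side anyway, one possible
workaround is to strip the marks there before sending the data. The
following is only a minimal sketch using Python's standard unicodedata
module; the function name strip_diacritics is made up for illustration
and is not part of Postgres or any library.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Drop combining marks from text (hypothetical helper, not a Postgres API)."""
    # NFD splits a precomposed letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() returns a non-zero class for combining marks.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Recompose whatever is left into the usual precomposed form.
    return unicodedata.normalize("NFC", stripped)


if __name__ == "__main__":
    print(strip_diacritics("Žluťoučký kůň"))
```

Note that this only removes marks that Unicode treats as separate
combining characters after decomposition. Letters such as "Ł" have no
decomposition and pass through unchanged, as do symbols like U+2646.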

> I will validate input data on the client side (PHP or Python) and send
> it to the server. Of course the only encoding I will use on any side
> is UTF8. I just wanted to use this Latin thing for simplification of
> characters.

Hmm, it seems you're using the wrong tool for that purpose. Converting to
a different encoding does not remove any diacritical marks; it only
changes the underlying byte representation.
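
To illustrate the point outside the database, here is a small sketch
using Python's built-in codecs (an assumption made for illustration; the
server's own conversion routines are separate code). The helper name
to_latin2 is hypothetical. A letter that exists in LATIN2 keeps its
diacritic after conversion, while a character with no LATIN2 equivalent,
such as U+2646 (bytes 0xe29986 in UTF8), cannot be converted at all.

```python
from typing import Optional


def to_latin2(text: str) -> Optional[bytes]:
    """Encode text as ISO 8859-2 (LATIN2); return None if it cannot be represented."""
    try:
        return text.encode("iso8859_2")
    except UnicodeEncodeError:
        # The character has no equivalent in the target encoding.
        return None


if __name__ == "__main__":
    # The caron survives: only the byte representation differs from UTF8.
    print(to_latin2("č"), "č".encode("utf-8"))
    # U+2646 is not in LATIN2, so the conversion fails.
    print(to_latin2("\u2646"))
```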

--
Alvaro Herrera http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support
