Re: 8.0, UTF8, and CLIENT_ENCODING

From: Michael Glaesemann <grzm(at)seespotcode(dot)net>
To: PFC <lists(at)peufeu(dot)com>
Cc: "Paul Ramsey" <pramsey(at)refractions(dot)net>, PostgreSQL <pgsql-general(at)postgresql(dot)org>
Subject: Re: 8.0, UTF8, and CLIENT_ENCODING
Date: 2007-05-17 22:25:14
Message-ID: 30280E5A-761C-4C44-9EC7-D220F12A4032@seespotcode.net
Lists: pgsql-general


On May 17, 2007, at 16:47 , PFC wrote:

>> and put that in the form. Instead of being mapped to 2-byte UTF8
>> high-bit equivalents, they are going into the database directly as
>> one-byte values > 127. That is, as illegal UTF8 values.
>
> Sometimes you also get HTML entities in the mix. Who knows.
> All my web forms are UTF-8 back to back, it just works. Was I lucky ?
> Normally postgres rejects illegal UTF8 values, you wouldn't be
> able to insert them...

8.0 and earlier weren't quite as strict as they should have been. See
the note at the end of the migration instructions in the release notes
for 8.1 [1]. That may have been part of the issue here.
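
To make the quoted problem concrete, here is a minimal Python sketch (not
from the original thread; `is_valid_utf8` is a hypothetical helper name, and
"é" is an arbitrary example character). It shows why a single byte above 127
is not legal UTF-8 on its own, while the properly converted two-byte form is.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# "é" (U+00E9) as a client using Latin-1 would send it: one byte, 0xE9.
latin1_bytes = "\u00e9".encode("latin-1")

# The same character correctly converted to UTF-8: two bytes, 0xC3 0xA9.
utf8_bytes = "\u00e9".encode("utf-8")

print(latin1_bytes, is_valid_utf8(latin1_bytes))
print(utf8_bytes, is_valid_utf8(utf8_bytes))
```

A check like this on the client side, before inserting, would presumably
catch the raw one-byte values regardless of how strict the server is.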

Michael Glaesemann
grzm seespotcode net

[1](http://www.postgresql.org/docs/8.2/interactive/release-8-1.html#AEN80196)
