Re: Best practice for: ERROR: invalid byte sequence for encoding "UTF8"

From: "Phoenix Kiula" <phoenix(dot)kiula(at)gmail(dot)com>
To: "Scott Marlowe" <scott(dot)marlowe(at)gmail(dot)com>
Cc: "Ivan Zolotukhin" <ivan(dot)zolotukhin(at)gmail(dot)com>, "Postgres General" <pgsql-general(at)postgresql(dot)org>
Subject: Re: Best practice for: ERROR: invalid byte sequence for encoding "UTF8"
Date: 2007-08-15 17:37:23
Message-ID: e373d31e0708151037g52e6219wcece85485528847e@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

> What, exactly, does that mean?
>
> That PostgreSQL should take things in invalid utf-8 format and just store them?
> Or that PostgreSQL should autoconvert from invalid utf-8 to valid
> utf-8, guessing the proper codes?
>
> Seriously, what do you want pgsql to do with these invalid inputs?

PG should let me, as the administrator of the database, decide whether
I mind my DB to have an option to:

1. Either allow the "invalid" input

2. Or to continue storing the other information in the table even if
an exception was thrown for the utf-8 column (which may be an
unrequired column, for instance, so I may want it not to block the
storage of other valid input which is more important)

I am not advocating what others should do. But I know what I need my
DB to do. If I want it to store data that does not match puritanical
standards of textual storage, then it should allow me to...

It's just a wishlist item from me, but I realize my voice may be
inconsequential.

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Scott Marlowe 2007-08-15 17:40:44 Re: Best practice for: ERROR: invalid byte sequence for encoding "UTF8"
Previous Message Scott Marlowe 2007-08-15 17:31:32 Re: Best practice for: ERROR: invalid byte sequence for encoding "UTF8"