Re: Strange UTF8 issue with pg_dump/pg_restore

From: "Jim Nasby" <jnasby(at)pervasive(dot)com>
To: "Benjamin Krajmalnik" <kraj(at)illumen(dot)com>
Cc: "Pgsql-Admin \(E-mail\)" <pgsql-admin(at)postgresql(dot)org>
Subject: Re: Strange UTF8 issue with pg_dump/pg_restore
Date: 2006-08-09 18:20:06
Message-ID: 4D27CB1096EF1C408F4BFAB0046EC7B6099FF5@ausmailid.aus.pervasive.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

Please include the list in your replies so other people can help.

-----Original Message-----
From: Benjamin Krajmalnik [mailto:kraj(at)illumen(dot)com]
Sent: Tuesday, August 08, 2006 10:31 PM
To: Jim Nasby
Subject: RE: [ADMIN] Strange UTF8 issue with pg_dump/pg_restore

I had corrupt rows inserted on 8.1.4/Windows. I had truncated the table, and it startd collecting data again.
I have since moved to 8.1.4/FreeBSD. I have been running the data for 2 days now, so the odds of having a corrupt record is pretty good if the problem exists. I will wait a few more days and will pg_dump the table which had the problems and attempt to restor it on a dev server to see if I see a problem.

Is iconv a utility that comes woth PG? How can I run the data through it (I am doing pg_dump in custom format).

No, it's a unix utility. You'd have to dump in plain text output, and then run that through iconv. There's info about this in the 8.1 release notes.

_____

From: Jim Nasby [mailto:jnasby(at)pervasive(dot)com]
Sent: Tue 8/8/2006 10:48 AM
To: Benjamin Krajmalnik
Cc: pgsql admin
Subject: Re: [ADMIN] Strange UTF8 issue with pg_dump/pg_restore

On Aug 7, 2006, at 9:08 PM, Benjamin Krajmalnik wrote:
> I have a database whose records are inserted via ODBC.
> I turned on logging, and the client is explicitly setting the
> client encoding to UTF.
>
> Rows are inserted into the tables. When I do a pg_dump, I do not
> get any errors. When I try tor restore, I am getting an error
> regarding an invalid UTF-8 character sequence.
> Of course, this renders compressed backups useless, since I have no
> way of manually fixing the file. The process of fixing the file
> is, of course, painful.
>
> Is this a known issue? I am running PG 8.1.4. The problem
> happened on PG Windows.
> Tomorrow I will try to see if the problem exists in the FreeBSD
> compile. If I get it to reproduce, I will be able to send some
> data dumps, but will need to know who to send it to.

Are the dumps from an older version? There's a known issue that older
versions weren't doing proper checking, in which case you can have
invalid UTF8 characters. The fix is to run the dump through iconv.
--
Jim C. Nasby, Sr. Engineering Consultant jnasby(at)pervasive(dot)com
Pervasive Software http://pervasive.com <http://pervasive.com/> work: 512-231-6117
vcard: http://jim.nasby.net/pervasive.vcf cell: 512-569-9461

Browse pgsql-admin by date

  From Date Subject
Next Message Joel Stevenson 2006-08-09 18:24:03 vacuumdb exclude tables option?
Previous Message Thomas Vatter 2006-08-09 15:46:08 service account