From: | "detrox yang" <detrox(at)gmail(dot)com> |
---|---|
To: | "Martijn van Oosterhout" <kleptog(at)svana(dot)org>, "detrox(at)gmail(dot)com" <detrox(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org |
Subject: | Re: how to ignore invalid byte sequence for encoding without using sql_ascii? |
Date: | 2007-10-10 03:33:29 |
Message-ID: | f9d504d90710092033u68b1aac4rc2b4b20429256056@mail.gmail.com |
Lists: | pgsql-general |
Got it. Thanks very much.
On 10/2/07, Martijn van Oosterhout <kleptog(at)svana(dot)org> wrote:
>
> On Thu, Sep 27, 2007 at 02:28:27AM -0700, detrox(at)gmail(dot)com wrote:
> > I am importing a Wikipedia dump file into my PostgreSQL database using
> > maintains/importDump.php. It fails with 'ERROR: invalid byte sequence
> > for encoding UTF-8'. Is there any way to make PostgreSQL simply ignore
> > the invalid characters (I mean, drop the invalid ones) so that the
> > script keeps going instead of dying on this error?
>
> No, postgres does not destroy data. If you want bits of your data
> removed you need to write your own tool to do it.
>
> That said, are you sure that the data you're importing is UTF-8?
>
> Have a nice day,
> --
> Martijn van Oosterhout <kleptog(at)svana(dot)org> http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to
> litigate.
>
>
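The reply above suggests two things: checking whether the data really is UTF-8, and writing a separate tool to drop the offending bytes before import. A minimal Python sketch of both follows. The function names (`find_first_invalid`, `strip_invalid_utf8`, `clean_stream`) are hypothetical and not part of PostgreSQL or MediaWiki; the sketch assumes the dump is meant to be UTF-8 and that silently discarding undecodable bytes is acceptable for the data in question.

```python
import codecs


def find_first_invalid(data: bytes):
    """Return the byte offset of the first invalid UTF-8 sequence, or None if data is valid."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as exc:
        return exc.start
    return None


def strip_invalid_utf8(data: bytes) -> bytes:
    """Return data with every byte that is not part of valid UTF-8 dropped."""
    return data.decode("utf-8", errors="ignore").encode("utf-8")


def clean_stream(src, dst, chunk_size=1 << 16):
    """Copy src to dst (binary file objects), dropping invalid UTF-8 bytes.

    An incremental decoder is used so that a multi-byte character split
    across two chunks is not mistaken for an invalid sequence.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="ignore")
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            break
        dst.write(decoder.decode(chunk).encode("utf-8"))
    # Final flush: a truncated multi-byte sequence at end of input is dropped.
    dst.write(decoder.decode(b"", final=True).encode("utf-8"))
```

One way to use this would be to open the dump and a new output file in binary mode, pass both to `clean_stream`, and feed the cleaned file to the import script. Running `find_first_invalid` on a sample first may be worthwhile: if it reports invalid bytes early and often, the file is probably in a different encoding (for example Latin-1), and converting it would preserve data that stripping would throw away.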