From: | Michael Robinson <robinson(at)netrinsics(dot)com> |
---|---|
To: | robinson(at)netrinsics(dot)com, t-ishii(at)sra(dot)co(dot)jp |
Cc: | pgsql-hackers(at)hub(dot)org, tgl(at)sss(dot)pgh(dot)pa(dot)us |
Subject: | Re: [HACKERS] fatal copy in/out error (6.5.3) |
Date: | 2000-01-25 03:04:12 |
Message-ID: | 200001250304.LAA39308@netrinsics.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp> writes:
>No. some EUC's (EUC_TW and EUC_JP) has three-byte or even four-byte
>codes. But you said your database has been configured as EUC_CN. As
>far as I know, it only uses 1 or 2 byte-code. Another thing I am
>confused is that ' \217\210' is not a valid EUC_CN data at all. \217
>(0x8f) specifies code set 3 which does not exist in EUC_CN. In this
>case, it is assumed that the multi-byte word to be consisted of 3-byte
>code in the current implementation of PostgreSQL.
It could be that one of our users had their input method set to produce
EUC_TW or Big5.
>In short, the problem you have is caused by:
>1) wrong data submitted into the table
Kind of hard to control that when data is submitted by random users on
the Internet.
>I would recommend you delete the data since it's not correct anyway.
>In the mean time I'm going to fix 2) so that it assumes data be
>consisted of 2 bytes even if wrong data sequence is submitted
>(needless to say, except ascii).
>Do you want the backpatch for 6.5.3?
Very much so. Thank you.
-Michael
From | Date | Subject | |
---|---|---|---|
Next Message | Tom Lane | 2000-01-25 03:06:05 | Sure enough, SI buffer overrun is broken |
Previous Message | Philip Warner | 2000-01-25 03:03:49 | Re: [HACKERS] DISTINCT ON: speak now or forever hold your peace |