From: | Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp> |
---|---|
To: | cch(at)cc(dot)kmu(dot)edu(dot)tw |
Cc: | pgsql-patches(at)postgresql(dot)org |
Subject: | Re: A Patch for MIC to EUC_TW code converting in mb support |
Date: | 2000-11-09 04:43:25 |
Message-ID: | 20001109134325L.t-ishii@sra.co.jp |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-docs pgsql-hackers pgsql-patches |
> ============================================================================
>
> POSTGRESQL BUG REPORT: MIC to EUC_TW code converting in mb support
> ============================================================================
>
> System Configuration
> ---------------------
> Architecture (example: Intel Pentium) :x86
> Operating System (example: Linux 2.0.26 ELF) :Linux 2.2.x and FreeBSD
> 3.5R
> PostgreSQL version (example: PostgreSQL-7.0) :PostgreSQL-7.0.2
> Compiler used (example: gcc 2.8.0) :egcs-2.91.66, gcc 2.7.3
>
> A FULL description of the problem:
> ------------------------------------------------
> In PostgreSQL mb (multi-byte) support, there is a bug in code converting
>
> for MIC to EUC_TW. Original mic2euc_tw() in conv.c converts CNS
> 11643-1992
> Plane 2 into 2 bytes EUC_TW encoding. But characters in CNS 11643-1992
> Plane 2
> should be converted into 4 bytes EUC_TW encoding instead.
>
> A way to repeat the problem:
> ----------------------------------------------------------------------
> When you initdb with -E EUC_TW and set PGCLIENTENCODING to BIG5,
> you will find all the characters in CNS 11643-1992 Plane 2 are
> incorrectly stored or output.
>
> This problem might be fixed by the solution in the attachement.
Thanks for pointing it out. Your fix seems correct.
BTW I have found another bug with EUC_TW support. line 917 in conv.c:
*p++ = c1 - LC_CNS11643_3 + 0xa3;
this should be:
*p++ = *mic++ - LC_CNS11643_3 + 0xa3;
Otherwise, CNS 11643-1992 Plane 3 or more won't work. Could you test
it out with CNS 11643-1992 Plane 3 or more?
If they are ok, I will fix the current source and make a patch for
7.0.3 (I guess it's too late to back-patch the 7.0 tree).
--
Tatsuo Ishii
From | Date | Subject | |
---|---|---|---|
Next Message | ksp | 2000-11-09 16:41:44 | I hava a problem in postgresql! |
Previous Message | Chih-Chang Hsieh | 2000-11-09 02:12:10 | A Patch for MIC to EUC_TW code converting in mb support |
From | Date | Subject | |
---|---|---|---|
Next Message | Rod Taylor | 2000-11-09 04:44:06 | Re: Text concat problem |
Previous Message | Tom Lane | 2000-11-09 04:38:18 | Re: Text concat problem |
From | Date | Subject | |
---|---|---|---|
Next Message | Jan Thom | 2000-11-10 09:44:10 | Re: [BUGS] My Bug report: JDBC-Driver produces wrong output.] |
Previous Message | Bruce Momjian | 2000-11-09 04:15:56 | Re: Patch to fix installed location dependency in AIX |