From: | Patrice Hédé <phede-ml(at)islande(dot)org> |
---|---|
To: | pgsql-patches(at)postgresql(dot)org |
Subject: | Re: [HACKERS] Unicode combining characters |
Date: | 2001-10-10 17:28:19 |
Message-ID: | 20011010192819.J14587@idf.net |
Lists: | pgsql-hackers pgsql-patches |
> > 1) we support these supplementary characters, knowing that they won't
> > work with regexes,
> >
> > 2) I back out the change, but then anyone using these characters will
> > get something weird, since the decoding would be faulty (they would
> > be handled as 3 bytes UTF-8 chars, and then the fourth byte would
> > become a "faulty char"... not very good, as the 3-byte version is
> > still not a valid UTF-8 code !),
> >
> > 3) we fix the regex engine within the next 24 hours, before the beta
> > deadline is activated :/
> >
> > What do you think ?
>
> I think 2) is not very good, and we should reject these 4-bytes UTF-8
> strings. After all, we are not ready for them.
If we still recognise them as 4-byte UTF-8 chars (in order to parse
the next char correctly) and reject them as invalid chars, that should
be OK :)
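
A minimal sketch of that idea, not the code from the patch: the lead byte gives the sequence length, so a 4-byte sequence is consumed as a single unit and then rejected, and the scan resumes on the next real character. The names `utf8_seq_len`, `scan_utf8` and `scan_result` are hypothetical, and the check is deliberately incomplete (it does not look for overlong forms or surrogates).

```c
#include <stddef.h>

/* Hypothetical helper: length in bytes of the UTF-8 sequence introduced
 * by lead byte c, or 0 if c cannot start a sequence. */
static int utf8_seq_len(unsigned char c)
{
    if (c < 0x80)           return 1;   /* 0xxxxxxx */
    if ((c & 0xE0) == 0xC0) return 2;   /* 110xxxxx */
    if ((c & 0xF0) == 0xE0) return 3;   /* 1110xxxx */
    if ((c & 0xF8) == 0xF0) return 4;   /* 11110xxx */
    return 0;                           /* continuation or invalid byte */
}

typedef struct {
    size_t accepted;   /* characters of 1 to 3 bytes */
    size_t rejected;   /* 4-byte characters and malformed bytes */
} scan_result;

/* Walk the buffer one character at a time. A well-formed 4-byte sequence
 * is skipped as a whole and counted as ONE rejected character, so the
 * byte after it is decoded from a correct boundary. */
static scan_result scan_utf8(const unsigned char *s, size_t len)
{
    scan_result r = {0, 0};
    size_t i = 0;

    while (i < len) {
        int n = utf8_seq_len(s[i]);
        int ok = (n != 0 && (size_t) n <= len - i);

        /* every trailing byte must look like 10xxxxxx */
        for (int k = 1; ok && k < n; k++)
            if ((s[i + k] & 0xC0) != 0x80)
                ok = 0;

        if (!ok) {              /* stray or truncated byte: reject it alone */
            r.rejected++;
            i++;
            continue;
        }
        if (n == 4)
            r.rejected++;       /* recognised, but not supported */
        else
            r.accepted++;
        i += (size_t) n;
    }
    return r;
}
```

With the input `'a', 0xF0, 0x90, 0x80, 0x80, 'b'` (U+10000 between two ASCII letters), this sketch accepts two characters and rejects one, instead of reading three bytes as a bogus character and then tripping over the fourth.
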
> BTW, other part of your patches looks good. Peter, what do you think?
Nice to hear :)
Patrice
--
Patrice Hédé
email: patrice hede à islande org
www : http://www.islande.org/