Quick Links

Re: [rfc] unicode escapes for extended strings

From:	Marko Kreen <markokr(at)gmail(dot)com>
To:	Sam Mason <sam(at)samason(dot)me(dot)uk>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: [rfc] unicode escapes for extended strings
Date:	2009-04-17 16:07:31
Message-ID:	e51f66da0904170907v51cfb917r95be803276e16bba@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 4/16/09, Marko Kreen <markokr(at)gmail(dot)com> wrote:
> It's up to UTF8 validator whether to consider non-characters as error.

I checked, and it did not work well, as addunicode() did not set
the saw_high_bit variable.when outputting UTF8. Attached patch fixes it.

Currently is would be NOP as pg_verifymbstr() only checks for invalid UTF8,
and addunicode cannot output it, but in the future we may want to reject
some codes, so now it can.

Btw, is there any good reason why we don't reject \000, \x00
in text strings?

Currently I made addunicode() do it, because it seems sensible.

--
marko

Attachment	Content-Type	Size
unicode.escape.v2.diff	text/x-patch	3.7 KB

In response to

Re: [rfc] unicode escapes for extended strings at 2009-04-16 19:32:16 from Marko Kreen

Responses

Re: [rfc] unicode escapes for extended strings at 2009-04-17 17:01:47 from Martijn van Oosterhout
Re: [rfc] unicode escapes for extended strings at 2009-04-17 19:28:58 from Andrew Dunstan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2009-04-17 16:12:12	Re: Replacing plpgsql's lexer
Previous Message	Alberto J. Castiñeira P.	2009-04-17 16:07:20	oid in a where