Re: jsonb, unicode escapes and escaped backslashes

From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Robert Haas <robertmhaas(at)gmail(dot)com>
Cc: Noah Misch <noah(at)leadboat(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: jsonb, unicode escapes and escaped backslashes
Date: 2015-01-29 23:02:02
Message-ID: 54CABBEA.7070903@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers


On 01/29/2015 05:39 PM, Tom Lane wrote:
> Robert Haas <robertmhaas(at)gmail(dot)com> writes:
>> I have yet to understand what we fix by banning \u0000. How is 0000
>> different from any other four-digit hexadecimal number that's not a
>> valid character in the current encoding? What does banning that one
>> particular value do?
> BTW, as to the point about encoding violations: we *already* ban \uXXXX
> sequences that don't correspond to valid characters in the current
> encoding. The attempt to exclude U+0000 from the set of banned characters
> was ill-advised, plain and simple.
>
>

Actually, unless the encoding is utf8 we ban all non-ascii unicode
escapes even if they might designate a valid character in the current
encoding. This was arrived at after some discussion here. So adding
\u0000 to the list of banned characters is arguably just making us more
consistent.

cheers

andrew

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Jim Nasby 2015-01-29 23:02:51 Re: Proposal: two new role attributes and/or capabilities?
Previous Message Tom Lane 2015-01-29 22:39:23 Re: jsonb, unicode escapes and escaped backslashes