Home > mailing lists

Re: JSON for PG 9.2 - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: JSON for PG 9.2
Date	January 20, 2012 11:31:47
Msg-id	CA+TgmoadO1egz3sq3YPqRmoHYbzhxwmyDif4f8MW5EZ6pttCqw@mail.gmail.com Whole thread
In response to	Re: JSON for PG 9.2 (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-hackers

Tree view

On Fri, Jan 20, 2012 at 10:27 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Robert Haas <robertmhaas@gmail.com> writes:
>> The code I've written so far does no canonicalization of the input
>> value of any kind, just as we do for XML.
>
> Fair enough.
>
>> So, given that framework, what the patch does is this: if you're using
>> UTF-8, then \uXXXX is accepted, provided that XXXX is something that
>> equates to a legal Unicode code point.  It isn't converted to the
>> corresponding character: it's just validated.  If you're NOT using
>> UTF-8, then it allows \uXXXX for code points up through 127 (which we
>> assume are the same in all encodings) and anything higher than that is
>> rejected.
>
> This seems a bit silly.  If you're going to leave the escape sequence as
> ASCII, then why not just validate that it names a legal Unicode code
> point and be done?  There is no reason whatever that that behavior needs
> to depend on the database encoding.

Mostly because that would prevent us from adding canonicalization in
the future, AFAICS, and I don't want to back myself into a corner.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

pgsql-hackers by date:

From: Robert Haas
Date: 20 January 2012, 11:30:52
Subject: Re: Group commit, revised

From: Robert Haas
Date: 20 January 2012, 11:33:04
Subject: Re: CLOG contention, part 2

Re: JSON for PG 9.2 - Mailing list pgsql-hackers

Previous

Next