Home > mailing lists

Re: Fix parsing of identifiers in jsonpath - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Fix parsing of identifiers in jsonpath
Date	September 18, 2019 21:28:16
Msg-id	11458.1568842096@sss.pgh.pa.us Whole thread Raw
In response to	Fix parsing of identifiers in jsonpath (Nikita Glukhov <n.gluhov@postgrespro.ru>)
List	pgsql-hackers

Tree view

Nikita Glukhov <n.gluhov@postgrespro.ru> writes:
> I don't know if it is possible to check Unicode properties "ID_Start" and
> "ID_Continue" in Postgres, and what ZWNJ/ZWJ is.  Now, identifier's starting
> character set is simply determined by the exclusion of all recognized special
> characters.

TBH, I think you should simply ignore any aspect of any of these standards
that is defined by reference to Unicode.  We are not necessarily dealing
with a Unicode character set, so at best, references to things like ZWNJ
are unreachable no-ops in a lot of environments.

As a relevant example, modern SQL defines whitespace in terms of Unicode[1],
a fact that we have ignored from the start and will likely continue to
do so.

You could do a lot worse than to just consider identifiers to be the same
strings as our SQL lexer would do (modulo things like "$" that have
special status in the path language).

            regards, tom lane

[1] cf 4.2.4 "Character repertoires" in SQL:2011

pgsql-hackers by date:

From: Tom Lane
Date: 18 September 2019, 21:12:20
Subject: Re: Define jsonpath functions as stable

From: Melanie Plageman
Date: 18 September 2019, 21:32:13
Subject: Re: Memory Accounting

Re: Fix parsing of identifiers in jsonpath - Mailing list pgsql-hackers

Previous

Next