Thread: pgsql: Build de-escaped JSON strings in larger chunks during lexing

pgsql: Build de-escaped JSON strings in larger chunks during lexing

From
John Naylor
Date:
Build de-escaped JSON strings in larger chunks during lexing

During COPY BINARY with large JSONB blobs, it was found that half
the time was spent parsing JSON, with much of that spent in separate
appendStringInfoChar() calls for each input byte.

Add lookahead loop to json_lex_string() to allow batching multiple bytes
via appendBinaryStringInfo(). Also use this same logic when de-escaping
is not done, to avoid code duplication.

Report and proof of concept patch by Jelte Fennema, reworked by Andres
Freund and John Naylor

Discussion:
https://www.postgresql.org/message-id/CAGECzQQuXbies_nKgSiYifZUjBk6nOf2%3DTSXqRjj2BhUh8CTeA%40mail.gmail.com
Discussion:
https://www.postgresql.org/message-id/flat/PR3PR83MB0476F098CBCF68AF7A1CA89FF7B49@PR3PR83MB0476.EURPRD83.prod.outlook.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/3838fa269c15706df2b85ce2d6af8aacd5611655

Modified Files
--------------
src/common/jsonapi.c | 58 +++++++++++++++++++++++++++++++++++-----------------
1 file changed, 39 insertions(+), 19 deletions(-)


John Naylor <john.naylor@postgresql.org> writes:
> Build de-escaped JSON strings in larger chunks during lexing

It looks like this broke buildfarm member skink (valgrind).

            regards, tom lane



Re: pgsql: Build de-escaped JSON strings in larger chunks during lexing

From
John Naylor
Date:
On Tue, Jul 12, 2022 at 5:15 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>
> It looks like this broke buildfarm member skink (valgrind).

Looking...

-- 
John Naylor
EDB: http://www.enterprisedb.com