Re: jsonb format is pessimal for toast compression - Mailing list pgsql-hackers

From	Andrew Dunstan
Subject	Re: jsonb format is pessimal for toast compression
Date	August 8, 2014 19:36:05
Msg-id	53E4FC6A.3080606@dunslane.net
In response to	Re: jsonb format is pessimal for toast compression (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: jsonb format is pessimal for toast compression
List	pgsql-hackers

On 08/08/2014 11:54 AM, Tom Lane wrote:
> Andrew Dunstan <andrew@dunslane.net> writes:
>> On 08/08/2014 11:18 AM, Tom Lane wrote:
>>> That's not really the issue here, I think.  The problem is that a
>>> relatively minor aspect of the representation, namely the choice to store
>>> a series of offsets rather than a series of lengths, produces
>>> nonrepetitive data even when the original input is repetitive.
>> It would certainly be worth validating that changing this would fix the
>> problem.
>> I don't know how invasive that would be - I suspect (without looking
>> very closely) not terribly much.
> I took a quick look and saw that this wouldn't be that easy to get around.
> As I'd suspected upthread, there are places that do random access into a
> JEntry array, such as the binary search in findJsonbValueFromContainer().
> If we have to add up all the preceding lengths to locate the corresponding
> value part, we lose the performance advantages of binary search.  AFAICS
> that's applied directly to the on-disk representation.  I'd thought
> perhaps there was always a transformation step to build a pointer list,
> but nope.

It would be interesting to know what the performance hit would be if we
calculated the offsets/pointers on the fly, especially if we could cache
them somehow. The main benefit of binary search is in saving on
comparisons, especially of strings, ISTM, and that could still be
available - this would just be a bit of extra arithmetic.
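
A minimal sketch of the idea above, in C. Everything here is an
illustrative assumption: ToyContainer, toy_build_offsets(),
toy_compare() and toy_find_key() are hypothetical names, and the layout
is a toy, not the real JEntry array or jsonb on-disk format. It stores
only key lengths, rebuilds the offsets with one pass of additions, and
then runs an ordinary binary search against that cached offset array.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical toy container (NOT the real jsonb layout): keys are kept
 * sorted and stored back to back; only their lengths are recorded. */
typedef struct ToyContainer
{
    size_t          nkeys;
    const uint32_t *lens;   /* lens[i] = byte length of key i */
    const char     *data;   /* concatenated key bytes, no terminators */
} ToyContainer;

/* One O(n) pass of additions: offs[i] is where key i starts, and
 * offs[nkeys] is the total size. offs must have room for nkeys + 1. */
static void
toy_build_offsets(const ToyContainer *c, uint32_t *offs)
{
    uint32_t    sum = 0;

    assert(c != NULL && offs != NULL);
    for (size_t i = 0; i < c->nkeys; i++)
    {
        offs[i] = sum;
        sum += c->lens[i];
    }
    offs[c->nkeys] = sum;
}

/* Byte-wise comparison, shorter string first on a common prefix.
 * This ordering is an assumption of the sketch. */
static int
toy_compare(const char *a, size_t alen, const char *b, size_t blen)
{
    size_t      n = (alen < blen) ? alen : blen;
    int         cmp = memcmp(a, b, n);

    if (cmp != 0)
        return cmp;
    return (alen > blen) - (alen < blen);
}

/* Binary search using the cached offsets: O(log n) string comparisons.
 * Returns the key's index, or -1 if it is not present. */
static long
toy_find_key(const ToyContainer *c, const uint32_t *offs,
             const char *key, size_t keylen)
{
    size_t      lo = 0;
    size_t      hi = c->nkeys;

    while (lo < hi)
    {
        size_t      mid = lo + (hi - lo) / 2;
        int         cmp = toy_compare(c->data + offs[mid], c->lens[mid],
                                      key, keylen);

        if (cmp == 0)
            return (long) mid;
        if (cmp < 0)
            lo = mid + 1;
        else
            hi = mid;
    }
    return -1;
}
```

The prefix-sum pass is the "extra arithmetic": n integer additions per
container, after which each lookup still needs only about log2(n)
string comparisons. Whether that trade is a win in practice is exactly
the open question, and this sketch does not measure it.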

cheers

andrew
