Re: Zedstore - compressed in-core columnar storage - Mailing list pgsql-hackers

From Konstantin Knizhnik
Subject Re: Zedstore - compressed in-core columnar storage
Date
Msg-id b522e7a7-e7f4-47f1-7430-c594f3f712b6@postgrespro.ru
Whole thread Raw
In response to Re: Zedstore - compressed in-core columnar storage  (Heikki Linnakangas <hlinnaka@iki.fi>)
Responses Re: Zedstore - compressed in-core columnar storage
List pgsql-hackers

On 10.04.2019 10:25, Heikki Linnakangas wrote:
> On 10/04/2019 09:29, Amit Kapila wrote:
>> On Tue, Apr 9, 2019 at 5:57 AM Ashwin Agrawal <aagrawal@pivotal.io> 
>> wrote:
>>> Row store
>>> ---------
>>>
>>> The tuples are stored one after another, sorted by TID. For each
>>> tuple, we store its 48-bit TID, a undo record pointer, and the actual
>>> tuple data uncompressed.
>>>
>>
>> Storing undo record pointer with each tuple can take quite a lot of
>> space in cases where you can't compress them.
>
> Yeah. This does depend on compression to eliminate the unused fields 
> quite heavily at the moment. But you could have a flag in the header 
> to indicate "no undo pointer needed", and just leave it out, when it's 
> needed.
>
>> Have you thought how will you implement the multi-locker scheme in
>> this design?  In zheap, we have used undo for the same and it is easy
>> to imagine when you have separate transaction slots for each
>> transaction.  I am not sure how will you implement the same here.
> I've been thinking that the undo record would store all the XIDs 
> involved. So if there are multiple lockers, the UNDO record would 
> store a list of XIDs. Alternatively, I suppose you could store 
> multiple UNDO pointers for the same tuple.
>
> - Heikki
>
>

I also a little bit confused about UNDO records and MVCC support in 
Zedstore. Actually columnar store is mostly needed for analytic for
read-only or append-only data. One of the disadvantages of Postgres is 
quite larger per-record space overhead caused by MVCC.
It may be critical if you want to store huge timeseries  with relatively 
small number of columns (like measurements of some sensor).
It will be nice to have storage format which reduce this overhead when 
it is not needed (data is not updated).

Right now, even without UNFO pages, size of zedstore is larger than size 
of original Postgres table.
It seems to be very strange.



-- 
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company




pgsql-hackers by date:

Previous
From: Heikki Linnakangas
Date:
Subject: Re: Zedstore - compressed in-core columnar storage
Next
From: Heikki Linnakangas
Date:
Subject: Re: Zedstore - compressed in-core columnar storage