Home > mailing lists

Re: Pluggable Storage - Andres's take - Mailing list pgsql-hackers

From	Andres Freund
Subject	Re: Pluggable Storage - Andres's take
Date	May 8, 2019 21:46:27
Msg-id	20190508214627.hw7wuqwawunhynj6@alap3.anarazel.de Whole thread Raw
In response to	Re: Pluggable Storage - Andres's take (Ashwin Agrawal <aagrawal@pivotal.io>)
Responses	Re: Pluggable Storage - Andres's take
List	pgsql-hackers

Tree view

Hi,

On 2019-05-07 23:18:39 -0700, Ashwin Agrawal wrote:
> On Mon, May 6, 2019 at 1:39 PM Ashwin Agrawal <aagrawal@pivotal.io> wrote:
> > Also wish to point out, while working on Zedstore, we realized that
> > TupleDesc from Relation object can be trusted at AM layer for
> > scan_begin() API. As for ALTER TABLE rewrite case (ATRewriteTables()),
> > catalog is updated first and hence the relation object passed to AM
> > layer reflects new TupleDesc. For heapam its fine as it doesn't use
> > the TupleDesc today during scans in AM layer for scan_getnextslot().
> > Only TupleDesc which can trusted and matches the on-disk layout of the
> > tuple for scans hence is from TupleTableSlot. Which is little
> > unfortunate as TupleTableSlot is only available in scan_getnextslot(),
> > and not in scan_begin(). Means if AM wishes to do some initialization
> > based on TupleDesc for scans can't be done in scan_begin() and forced
> > to delay till has access to TupleTableSlot. We should at least add
> > comment for scan_begin() to strongly clarify not to trust Relation
> > object TupleDesc. Or maybe other alternative would be have separate
> > API for rewrite case.
> 
> Just to correct my typo, I wish to say, TupleDesc from Relation object can't
> be trusted at AM layer for scan_begin() API.
> 
> Andres, any thoughts on above. I see you had proposed "change the
> table_beginscan* API so it
> provides a slot" in [1], but seems received no response/comments that time.
> [1]
> https://www.postgresql.org/message-id/20181211021340.mqaown4njtcgrjvr%40alap3.anarazel.de

I don't think passing a slot at beginscan time is a good idea. There's
several places that want to use different slots for the same scan, and
we probably want to increase that over time (e.g. for batching), not
decrease it.

What kind of initialization do you want to do based on the tuple desc at
beginscan time?

Greetings,

Andres Freund

pgsql-hackers by date:

From: Andres Freund
Date: 08 May 2019, 21:43:35
Subject: Re: Pluggable Storage - Andres's take

From: Andres Freund
Date: 08 May 2019, 21:51:35
Subject: Re: Inconsistency between table am callback and table function names

Re: Pluggable Storage - Andres's take - Mailing list pgsql-hackers

Previous

Next