
Re: [HACKERS] Faster methods for getting SPI results - Mailing list pgsql-hackers

From	Chapman Flack
Subject	Re: [HACKERS] Faster methods for getting SPI results
Date	August 2, 2017 08:30:30
Msg-id	59813946.40508@anastigmatix.net
In response to	[HACKERS] Faster methods for getting SPI results (Jim Nasby <Jim.Nasby@BlueTreble.com>)
Responses	Re: [HACKERS] Faster methods for getting SPI results (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-hackers


On 12/20/16 23:14, Jim Nasby wrote:
> I'm guessing one issue might be that
> we don't want to call an external interpreter while potentially holding page
> pins, but even then couldn't we just copy a single tuple at a time and save
> a huge amount of palloc overhead?

On 04/06/17 03:38, Craig Ringer wrote:
> Also, what rules apply in terms of what you can/cannot do from within
> a callback? Presumably it's unsafe to perform additional SPI calls,
> perform transactions, call into the executor, change the current
> snapshot, etc, but I would consider that reasonably obvious. Are there
> any specific things to avoid?

Confessing, right up front, that I'm not very familiar with the executor
or DestReceiver code, but thinking of issues that might be expected with
PLs, I wonder if there could be a design where the per-tuple callback
could sometimes return a status HAVE_SLOW_STUFF_TO_DO.

If it does, the executor could release some pins or locks, stack some
state, whatever allows it to (as far as practicable) relax restrictions
on what the callback would be allowed to do, then reinvoke the callback,
not with another tuple, but with OK_GO_DO_YOUR_SLOW_STUFF.

On return from that call, the executor could reacquire its stacked
state/locks/pins and resume generating tuples.

That way, a callback could, say, return normally 9 out of 10 times, just
quickly buffering up 10 tuples, and every 10th time return
HAVE_SLOW_STUFF_TO_DO and get a chance to jump into the PL interpreter and
deal with those 10 ...
(a) minimizing the restrictions on what the PL routine may do, and (b)
allowing any costs of state-stacking/lock-releasing-reacquiring, and control
transfer to the interpreter, to be amortized over some number of tuples.
How many tuples that should be might be an empirical question for any given
PL, but with a protocol like this, the callback has an easy way to control
it.
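
To make the idea concrete, here is a toy sketch of the handshake in plain C.
It is only a simulation under my own assumptions: none of these names
(CallbackStatus, CallReason, run_scan, relaxed_call, the pins_held flag) are
PostgreSQL or SPI API, the "tuples" are just ints, and "releasing pins" is a
flag flip. The final flush at end of scan is also my assumption; the
protocol as described above does not say how a partial batch is handled.

```c
#include <assert.h>

/* Hypothetical protocol constants; not part of any real executor API. */
typedef enum { CB_CONTINUE, CB_HAVE_SLOW_STUFF_TO_DO } CallbackStatus;
typedef enum { CALL_TUPLE, CALL_OK_GO_DO_YOUR_SLOW_STUFF } CallReason;

#define BATCH_MAX 64

/* State owned by the PL's callback. */
typedef struct {
    int  batch_size;            /* chosen by the PL, 1..BATCH_MAX */
    int  buffered[BATCH_MAX];   /* cheaply copied "tuples" */
    int  nbuffered;
    long processed_sum;         /* stands in for the interpreter's work */
    int  slow_calls;            /* times the slow path ran */
} CallbackState;

/* State owned by the (simulated) executor. */
typedef struct {
    int pins_held;              /* 1 while pins/locks are held */
    int releases;               /* times state was stacked and released */
} ExecutorState;

/* Fast path: buffer one tuple. Slow path: drain the buffer. */
CallbackStatus
tuple_callback(CallbackState *cb, CallReason reason, int tuple)
{
    if (reason == CALL_OK_GO_DO_YOUR_SLOW_STUFF) {
        /* Restrictions are relaxed here; a PL would enter its interpreter. */
        for (int i = 0; i < cb->nbuffered; i++)
            cb->processed_sum += cb->buffered[i];
        cb->nbuffered = 0;
        cb->slow_calls++;
        return CB_CONTINUE;
    }

    assert(cb->nbuffered < cb->batch_size);
    cb->buffered[cb->nbuffered++] = tuple;
    return cb->nbuffered == cb->batch_size ? CB_HAVE_SLOW_STUFF_TO_DO
                                           : CB_CONTINUE;
}

/* Release state, reinvoke the callback without a tuple, reacquire state. */
void
relaxed_call(ExecutorState *ex, CallbackState *cb)
{
    ex->pins_held = 0;
    ex->releases++;
    (void) tuple_callback(cb, CALL_OK_GO_DO_YOUR_SLOW_STUFF, 0);
    ex->pins_held = 1;
}

/* Simulated scan producing tuples 1..ntuples. */
void
run_scan(ExecutorState *ex, CallbackState *cb, int ntuples)
{
    assert(cb->batch_size > 0 && cb->batch_size <= BATCH_MAX);
    ex->pins_held = 1;

    for (int t = 1; t <= ntuples; t++) {
        if (tuple_callback(cb, CALL_TUPLE, t) == CB_HAVE_SLOW_STUFF_TO_DO)
            relaxed_call(ex, cb);
    }

    /* Assumption: give the callback one last slow call for a partial batch. */
    if (cb->nbuffered > 0)
        relaxed_call(ex, cb);

    ex->pins_held = 0;
}
```

With batch_size set to 10 and 25 tuples, the release/reacquire cost is paid
three times (after tuples 10 and 20, and once for the final partial batch)
rather than 25 times, which is the amortization in (b).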

Or would that be overcomplicated?

-Chap
