Thread: Re: Using Expanded Objects other than Arrays from plpgsql
On Sun, Oct 20, 2024 at 8:46 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> On Sun, Oct 20, 2024 at 10:13 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
(from thread https://www.postgresql.org/message-id/CACxu%3DvJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg%40mail.gmail.com)
>> But it seems like we could get an easy win by adjusting
>> plpgsql_exec_function along the lines of
>> ...
> I tried this change and couldn't get it to work, on the next line:
> if (VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value)))
> var->value might not be a pointer, as it seems at least from my gdb
> scratching, but say an integer. This segfaults on non-array but
> non-expandable datum.
We need the same test that exec_assign_value makes,
!var->datatype->typbyval, before it's safe to apply DatumGetPointer.
So line 549 needs to be more like
- if (!var->isnull && var->datatype->typisarray)
+ if (!var->isnull && !var->datatype->typbyval)
> Another comment that caught my eye was this one:
> https://github.com/postgres/postgres/blob/master/src/pl/plpgsql/src/pl_exec.c#L8304
> Not sure what the implication is there.
Yeah, that's some more unfinished business. I'm not sure if it
matters to your use-case or not.
BTW, we probably should move this thread to pgsql-hackers.
And here we are, thanks for your help on this Tom. For some thread switching context for others, I'm writing a postgres extension that wraps the SuiteSparse:GraphBLAS API and provides new types for sparse and dense matrices and vectors. It's like a combination of numpy and scipy.sparse but for Postgres with an emphasis on graph analytics as sparse adjacency matrices using linear algebra.
I use the expandeddatum API to flatten and expand on disk compressed representations of these objects into "live" in-memory objects managed by SuiteSparse. All GraphBLAS objects are opaque handles, and my expanded objects are essentially a box around this handle. I use memory context callbacks to free the handles when the memory context of the box is freed. This works very well and I've made a lot of progress on creating a very clean algebraic API, here are the doctests for matrices, this is all generated from live code!
Doing some benchmarking I noticed that when writing some simple graph algorithms in plpgsql, my objects were being constantly flattened and expanded. Posting my question above to pgsql-general Tom gave me some tips and suggested I move the thread here.
My non-expert summary: plpgsql only optimizes for expanded objects if they are arrays. Non array expanded objects get flattened/expanded on every use. For large matrices and vectors this is very expensive. Ideally I'd like to expand my object, use it throughout the function, return it to another function that may use it, and only flatten it when it goes to disk or it's completely unavoidable. The comment in expandeddatum.h really kind of sells this as one of the main features:
* An expanded object is meant to survive across multiple operations, but
* not to be enormously long-lived; for example it might be a local variable
* in a PL/pgSQL procedure. So its extra bulk compared to the on-disk format
* is a worthwhile trade-off.
* not to be enormously long-lived; for example it might be a local variable
* in a PL/pgSQL procedure. So its extra bulk compared to the on-disk format
* is a worthwhile trade-off.
In my case it's not a question of saving bulk, the on disk representation of a matrix is not useful at compute time, it needs to be expanded (using GraphBLAS's serialize/deserialize API) for it to be usable for matrix operations like matmul. In most cases algorithms using these objects iterate in a loop, doing various algebraic operations almost always involving a matmul until they converge on a stable solution or they exhaust the input elements. Here for example is a "minimum parent BFS" that takes a graph and a starting node, and traverses the graph breadth first, computing a vector of every node and its minimum parent id.
CREATE OR REPLACE FUNCTION bfs(graph matrix, start_node bigint)
RETURNS vector LANGUAGE plpgsql AS
$$
DECLARE
bfs_vector vector = vector('int32');
next_vector vector = vector('int32');
BEGIN
bfs_vector = set_element(bfs_vector, start_node, 1);
WHILE sssp_vector != next_vector LOOP
next_vector = dup(bfs_vector);
bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32',
w=>bfs_vector, accum=>'min_int32');
END LOOP;
RETURN bfs_vector;
end;
$$;
(If you're wondering "Why would anyone do it this way" it's because SuiteSparse is optimized for parallel sparse matrix multiplication and has a JIT compiler that can target multiple architectures, at the moment CPUs and CUDA GPUs. Reusing the same Linear Algebra already prevalent in graph theory means not having to think about any low level implementation issues and having code that is completely portable from CPU to GPU or other accelerators).
So, I made the two small changes Tom suggested above and I have them in a side fork here:
Good news, my code still works, but bad news is there is still a lot of flattening/expanding/freeing going on at multiple points in each iteration of the algorithm. I'll note too that:
BEGIN
bfs_vector = set_element(sssp_vector, start_node, 1);
bfs_vector = set_element(sssp_vector, start_node, 1);
I'd prefer that to not be an assignment, set_element mutates the object (I eventually plan to support subscripting syntax like bfs_vector[start_node] = 1)
same with:
bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32',
w=>bfs_vector, accum=>'min_int32');
w=>bfs_vector, accum=>'min_int32');
This matmul mutates bfs_vector, I shouldn't need to reassign it back but at the moment it seems necessary otherwise the mutations are lost but this costs a full flatten/expand cycle.
Short term my goal is to optimize plpgsql so that my objects stay expanded for the life of the function. Long term there's some "unfinished business" to use Tom's words around the expandeddatum API. I'm not really qualified to speak on these issue but this is my understanding of some of them:
- plpgsql knows how to expand arrays and is hardwired for it, but how would it know how to expand other expandable types?
- Issues with exec_check_rw_parameter also being hardwired to only optimize expanded objects for array append and prepend, I suspect this has something to do with my issue above about mutating objects in place.
I may have missed something but hopefully that brings anyone up to speed interested in this topic.
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > Doing some benchmarking I noticed that when writing some simple graph > algorithms in plpgsql, my objects were being constantly flattened and > expanded. Posting my question above to pgsql-general Tom gave me some tips > and suggested I move the thread here. > My non-expert summary: plpgsql only optimizes for expanded objects if they > are arrays. Non array expanded objects get flattened/expanded on every > use. For large matrices and vectors this is very expensive. Ideally I'd > like to expand my object, use it throughout the function, return it to > another function that may use it, and only flatten it when it goes to disk > or it's completely unavoidable. Right. My recollection is that when we first invented expanded objects, we only implemented expanded arrays, and so it seemed premature to try to define generalized APIs. So that went on the back burner until we got some more cases to look at. But now we also have expanded records, and with your use-case as an example of an extension trying to do similar things, I feel like we have enough examples to move forward. As far as the hack we were discussing upthread goes, I realized that it should test for typlen == -1 not just !typbyval, since the VARATT_IS_EXTERNAL_EXPANDED_RW test requires that there be a length word. With that fix and some comment updates, it looks like the attached. I'm inclined to just go ahead and push that. It won't move the needle hugely far, but it will help, and it seems simple and safe. To make more progress, there are basically two areas that we need to look at: 1. exec_assign_value() has hard-wired knowledge that it's a good idea to expand an array that's being assigned to a plpgsql variable, and likewise hard-wired knowledge that calling expand_array() is how to do that. The bit in plpgsql_exec_function() that we're patching in the attached is the same thing, but for the initial assignment of a function input parameter to a plpgsql variable. At the time this was written I was quite unsure that forcible expansion would be a net win, but the code is nine years old now and there's been few or no performance complaints. So maybe it's okay to decide that "always expand expandable types during assignment" is a suitable policy across the board, and we don't need to figure out a smarter rule. It sounds like that'd probably be a win for your application, which gives me more faith in the idea than I would've had before. That leaves the problem of how to find the right code to call to do the expansion. Clearly, we need to attach something to data types to make that possible. My first thought was to invent a pg_type column (and CREATE TYPE/ALTER TYPE options, and pg_dump support, yadda yadda) containing the OID of a function corresponding to expand_array(). However, that work would have to be done again the next time we think of something we'd like to know about a data type. So now I'm thinking that we should steal ideas from the "prosupport" API (see src/include/nodes/supportnodes.h) and invent a concept of a "type support" function that can handle an extensible set of different requests. The first one would be to pass back the address of an expansion function comparable to expand_array(), if the type supports being converted to an expanded object. 2. Most of the performance gold is hidden in deciding when we can optimize operations on expanded objects that look like plpgsql_var := f(plpgsql_var, other-parameters); by passing a R/W rather than R/O expanded pointer to f() and letting it munge the expanded object in-place. If we fail to do that then f() has to construct a new expanded object for its result. It's probably still better off getting a R/O pointer than a flat object, but nonetheless we fail to avoid a lot of data copying. The difficulty here is that we do not want to change the normal naive semantics of such an assignment, in particular "if f() throws an error then the value of plpgsql_var has not been modified". This means that we can only optimize when the R/W parameter is to be passed to the top-level function of the expression (else, some function called later could throw an error and ruin things). Furthermore, we need a guarantee from f() that it will not throw an error after modifying the value. As things stand, plpgsql has hard-wired knowledge that array_subscript_handler(), array_append(), and array_prepend() are safe in that way, but it knows nothing about anything else. So one route to making things better seems fairly clear: invent a new prosupport request that asks whether the function is prepared to make such a guarantee. I wonder though if you can guarantee any such thing for your functions, when you are relying on a library that's probably not designed with such a restriction in mind. If this won't work then we need a new concept. One idea I was toying with is that it doesn't matter if f() throws an error so long as the plpgsql function is not executing within an exception block: if the error propagates out of the plpgsql function then we no longer care about the value of the variable. That would very substantially weaken the requirements on how f() is implemented. I'm not sure that we could make this work across multiple levels of plpgsql functions (that is, if the value of the variable ultimately resides in some outer function) but right now that's not an issue since no plpgsql function as such would ever promise to be safe, thus it would never receive a R/W pointer to some outer function's variable. > same with: > bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32', > w=>bfs_vector, accum=>'min_int32'); > This matmul mutates bfs_vector, I shouldn't need to reassign it back but at > the moment it seems necessary otherwise the mutations are lost but this > costs a full flatten/expand cycle. I'm not hugely excited about that. We could imagine extending this concept to INOUT parameters of procedures, but it doesn't seem like that buys anything except a small notational savings. Maybe it would work to allow multiple parameters of a procedure to be passed as R/W, whereas we're restricted to one for the function-notation method. So possibly there's a gain there but I'm not sure how big. BTW, if I understand your example correctly then bfs_vector is being passed to vxm() twice. This brings up an interesting point: if we pass the first instance as R/W, and vxm() manipulates it, then the changes would also be visible in its other parameter "w". This is certainly not per normal semantics. A "safe" function would have to either not have any possibility that two parameters could refer to the same object, or be coded in a way that made it impervious to this issue --- in your example, it couldn't look at "w" anymore once it'd started modifying the first parameter. Is that an okay requirement, and if not what shall we do about it? Anyway that's my brain dump for today. Thoughts? regards, tom lane diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index ea9740e3f8..80b8654fc8 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -532,21 +532,22 @@ plpgsql_exec_function(PLpgSQL_function *func, FunctionCallInfo fcinfo, false); /* - * Force any array-valued parameter to be stored in - * expanded form in our local variable, in hopes of - * improving efficiency of uses of the variable. (This is - * a hack, really: why only arrays? Need more thought - * about which cases are likely to win. See also - * typisarray-specific heuristic in exec_assign_value.) - * - * Special cases: If passed a R/W expanded pointer, assume + * If it's a pass-by-reference type, check to see if we + * received a R/W expanded-object pointer. If so, assume * we can commandeer the object rather than having to copy * it. If passed a R/O expanded pointer, just keep it as * the value of the variable for the moment. (We'll force * it to R/W if the variable gets modified, but that may * very well never happen.) + * + * Also, force any flat array value to be stored in + * expanded form in our local variable, in hopes of + * improving efficiency of uses of the variable. (This is + * a hack, really: why only arrays? Need more thought + * about which cases are likely to win. See also + * typisarray-specific heuristic in exec_assign_value.) */ - if (!var->isnull && var->datatype->typisarray) + if (!var->isnull && var->datatype->typlen == -1) { if (VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value))) { @@ -561,7 +562,7 @@ plpgsql_exec_function(PLpgSQL_function *func, FunctionCallInfo fcinfo, { /* R/O pointer, keep it as-is until assigned to */ } - else + else if (var->datatype->typisarray) { /* flat array, so force to expanded form */ assign_simple_var(&estate, var,
On Tue, Oct 22, 2024 at 12:33 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
But now we also have
expanded records, and with your use-case as an example of an extension
trying to do similar things, I feel like we have enough examples to
move forward.
Great!
As far as the hack we were discussing upthread goes, I realized that
it should test for typlen == -1 not just !typbyval, since the
VARATT_IS_EXTERNAL_EXPANDED_RW test requires that there be a length
word. With that fix and some comment updates, it looks like the
attached. I'm inclined to just go ahead and push that. It won't move
the needle hugely far, but it will help, and it seems simple and safe.
I made those changes and my code works a bit faster, it looks like it takes a couple of the top level expansions out. I'll have more data in the morning.
To make more progress, there are basically two areas that we need
to look at:
1. exec_assign_value() has hard-wired knowledge that it's a good idea
to expand an array that's being assigned to a plpgsql variable, and
likewise hard-wired knowledge that calling expand_array() is how to
do that. The bit in plpgsql_exec_function() that we're patching
in the attached is the same thing, but for the initial assignment of
a function input parameter to a plpgsql variable. At the time this
was written I was quite unsure that forcible expansion would be a net
win, but the code is nine years old now and there's been few or no
performance complaints. So maybe it's okay to decide that "always
expand expandable types during assignment" is a suitable policy across
the board, and we don't need to figure out a smarter rule. It sounds
like that'd probably be a win for your application, which gives me
more faith in the idea than I would've had before.
Definitely a win, as the flattened format of my objects don't have any run time use, so there is no chance for net loss for me. I guess I'm using this feature differently from how arrays are, which have a usable flattened format so there is a need to weigh a trade off with expanding or not. In my case, only the expanded version is useful, and serializing the flat version is expensive. Formalizing something like expand_array would work well for me, as my expand_matrix function has the identical function signature and serves the exact same purpose.
So now
I'm thinking that we should steal ideas from the "prosupport" API
(see src/include/nodes/supportnodes.h) and invent a concept of a
"type support" function that can handle an extensible set of
different requests. The first one would be to pass back the
address of an expansion function comparable to expand_array(),
if the type supports being converted to an expanded object.
I'll look into the support code, I haven't seen that before.
2. Most of the performance gold is hidden in deciding when we
can optimize operations on expanded objects that look like
plpgsql_var := f(plpgsql_var, other-parameters);
by passing a R/W rather than R/O expanded pointer to f() and letting
it munge the expanded object in-place. If we fail to do that then
f() has to construct a new expanded object for its result. It's
probably still better off getting a R/O pointer than a flat object,
but nonetheless we fail to avoid a lot of data copying.
The difficulty here is that we do not want to change the normal naive
semantics of such an assignment, in particular "if f() throws an error
then the value of plpgsql_var has not been modified". This means that
we can only optimize when the R/W parameter is to be passed to the
top-level function of the expression (else, some function called later
could throw an error and ruin things). Furthermore, we need a
guarantee from f() that it will not throw an error after modifying
the value.
As things stand, plpgsql has hard-wired knowledge that
array_subscript_handler(), array_append(), and array_prepend()
are safe in that way, but it knows nothing about anything else.
So one route to making things better seems fairly clear: invent a new
prosupport request that asks whether the function is prepared to make
such a guarantee. I wonder though if you can guarantee any such thing
for your functions, when you are relying on a library that's probably
not designed with such a restriction in mind. If this won't work then
we need a new concept.
This will work for the GraphBLAS API. The expanded object in my case is really just a small box struct around a GraphBLAS "handle" which is an opaque pointer to data which I cannot mutate, only the library can change the object behind the handle. The API makes strong guarantees that it will either do the operation and return a success code or not do the operation and return an error code. It's not possible (normally) to get a corrupt or incomplete object back.
One idea I was toying with is that it doesn't matter if f()
throws an error so long as the plpgsql function is not executing
within an exception block: if the error propagates out of the plpgsql
function then we no longer care about the value of the variable.
That would very substantially weaken the requirements on how f()
is implemented. I'm not sure that we could make this work across
multiple levels of plpgsql functions (that is, if the value of the
variable ultimately resides in some outer function) but right now
that's not an issue since no plpgsql function as such would ever
promise to be safe, thus it would never receive a R/W pointer to
some outer function's variable.
The water here is pretty deep for me but I'm pretty sure I get what you're saying, I'll need to do some more studying of the plpgsql code which I've been spending the last couple of days familiarizing myself with more.
> same with:
> bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32',
> w=>bfs_vector, accum=>'min_int32');
> This matmul mutates bfs_vector, I shouldn't need to reassign it back but at
> the moment it seems necessary otherwise the mutations are lost but this
> costs a full flatten/expand cycle.
I'm not hugely excited about that. We could imagine extending
this concept to INOUT parameters of procedures, but it doesn't
seem like that buys anything except a small notational savings.
Maybe it would work to allow multiple parameters of a procedure
to be passed as R/W, whereas we're restricted to one for the
function-notation method. So possibly there's a gain there but
I'm not sure how big.
BTW, if I understand your example correctly then bfs_vector is
being passed to vxm() twice.
Yes, this is not unusual in this form of linear algebra, as multiple operations often accumulate into the same object to prevent a bunch of copying during each iteration of a given algorithm. There is also a "mask" parameter where another or the same object can be provided to either mask in or out (compliment mask) values during the accumulation phase. This is very useful for many algorithms, and good example is the Burkhardt method of Triangle Counting (https://journals.sagepub.com/doi/10.1177/1473871616666393) which in GraphBLAS boils down to:
GrB_mxm (C, A, NULL, semiring, A, A, GrB_DESC_S) ;
GrB_reduce (&ntri, NULL, monoid, C, NULL) ;
ntri /= 6 ;
In this case A is three of the parameters to mxm, the left operand, right operand, and a structural mask. This can be summed up as "A squared, masked by A", which when reduced returns the number of triangles in the graph (times 6).
This brings up an interesting
point: if we pass the first instance as R/W, and vxm() manipulates it,
then the changes would also be visible in its other parameter "w".
This is certainly not per normal semantics. A "safe" function would
have to either not have any possibility that two parameters could
refer to the same object, or be coded in a way that made it impervious
to this issue --- in your example, it couldn't look at "w" anymore
once it'd started modifying the first parameter. Is that an okay
requirement, and if not what shall we do about it?
I *think*, if I understand you correctly, that this isn't an issue for the GraphBLAS. My expanded objects are just boxes around an opaque handle, I don't actually mutate anything inside the box, and I can't see past the opaque pointer. SuiteSparse may be storing the matrix in one of many different formats, or on a GPU, or who knows, all I have is a handle to "A" which I pass to GraphBLAS methods which is the only way I can interact with them. Here's the definition of that vxm function:
It's pretty straightforward, get the arguments, and pass them to the GraphBLAS API, there is no mutable structure inside the expanded "box", just the handle.
I'm using the expanded object API to solve my two key problems, flatten an object for disk storage, expand that object (through the GraphBLAS serialize/deserialize API) and turn it into an object handle, which is secretly just a pointer of course to the internal details of the object, but I can't see or change that, only SuiteSparse can.
(btw sorry about the bad parameter names, "w" is the name from the API spec which I've been sticking to, which is the optional output object to use, if one is not passed, I create a new one, this is similar to the numpy "out" parameter semantics) .
I added some debug instrumentation that might show a bit more of what's going on for me, consider this function:
CREATE OR REPLACE FUNCTION test(graph matrix)
RETURNS matrix LANGUAGE plpgsql AS
$$
DECLARE
n bigint = nrows(graph);
m bigint = ncols(graph);
BEGIN
RETURN graph;
end;
$$;
RETURNS matrix LANGUAGE plpgsql AS
$$
DECLARE
n bigint = nrows(graph);
m bigint = ncols(graph);
BEGIN
RETURN graph;
end;
$$;
The graph passes straight through but first I call two methods to get the number rows and columns. When I run it on a graph:
postgres=# select pg_typeof(test(graph)) from test_graphs ;
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_ncols
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
pg_typeof
-----------
matrix
(1 row)
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_ncols
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
pg_typeof
-----------
matrix
(1 row)
THe matrix gets expanded twice, presumably because the object comes in flat, and both nrows() and ncols() expands it, which ends up being two separate handles and thus two separate objects to the GraphBLAS.
Here's another example:
CREATE OR REPLACE FUNCTION test2(graph matrix)
RETURNS bigint LANGUAGE plpgsql AS
$$
BEGIN
perform set_element(graph, 1, 1, 1);
RETURN nvals(graph);
end;
$$;
CREATE FUNCTION
postgres=# select test2(matrix('int32'));
DEBUG: new_matrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: DatumGetScalar
DEBUG: matrix_get_flat_size
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_scalar_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
test2
-------
0
(1 row)
RETURNS bigint LANGUAGE plpgsql AS
$$
BEGIN
perform set_element(graph, 1, 1, 1);
RETURN nvals(graph);
end;
$$;
CREATE FUNCTION
postgres=# select test2(matrix('int32'));
DEBUG: new_matrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: DatumGetScalar
DEBUG: matrix_get_flat_size
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_scalar_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
test2
-------
0
(1 row)
I would expect that to return 1. If I do "graph = set_element(graph, 1, 1, 1)" it works.
I hope that gives a bit more information about my use cases, in general I'm very happy with the API, it's very algebraic, I have a lot of interesting plans for supporting more operators and subscription syntax, but this issue is not my top priority to see if we can resolve it. I'm sure I missed something in your detailed plan so I'll be going over it some more this week. Please let me know if you have any other questions about my use case or concerns about my expectations.
Thank you!
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > Here's another example: > CREATE OR REPLACE FUNCTION test2(graph matrix) > RETURNS bigint LANGUAGE plpgsql AS > $$ > BEGIN > perform set_element(graph, 1, 1, 1); > RETURN nvals(graph); > end; > $$; > CREATE FUNCTION > postgres=# select test2(matrix('int32')); > DEBUG: new_matrix > DEBUG: matrix_get_flat_size > DEBUG: flatten_matrix > DEBUG: scalar_int32 > DEBUG: new_scalar > DEBUG: matrix_set_element > DEBUG: DatumGetMatrix > DEBUG: expand_matrix > DEBUG: new_matrix > DEBUG: DatumGetScalar > DEBUG: matrix_get_flat_size > DEBUG: matrix_get_flat_size > DEBUG: flatten_matrix > DEBUG: context_callback_matrix_free > DEBUG: context_callback_scalar_free > DEBUG: matrix_nvals > DEBUG: DatumGetMatrix > DEBUG: expand_matrix > DEBUG: new_matrix > DEBUG: context_callback_matrix_free > DEBUG: context_callback_matrix_free > test2 > ------- > 0 > (1 row) I'm a little confused by your debug output. What are "scalar_int32" and "new_scalar", and what part of the plpgsql function is causing them to be invoked? Another thing that confuses me is why there's a second flatten_matrix operation happening here. Shouldn't set_element return its result as a R/W expanded object? > I would expect that to return 1. If I do "graph = set_element(graph, 1, 1, > 1)" it works. I think you have a faulty understanding of PERFORM. It's defined as "evaluate this expression and throw away the result", so it's *not* going to change "graph", not even if set_element declares that argument as INOUT. (Our interpretation of OUT arguments for functions is that they're just an alternate notation for specifying the function result.) If you want to avoid the explicit assignment back to "graph" then the thing to do would be to declare set_element as a procedure, not a function, with an INOUT argument and then call it with CALL. That's only cosmetically different though, in that the updated "graph" value is still passed back much as if it were a function result, and then the CALL infrastructure knows it has to assign that back to the argument variable. And, as I tried to explain earlier, that code path currently has no mechanism for avoiding making a copy of the graph somewhere along the line: it will pass "graph" to the procedure as either a flat Datum or a R/O expanded object, so that set_element will be required to copy that before modifying it. We can imagine extending whatever we do for "x := f(x)" cases so that it also works during "CALL p(x)". But I think that's only going to yield cosmetic or notational improvements so I don't want to start with doing that. Let's focus first on improving the existing infrastructure for the f(x) case. regards, tom lane
I wrote: > One idea I was toying with is that it doesn't matter if f() > throws an error so long as the plpgsql function is not executing > within an exception block: if the error propagates out of the plpgsql > function then we no longer care about the value of the variable. > That would very substantially weaken the requirements on how f() > is implemented. The more I think about this idea the better I like it. We can improve on the original concept a bit: the assignment can be within an exception block so long as the target variable is too. For example, consider DECLARE x float8[]; BEGIN ... DECLARE y float8[]; BEGIN x := array_append(x, 42); y := array_append(y, 42); END; EXCEPTION WHEN ...; END; Currently, both calls of array_append are subject to R/W optimization, so that array_append must provide a strong guarantee that it won't throw an error after it's begun to change the R/W object. If we redefine things so that the optimization is applied only to "y", then AFAICS we need nothing from array_append. It only has to be sure it doesn't corrupt the object so badly that it can't be freed ... but that requirement exists already, for anything dealing with expanded objects. So this would put us in a situation where we could apply the optimization by default, which'd be a huge win. There is an exception: if we are considering x := array_cat(x, x); then I don't think we can optimize because of the aliasing problem I mentioned before. So there'd have to be a restriction that the target variable is mentioned only once in the function's arguments. For stuff like your vxm() function, that'd be annoying. But functions that need that and are willing to deal with the aliasing hazard could still provide a prosupport function that promises it's okay. What we'd accomplish is that a large fraction of interesting functions could get the benefit without having to create a prosupport function, which is a win all around. Also worth noting: in the above example, we could optimize the update on "x" too, if we know that "x" is not referenced in the block's EXCEPTION handlers. I wouldn't bother with this in the first version, but it might be worth doing later. So if we go this way, the universe of functions that can benefit from the optimization enlarges considerably, and the risk of bugs that break the optimization drops considerably. The cost is that some cases that were optimized before now will not be. But I suspect that plpgsql functions where this optimization is key probably don't contain EXCEPTION handlers at all, so that they won't notice any change. Thoughts? regards, tom lane
On Wed, Oct 23, 2024 at 8:21 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> Here's another example:
> CREATE OR REPLACE FUNCTION test2(graph matrix)
> RETURNS bigint LANGUAGE plpgsql AS
> $$
> BEGIN
> perform set_element(graph, 1, 1, 1);
> RETURN nvals(graph);
> end;
> $$;
> CREATE FUNCTION
> postgres=# select test2(matrix('int32'));
> DEBUG: new_matrix
> DEBUG: matrix_get_flat_size
> DEBUG: flatten_matrix
> DEBUG: scalar_int32
> DEBUG: new_scalar
> DEBUG: matrix_set_element
> DEBUG: DatumGetMatrix
> DEBUG: expand_matrix
> DEBUG: new_matrix
> DEBUG: DatumGetScalar
> DEBUG: matrix_get_flat_size
> DEBUG: matrix_get_flat_size
> DEBUG: flatten_matrix
> DEBUG: context_callback_matrix_free
> DEBUG: context_callback_scalar_free
> DEBUG: matrix_nvals
> DEBUG: DatumGetMatrix
> DEBUG: expand_matrix
> DEBUG: new_matrix
> DEBUG: context_callback_matrix_free
> DEBUG: context_callback_matrix_free
> test2
> -------
> 0
> (1 row)
I'm a little confused by your debug output. What are "scalar_int32"
and "new_scalar", and what part of the plpgsql function is causing
them to be invoked?
GraphBLAS scalars hold a single element value for the matrix type. Internally, they are simply 1x1 matrices (much like vectors are 1xn matrices). The function signature is:
set_element(a matrix, i bigint, j bigint, s scalar)
There is a "CAST (integer as scalar)" function (scalar_int32) that casts Postgres integers to GraphBLAS GrB_INT32 scalar elements (which calls new_scalar because like vectors and matrices, they are expanded objects which have a GrB_Scalar "handle"). Scalars are useful for working with individual values, for example reduce() returns a scalar. There are way more efficient ways to push huge C arrays of values into matrices but for now I'm just working at the element level.
Another thing that confuses me is why there's a second flatten_matrix
operation happening here. Shouldn't set_element return its result
as a R/W expanded object?
That confuses me too, and my default assumption is always that I'm doing it wrong. set_element does return a R/W object afaict, here is the return:
where:
#define OS_RETURN_MATRIX(_matrix) return EOHPGetRWDatum(&(_matrix)->hdr)
> I would expect that to return 1. If I do "graph = set_element(graph, 1, 1,
> 1)" it works.
I think you have a faulty understanding of PERFORM. It's defined as
"evaluate this expression and throw away the result", so it's *not*
going to change "graph", not even if set_element declares that
argument as INOUT.
Faulty indeed, I was going from the plpgsql statement documentation:
My understanding of "side-effects" was flawed there, but I'm fine with "x = set_element(x...)" anyway as I was trying to follow the example of array_append et al.
(Our interpretation of OUT arguments for functions
is that they're just an alternate notation for specifying the function
result.) If you want to avoid the explicit assignment back to "graph"
then the thing to do would be to declare set_element as a procedure,
not a function, with an INOUT argument and then call it with CALL.
I'll stick with the assignment.
That's only cosmetically different though, in that the updated
"graph" value is still passed back much as if it were a function
result, and then the CALL infrastructure knows it has to assign that
back to the argument variable. And, as I tried to explain earlier,
that code path currently has no mechanism for avoiding making a copy
of the graph somewhere along the line: it will pass "graph" to the
procedure as either a flat Datum or a R/O expanded object, so that
set_element will be required to copy that before modifying it.
Right, I'm still figuring out exactly what that code flow is. This is my first dive into these corners of the code so thank you for being patient with me. I promise to write up some expanded object documentation if this works!
We can imagine extending whatever we do for "x := f(x)" cases so that
it also works during "CALL p(x)". But I think that's only going to
yield cosmetic or notational improvements so I don't want to start
with doing that. Let's focus first on improving the existing
infrastructure for the f(x) case.
+1
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > On Wed, Oct 23, 2024 at 8:21 AM Tom Lane <tgl@sss.pgh.pa.us> wrote: >> Another thing that confuses me is why there's a second flatten_matrix >> operation happening here. Shouldn't set_element return its result >> as a R/W expanded object? > That confuses me too, and my default assumption is always that I'm doing it > wrong. set_element does return a R/W object afaict, here is the return: > https://github.com/OneSparse/OneSparse/blob/main/src/matrix.c#L1726 Hmph. That seems right. Can you add errbacktrace() to your logging ereports, in hopes of seeing how we're getting to flatten_matrix? Or break there with gdb for a more complete/reliable stack trace. regards, tom lane
On Wed, Oct 23, 2024 at 9:04 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
I wrote:
> One idea I was toying with is that it doesn't matter if f()
> throws an error so long as the plpgsql function is not executing
> within an exception block: if the error propagates out of the plpgsql
> function then we no longer care about the value of the variable.
> That would very substantially weaken the requirements on how f()
> is implemented.
The more I think about this idea the better I like it. We can
improve on the original concept a bit: the assignment can be
within an exception block so long as the target variable is too.
For example, consider
DECLARE x float8[];
BEGIN
...
DECLARE y float8[];
BEGIN
x := array_append(x, 42);
y := array_append(y, 42);
END;
EXCEPTION WHEN ...;
END;
Currently, both calls of array_append are subject to R/W optimization,
so that array_append must provide a strong guarantee that it won't
throw an error after it's begun to change the R/W object. If we
redefine things so that the optimization is applied only to "y",
then AFAICS we need nothing from array_append. It only has to be
sure it doesn't corrupt the object so badly that it can't be freed
... but that requirement exists already, for anything dealing with
expanded objects. So this would put us in a situation where we
could apply the optimization by default, which'd be a huge win.
Great! I can make that same guarantee.
There is an exception: if we are considering
x := array_cat(x, x);
then I don't think we can optimize because of the aliasing problem
I mentioned before. So there'd have to be a restriction that the
target variable is mentioned only once in the function's arguments.
For stuff like your vxm() function, that'd be annoying. But functions
that need that and are willing to deal with the aliasing hazard could
still provide a prosupport function that promises it's okay. What
we'd accomplish is that a large fraction of interesting functions
could get the benefit without having to create a prosupport function,
which is a win all around.
I see, I'm not completely clear on the prosupport code so let me repeat it back just so I know I'm getting it right, it looks like I'll need to write a C function, that I specify with CREATE FUNCTION ... SUPPORT, that the planner will call asking me to tell it that it's ok to alias arguments (which is fine for SuiteSparse so no problem). You also mentioned a couple emails back about a "type support" feature similar to prosupport, that would allow me to specify an eager expansion function for my types.
Also worth noting: in the above example, we could optimize the
update on "x" too, if we know that "x" is not referenced in the
block's EXCEPTION handlers. I wouldn't bother with this in the
first version, but it might be worth doing later.
So if we go this way, the universe of functions that can benefit
from the optimization enlarges considerably, and the risk of bugs
that break the optimization drops considerably. The cost is that
some cases that were optimized before now will not be. But I
suspect that plpgsql functions where this optimization is key
probably don't contain EXCEPTION handlers at all, so that they
won't notice any change.
Sounds like a good tradeoff to me! Hopefully if anyone does have concerns with this approach they'll see this thread and comment.
Thanks,
-Michel
regards, tom lane
Michel Pelletier <pelletier.michel@gmail.com> writes: > On Wed, Oct 23, 2024 at 9:04 AM Tom Lane <tgl@sss.pgh.pa.us> wrote: >> For stuff like your vxm() function, that'd be annoying. But functions >> that need that and are willing to deal with the aliasing hazard could >> still provide a prosupport function that promises it's okay. What >> we'd accomplish is that a large fraction of interesting functions >> could get the benefit without having to create a prosupport function, >> which is a win all around. > I see, I'm not completely clear on the prosupport code so let me repeat it > back just so I know I'm getting it right, it looks like I'll need to write > a C function, that I specify with CREATE FUNCTION ... SUPPORT, that the > planner will call asking me to tell it that it's ok to alias arguments > (which is fine for SuiteSparse so no problem). You also mentioned a couple > emails back about a "type support" feature similar to prosupport, that > would allow me to specify an eager expansion function for my types. Right. You could omit the support function for anything where aliasing is impossible (e.g., there's only one matrix argument). Also, very possibly multiple functions could share the same support function. >> Also worth noting: in the above example, we could optimize the >> update on "x" too, if we know that "x" is not referenced in the >> block's EXCEPTION handlers. I wouldn't bother with this in the >> first version, but it might be worth doing later. Aside: I'm less excited about that than I was earlier. It's not wrong for the example I gave, but in more general cases it doesn't work: DECLARE x float8[]; BEGIN ... BEGIN x := array_append(x, 42); EXCEPTION WHEN ...; END; // we might use x here END; So for an "x" in a further-out DECLARE section, we'd have to track not only whether x is used in the EXCEPTION clause, but whether it's used anyplace that can be reached after the exception block ends. That's starting to sound overly complicated for the benefit. >> So if we go this way, the universe of functions that can benefit >> from the optimization enlarges considerably, and the risk of bugs >> that break the optimization drops considerably. The cost is that >> some cases that were optimized before now will not be. > Sounds like a good tradeoff to me! Hopefully if anyone does have concerns > with this approach they'll see this thread and comment. I realized that we could suppress any complaints of that ilk by creating prosupport functions for the three built-in functions that are handled today, allowing them to operate on the same rules as now. I might not bother with that on purely performance grounds, but it's likely worth doing simply to provide an in-core test case for the code path where a prosupport function can be called. I'm still writing up details, but right now I'm envisioning completely separate sets of rules for the prosupport case versus the no-prosupport case. regards, tom lane
I wrote: > ... I'm still writing up > details, but right now I'm envisioning completely separate sets of > rules for the prosupport case versus the no-prosupport case. So here is the design I've come up with for optimizing R/W expanded object updates in plpgsql without any special knowledge from a prosupport function. AFAICS this requires no assumptions at all about the behavior of called functions, other than the bare minimum "you can't corrupt the object to the point where it wouldn't be cleanly free-able". In particular that means it can work for user-written called functions in plpgsql, SQL, or whatever, not only for C-coded functions. There are two requirements to apply the optimization: * If the assignment statement is within a BEGIN ... EXCEPTION block, its target variable must be declared inside the most-closely-nested such block. This ensures that if an error is thrown from within the assignment statement's expression, we do not care about the value of the target variable, except to the extent of being able to clean it up. * The target variable must be referenced exactly once within the RHS expression. This avoids aliasing hazards such as we discussed upthread. But unlike the existing rule, that reference can be anywhere in the RHS --- it doesn't have to be an argument of the topmost function. So for example we can optimize foo := fee(fi(fo(fum(foo), other_variable), ...)); While I've not tried to write any code yet, I think both of these conditions should be reasonably easy to verify. Given that those conditions are met and the current value of the assignment target variable is a R/W expanded pointer, we can execute the assignment as follows: * Provide the R/W expanded pointer as the value of the Param representing the sole reference. At the same time, apply TransferExpandedObject to reparent the object under the transient eval_mcontext memory context that's being used to evaluate the RHS expression, and then set the target variable's value to NULL. (At this point, the R/W object has exactly the same logical status as any intermediate calculation result that's palloc'd in the eval_mcontext.) * At successful completion of the RHS, assign the result to the target variable normally. This includes, if it's an R/W expanded object, reparenting it under the calling function's main context. If the RHS expression winds up returning the same expanded object (probably mutated), then the outer function regains ownership of it after no more than a couple of TransferExpandedObject calls, which are cheap. If the RHS returns something different, then either the original expanded object got discarded during RHS evaluation, or it will be cleaned up when we reset the eval_mcontext, so that it won't be leaked. I didn't originally foresee the need to transfer the object into the transient memory context. But this design avoids any possibility of double-free attempts, which would be likely to happen if we allow the outer function's variable to hold onto a reference to the object while the RHS is evaluated. A function receiving an R/W reference is entitled to assume that that value is not otherwise referenced and can be freed when it's no longer needed, so it might well get freed during RHS evaluation. By converting the original R/W object into (effectively) a temporary value within the RHS evaluation, we make that assumption valid. So, while this design greatly expands the set of cases we can optimize, it does lose some cases that the old approach could support. I envision addressing that by allowing a prosupport function attached to the RHS' topmost function to "bless" other cases as safe, using reasoning similar to the old rules. (Or different rules, even, but it's on the prosupport function to be sure it's safe.) I don't have a detailed design in mind, but I'm thinking along the lines of just passing the whole RHS expression to the prosupport function and letting it decide what's safe. In any case, we don't need to even call the prosupport function unless there's an exception block or multiple RHS references to the target variable. regards, tom lane
Here's two backtraces from gdb from this function:
CREATE OR REPLACE FUNCTION test2(graph matrix)
RETURNS bigint LANGUAGE plpgsql AS
$$
BEGIN
perform set_element(graph, 1, 1, 1);
RETURN nvals(graph);
end;
$$;
RETURNS bigint LANGUAGE plpgsql AS
$$
BEGIN
perform set_element(graph, 1, 1, 1);
RETURN nvals(graph);
end;
$$;
Both traces are in that file split on the hyphens line 44. I'm still puzzling it out, could it have something to do with the volatility of the functions? I'm not entirely clear on how to interpret function volatility with expanded objects, nvals is STRICT, but set_element is STABLE. I think my logic there was because it's a mutation. This is likely another misunderstanding of mine.
-Michel
On Wed, Oct 23, 2024 at 7:10 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> On Wed, Oct 23, 2024 at 8:21 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Another thing that confuses me is why there's a second flatten_matrix
>> operation happening here. Shouldn't set_element return its result
>> as a R/W expanded object?
> That confuses me too, and my default assumption is always that I'm doing it
> wrong. set_element does return a R/W object afaict, here is the return:
> https://github.com/OneSparse/OneSparse/blob/main/src/matrix.c#L1726
Hmph. That seems right. Can you add errbacktrace() to your logging
ereports, in hopes of seeing how we're getting to flatten_matrix?
Or break there with gdb for a more complete/reliable stack trace.
regards, tom lane
On Thu, Oct 24, 2024 at 11:32 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
I wrote:
> ... I'm still writing up
> details, but right now I'm envisioning completely separate sets of
> rules for the prosupport case versus the no-prosupport case.
So here is the design I've come up with for optimizing R/W expanded
object updates in plpgsql without any special knowledge from a
prosupport function. AFAICS this requires no assumptions at all
about the behavior of called functions, other than the bare minimum
"you can't corrupt the object to the point where it wouldn't be
cleanly free-able". In particular that means it can work for
user-written called functions in plpgsql, SQL, or whatever, not
only for C-coded functions.
Great, I checked with the upstream library authors and they verified that the object can't be corrupted to where it can't be freed. Since my expanded objects are just a box around a library handle, I use a MemoryContext callback to call the library free function when the context cleans up, and we can't think of a path where that will fail.
There are two requirements to apply the optimization:
* If the assignment statement is within a BEGIN ... EXCEPTION block,
its target variable must be declared inside the most-closely-nested
such block. This ensures that if an error is thrown from within the
assignment statement's expression, we do not care about the value
of the target variable, except to the extent of being able to clean
it up.
My users are writing algebraic expressions to be done in bulk on GPUs, etc. I don't think I have to worry too much about wrapping stuff in exception blocks while handling my library objects.
<snip>
While I've not tried to write any code yet, I think both of these
conditions should be reasonably easy to verify.
Given that those conditions are met and the current value of the
assignment target variable is a R/W expanded pointer, we can
execute the assignment as follows:
<snip>
So, while this design greatly expands the set of cases we can
optimize, it does lose some cases that the old approach could
support. I envision addressing that by allowing a prosupport
function attached to the RHS' topmost function to "bless"
other cases as safe, using reasoning similar to the old rules.
(Or different rules, even, but it's on the prosupport function
to be sure it's safe.) I don't have a detailed design in mind,
but I'm thinking along the lines of just passing the whole RHS
expression to the prosupport function and letting it decide
what's safe. In any case, we don't need to even call the
prosupport function unless there's an exception block or
multiple RHS references to the target variable.
That all sounds great, and it sounds like my prosupport function just needs to return true, or set some kind of flag saying aliasing is ok. I'd like to help as much as possible, but some of that reparenting stuff was pretty deep for me, other than being a quick sanity check case, is there anything I can do to help?
regards, tom lane
Michel Pelletier <pelletier.michel@gmail.com> writes: > Here's two backtraces from gdb from this function: > CREATE OR REPLACE FUNCTION test2(graph matrix) > RETURNS bigint LANGUAGE plpgsql AS > $$ > BEGIN > perform set_element(graph, 1, 1, 1); > RETURN nvals(graph); > end; > $$; Well, you shouldn't be using PERFORM. Not only does it not do the right thing, but it's not optimized for expanded objects at all: they'll get flattened both on the way into the statement and on the way out. Try it with graph := set_element(graph, 1, 1, 1); RETURN nvals(graph); regards, tom lane
Michel Pelletier <pelletier.michel@gmail.com> writes: > That all sounds great, and it sounds like my prosupport function just needs > to return true, or set some kind of flag saying aliasing is ok. I'd like > to help as much as possible, but some of that reparenting stuff was pretty > deep for me, other than being a quick sanity check case, is there anything > I can do to help? I wasn't expecting you to write the code, but if you can test it and see how much it helps your use-case, that would be great. Here is a v1 patch series that does the first part of what we've been talking about, which is to implement the new optimization rule for the case of a single RHS reference to the target variable. I'm curious to know if it helps you at all by itself. You'll definitely also need commit 534d0ea6c, so probably applying these on our git master branch would be the place to start. regards, tom lane From e207d9d377a75181f5bf655ffb81ef53328d3ea3 Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Wed, 30 Oct 2024 15:57:35 -0400 Subject: [PATCH v1 1/3] Preliminary refactoring. This short and boring patch simply moves the responsibility for initializing PLpgSQL_expr.target_param into plpgsql parsing, rather than doing it at first execution of the expr as before. This doesn't save anything in terms of runtime, since the work was trivial and done only once per expr anyway. But it makes the info available during parsing, which will be useful for the next step. Likewise set PLpgSQL_expr.func during parsing. According to the comments, this was once impossible; but it's certainly possible since we invented the plpgsql_curr_compile variable. Again, this saves little runtime, but it seems far cleaner conceptually. While at it, I reordered stuff in struct PLpgSQL_expr to make it clearer which fields are filled when, and merged some duplicative code in pl_gram.y. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/pl/plpgsql/src/pl_exec.c | 27 --------------- src/pl/plpgsql/src/pl_gram.y | 65 ++++++++++++++++++++++++------------ src/pl/plpgsql/src/plpgsql.h | 31 +++++++++-------- 3 files changed, 62 insertions(+), 61 deletions(-) diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index 86c5bd324a..ce7cdb6458 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -4174,12 +4174,6 @@ exec_prepare_plan(PLpgSQL_execstate *estate, SPIPlanPtr plan; SPIPrepareOptions options; - /* - * The grammar can't conveniently set expr->func while building the parse - * tree, so make sure it's set before parser hooks need it. - */ - expr->func = estate->func; - /* * Generate and save the plan */ @@ -5016,21 +5010,7 @@ exec_assign_expr(PLpgSQL_execstate *estate, PLpgSQL_datum *target, * If first time through, create a plan for this expression. */ if (expr->plan == NULL) - { - /* - * Mark the expression as being an assignment source, if target is a - * simple variable. (This is a bit messy, but it seems cleaner than - * modifying the API of exec_prepare_plan for the purpose. We need to - * stash the target dno into the expr anyway, so that it will be - * available if we have to replan.) - */ - if (target->dtype == PLPGSQL_DTYPE_VAR) - expr->target_param = target->dno; - else - expr->target_param = -1; /* should be that already */ - exec_prepare_plan(estate, expr, 0); - } value = exec_eval_expr(estate, expr, &isnull, &valtype, &valtypmod); exec_assign_value(estate, target, value, isnull, valtype, valtypmod); @@ -6282,13 +6262,6 @@ setup_param_list(PLpgSQL_execstate *estate, PLpgSQL_expr *expr) * that they are interrupting an active use of parameters. */ paramLI->parserSetupArg = (void *) expr; - - /* - * Also make sure this is set before parser hooks need it. There is - * no need to save and restore, since the value is always correct once - * set. (Should be set already, but let's be sure.) - */ - expr->func = estate->func; } else { diff --git a/src/pl/plpgsql/src/pl_gram.y b/src/pl/plpgsql/src/pl_gram.y index 8182ce28aa..5431977d69 100644 --- a/src/pl/plpgsql/src/pl_gram.y +++ b/src/pl/plpgsql/src/pl_gram.y @@ -64,6 +64,10 @@ static bool tok_is_keyword(int token, union YYSTYPE *lval, static void word_is_not_variable(PLword *word, int location); static void cword_is_not_variable(PLcword *cword, int location); static void current_token_is_not_variable(int tok); +static PLpgSQL_expr *make_plpgsql_expr(const char *query, + RawParseMode parsemode); +static void expr_is_assignment_source(PLpgSQL_expr *expr, + PLpgSQL_datum *target); static PLpgSQL_expr *read_sql_construct(int until, int until2, int until3, @@ -529,6 +533,10 @@ decl_statement : decl_varname decl_const decl_datatype decl_collate decl_notnull errmsg("variable \"%s\" must have a default value, since it's declared NOT NULL", var->refname), parser_errposition(@5))); + + if (var->default_val != NULL) + expr_is_assignment_source(var->default_val, + (PLpgSQL_datum *) var); } | decl_varname K_ALIAS K_FOR decl_aliasitem ';' { @@ -987,6 +995,7 @@ stmt_assign : T_DATUM pmode, false, true, NULL, NULL); + expr_is_assignment_source(new->expr, $1.datum); $$ = (PLpgSQL_stmt *) new; } @@ -2637,6 +2646,38 @@ current_token_is_not_variable(int tok) yyerror("syntax error"); } +/* Convenience routine to construct a PLpgSQL_expr struct */ +static PLpgSQL_expr * +make_plpgsql_expr(const char *query, + RawParseMode parsemode) +{ + PLpgSQL_expr *expr = palloc0(sizeof(PLpgSQL_expr)); + + expr->query = pstrdup(query); + expr->parseMode = parsemode; + expr->func = plpgsql_curr_compile; + expr->ns = plpgsql_ns_top(); + /* might get changed later during parsing: */ + expr->target_param = -1; + /* other fields are left as zeroes until first execution */ + return expr; +} + +/* Mark a PLpgSQL_expr as being the source of an assignment to target */ +static void +expr_is_assignment_source(PLpgSQL_expr *expr, PLpgSQL_datum *target) +{ + /* + * Mark the expression as being an assignment source, if target is a + * simple variable. We don't currently support optimized assignments to + * other DTYPEs. + */ + if (target->dtype == PLPGSQL_DTYPE_VAR) + expr->target_param = target->dno; + else + expr->target_param = -1; /* should be that already */ +} + /* Convenience routine to read an expression with one possible terminator */ static PLpgSQL_expr * read_sql_expression(int until, const char *expected) @@ -2774,13 +2815,7 @@ read_sql_construct(int until, */ plpgsql_append_source_text(&ds, startlocation, endlocation); - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = parsemode; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, parsemode); pfree(ds.data); if (valid_sql) @@ -3102,13 +3137,7 @@ make_execsql_stmt(int firsttoken, int location, PLword *word) while (ds.len > 0 && scanner_isspace(ds.data[ds.len - 1])) ds.data[--ds.len] = '\0'; - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = RAW_PARSE_DEFAULT; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, RAW_PARSE_DEFAULT); pfree(ds.data); check_sql_expr(expr->query, expr->parseMode, location); @@ -3980,13 +4009,7 @@ read_cursor_args(PLpgSQL_var *cursor, int until) appendStringInfoString(&ds, ", "); } - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = RAW_PARSE_PLPGSQL_EXPR; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, RAW_PARSE_PLPGSQL_EXPR); pfree(ds.data); /* Next we'd better find the until token */ diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index 50c3b28472..fbb6000caa 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -219,14 +219,22 @@ typedef struct PLpgSQL_expr { char *query; /* query string, verbatim from function body */ RawParseMode parseMode; /* raw_parser() mode to use */ - SPIPlanPtr plan; /* plan, or NULL if not made yet */ - Bitmapset *paramnos; /* all dnos referenced by this query */ + struct PLpgSQL_function *func; /* function containing this expr */ + struct PLpgSQL_nsitem *ns; /* namespace chain visible to this expr */ - /* function containing this expr (not set until we first parse query) */ - struct PLpgSQL_function *func; + /* + * These fields are used to help optimize assignments to expanded-datum + * variables. If this expression is the source of an assignment to a + * simple variable, target_param holds that variable's dno (else it's -1). + */ + int target_param; /* dno of assign target, or -1 if none */ - /* namespace chain visible to this expr */ - struct PLpgSQL_nsitem *ns; + /* + * Fields above are set during plpgsql parsing. Remaining fields are left + * as zeroes/NULLs until we first parse/plan the query. + */ + SPIPlanPtr plan; /* plan, or NULL if not made yet */ + Bitmapset *paramnos; /* all dnos referenced by this query */ /* fields for "simple expression" fast-path execution: */ Expr *expr_simple_expr; /* NULL means not a simple expr */ @@ -235,14 +243,11 @@ typedef struct PLpgSQL_expr bool expr_simple_mutable; /* true if simple expr is mutable */ /* - * These fields are used to optimize assignments to expanded-datum - * variables. If this expression is the source of an assignment to a - * simple variable, target_param holds that variable's dno; else it's -1. - * If we match a Param within expr_simple_expr to such a variable, that - * Param's address is stored in expr_rw_param; then expression code - * generation will allow the value for that Param to be passed read/write. + * If we match a Param within expr_simple_expr to the variable identified + * by target_param, that Param's address is stored in expr_rw_param; then + * expression code generation will allow the value for that Param to be + * passed as a read/write expanded-object pointer. */ - int target_param; /* dno of assign target, or -1 if none */ Param *expr_rw_param; /* read/write Param within expr, if any */ /* -- 2.43.5 From dc1abe9852ba02b294509c3331dd2813901f841b Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Wed, 30 Oct 2024 16:03:06 -0400 Subject: [PATCH v1 2/3] Detect whether plpgsql assignment targets are "local" variables. Mark whether the target of a potentially optimizable assignment is "local", in the sense of being declared inside any exception block that could trap an error thrown from the assignment. (This implies that we needn't preserve the variable's value in case of an error.) Normally, this requires a post-parsing scan of the function's parse tree, since we don't know while parsing a BEGIN ... construct whether we will find EXCEPTION at its end. However, if there are no BEGIN ... EXCEPTION blocks in the function at all, then all assignments are local, even those to variables representing function arguments. We optimize that common case by initializing the target_is_local flags to "true", and fixing them up with a post-scan only if we found EXCEPTION. The scan is implemented by code that's largely copied-and-pasted from the nearby code to scan a plpgsql parse tree for deletion. It's a bit annoying to have three copies of that now, but I'm not seeing a way to refactor it that would save much code on net. Note that variables' default-value expressions are never interesting for expanded-variable optimization, since they couldn't contain a reference to the target variable anyway. But the code is set up to compute their target_param and target_is_local correctly anyway, for consistency and in case someone thinks of a use for that data. I added a bit of plpgsql_dumptree support to help verify that this code sets the flags as expected. I'm not set on keeping that, but I do want to keep the addition of a plpgsql_dumptree call in plpgsql_compile_inline. It's at best an oversight that "#option dump" doesn't work in a DO block. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/pl/plpgsql/src/pl_comp.c | 12 + src/pl/plpgsql/src/pl_funcs.c | 398 ++++++++++++++++++++++++++++++++++ src/pl/plpgsql/src/pl_gram.y | 15 ++ src/pl/plpgsql/src/plpgsql.h | 7 +- 4 files changed, 431 insertions(+), 1 deletion(-) diff --git a/src/pl/plpgsql/src/pl_comp.c b/src/pl/plpgsql/src/pl_comp.c index 5633e3c790..c1ca93fe21 100644 --- a/src/pl/plpgsql/src/pl_comp.c +++ b/src/pl/plpgsql/src/pl_comp.c @@ -365,6 +365,7 @@ do_compile(FunctionCallInfo fcinfo, function->nstatements = 0; function->requires_procedure_resowner = false; + function->has_exception_block = false; /* * Initialize the compiler, particularly the namespace stack. The @@ -806,6 +807,9 @@ do_compile(FunctionCallInfo fcinfo, plpgsql_finish_datums(function); + if (function->has_exception_block) + plpgsql_mark_local_assignment_targets(function); + /* Debug dump for completed functions */ if (plpgsql_DumpExecTree) plpgsql_dumptree(function); @@ -899,6 +903,7 @@ plpgsql_compile_inline(char *proc_source) function->nstatements = 0; function->requires_procedure_resowner = false; + function->has_exception_block = false; plpgsql_ns_init(); plpgsql_ns_push(func_name, PLPGSQL_LABEL_BLOCK); @@ -956,6 +961,13 @@ plpgsql_compile_inline(char *proc_source) plpgsql_finish_datums(function); + if (function->has_exception_block) + plpgsql_mark_local_assignment_targets(function); + + /* Debug dump for completed functions */ + if (plpgsql_DumpExecTree) + plpgsql_dumptree(function); + /* * Pop the error context stack */ diff --git a/src/pl/plpgsql/src/pl_funcs.c b/src/pl/plpgsql/src/pl_funcs.c index eeb7c4d7c0..889377fc9a 100644 --- a/src/pl/plpgsql/src/pl_funcs.c +++ b/src/pl/plpgsql/src/pl_funcs.c @@ -333,6 +333,401 @@ plpgsql_getdiag_kindname(PLpgSQL_getdiag_kind kind) } +/********************************************************************** + * Mark assignment source expressions that have local target variables, + * that is, variables declared within the exception block most closely + * containing the assignment itself. (Such target variables need not be + * preserved if the assignment's source expression raises an error, + * allowing better optimization.) + * + * This code need not be called if the plpgsql function contains no exception + * blocks, because expr_is_assignment_source() will have set all the flags + * to true already. Also, we need not examine default-value expressions for + * variables, because variable declarations are necessarily within the nearest + * exception block. (In DECLARE ... BEGIN ... EXCEPTION ... END, the variable + * initializations are done before entering the exception scope.) So it's + * sufficient to find assignment statements. + * + * Within the recursion, local_dnos is a Bitmapset of dnos of variables + * known to be declared within the current exception level. + **********************************************************************/ +static void mark_stmt(PLpgSQL_stmt *stmt, Bitmapset *local_dnos); +static void mark_block(PLpgSQL_stmt_block *block, Bitmapset *local_dnos); +static void mark_assign(PLpgSQL_stmt_assign *stmt, Bitmapset *local_dnos); +static void mark_if(PLpgSQL_stmt_if *stmt, Bitmapset *local_dnos); +static void mark_case(PLpgSQL_stmt_case *stmt, Bitmapset *local_dnos); +static void mark_loop(PLpgSQL_stmt_loop *stmt, Bitmapset *local_dnos); +static void mark_while(PLpgSQL_stmt_while *stmt, Bitmapset *local_dnos); +static void mark_fori(PLpgSQL_stmt_fori *stmt, Bitmapset *local_dnos); +static void mark_fors(PLpgSQL_stmt_fors *stmt, Bitmapset *local_dnos); +static void mark_forc(PLpgSQL_stmt_forc *stmt, Bitmapset *local_dnos); +static void mark_foreach_a(PLpgSQL_stmt_foreach_a *stmt, Bitmapset *local_dnos); +static void mark_exit(PLpgSQL_stmt_exit *stmt, Bitmapset *local_dnos); +static void mark_return(PLpgSQL_stmt_return *stmt, Bitmapset *local_dnos); +static void mark_return_next(PLpgSQL_stmt_return_next *stmt, Bitmapset *local_dnos); +static void mark_return_query(PLpgSQL_stmt_return_query *stmt, Bitmapset *local_dnos); +static void mark_raise(PLpgSQL_stmt_raise *stmt, Bitmapset *local_dnos); +static void mark_assert(PLpgSQL_stmt_assert *stmt, Bitmapset *local_dnos); +static void mark_execsql(PLpgSQL_stmt_execsql *stmt, Bitmapset *local_dnos); +static void mark_dynexecute(PLpgSQL_stmt_dynexecute *stmt, Bitmapset *local_dnos); +static void mark_dynfors(PLpgSQL_stmt_dynfors *stmt, Bitmapset *local_dnos); +static void mark_getdiag(PLpgSQL_stmt_getdiag *stmt, Bitmapset *local_dnos); +static void mark_open(PLpgSQL_stmt_open *stmt, Bitmapset *local_dnos); +static void mark_fetch(PLpgSQL_stmt_fetch *stmt, Bitmapset *local_dnos); +static void mark_close(PLpgSQL_stmt_close *stmt, Bitmapset *local_dnos); +static void mark_perform(PLpgSQL_stmt_perform *stmt, Bitmapset *local_dnos); +static void mark_call(PLpgSQL_stmt_call *stmt, Bitmapset *local_dnos); +static void mark_commit(PLpgSQL_stmt_commit *stmt, Bitmapset *local_dnos); +static void mark_rollback(PLpgSQL_stmt_rollback *stmt, Bitmapset *local_dnos); + + +static void +mark_stmt(PLpgSQL_stmt *stmt, Bitmapset *local_dnos) +{ + switch (stmt->cmd_type) + { + case PLPGSQL_STMT_BLOCK: + mark_block((PLpgSQL_stmt_block *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ASSIGN: + mark_assign((PLpgSQL_stmt_assign *) stmt, local_dnos); + break; + case PLPGSQL_STMT_IF: + mark_if((PLpgSQL_stmt_if *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CASE: + mark_case((PLpgSQL_stmt_case *) stmt, local_dnos); + break; + case PLPGSQL_STMT_LOOP: + mark_loop((PLpgSQL_stmt_loop *) stmt, local_dnos); + break; + case PLPGSQL_STMT_WHILE: + mark_while((PLpgSQL_stmt_while *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORI: + mark_fori((PLpgSQL_stmt_fori *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORS: + mark_fors((PLpgSQL_stmt_fors *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORC: + mark_forc((PLpgSQL_stmt_forc *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FOREACH_A: + mark_foreach_a((PLpgSQL_stmt_foreach_a *) stmt, local_dnos); + break; + case PLPGSQL_STMT_EXIT: + mark_exit((PLpgSQL_stmt_exit *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN: + mark_return((PLpgSQL_stmt_return *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN_NEXT: + mark_return_next((PLpgSQL_stmt_return_next *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN_QUERY: + mark_return_query((PLpgSQL_stmt_return_query *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RAISE: + mark_raise((PLpgSQL_stmt_raise *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ASSERT: + mark_assert((PLpgSQL_stmt_assert *) stmt, local_dnos); + break; + case PLPGSQL_STMT_EXECSQL: + mark_execsql((PLpgSQL_stmt_execsql *) stmt, local_dnos); + break; + case PLPGSQL_STMT_DYNEXECUTE: + mark_dynexecute((PLpgSQL_stmt_dynexecute *) stmt, local_dnos); + break; + case PLPGSQL_STMT_DYNFORS: + mark_dynfors((PLpgSQL_stmt_dynfors *) stmt, local_dnos); + break; + case PLPGSQL_STMT_GETDIAG: + mark_getdiag((PLpgSQL_stmt_getdiag *) stmt, local_dnos); + break; + case PLPGSQL_STMT_OPEN: + mark_open((PLpgSQL_stmt_open *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FETCH: + mark_fetch((PLpgSQL_stmt_fetch *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CLOSE: + mark_close((PLpgSQL_stmt_close *) stmt, local_dnos); + break; + case PLPGSQL_STMT_PERFORM: + mark_perform((PLpgSQL_stmt_perform *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CALL: + mark_call((PLpgSQL_stmt_call *) stmt, local_dnos); + break; + case PLPGSQL_STMT_COMMIT: + mark_commit((PLpgSQL_stmt_commit *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ROLLBACK: + mark_rollback((PLpgSQL_stmt_rollback *) stmt, local_dnos); + break; + default: + elog(ERROR, "unrecognized cmd_type: %d", stmt->cmd_type); + break; + } +} + +static void +mark_stmts(List *stmts, Bitmapset *local_dnos) +{ + ListCell *s; + + foreach(s, stmts) + { + mark_stmt((PLpgSQL_stmt *) lfirst(s), local_dnos); + } +} + +static void +mark_block(PLpgSQL_stmt_block *block, Bitmapset *local_dnos) +{ + if (block->exceptions) + { + ListCell *e; + + /* + * The block creates a new exception scope, so variables declared at + * outer levels are nonlocal. For that matter, so are any variables + * declared in the block's DECLARE section. Hence, we must pass down + * empty local_dnos. + */ + mark_stmts(block->body, NULL); + + foreach(e, block->exceptions->exc_list) + { + PLpgSQL_exception *exc = (PLpgSQL_exception *) lfirst(e); + + mark_stmts(exc->action, NULL); + } + } + else + { + /* + * Otherwise, the block does not create a new exception scope, and any + * variables it declares can also be considered local within it. Note + * that only initializable datum types (VAR, REC) are included in + * initvarnos; but that's sufficient for our purposes. + */ + local_dnos = bms_copy(local_dnos); + for (int i = 0; i < block->n_initvars; i++) + local_dnos = bms_add_member(local_dnos, block->initvarnos[i]); + mark_stmts(block->body, local_dnos); + bms_free(local_dnos); + } +} + +static void +mark_assign(PLpgSQL_stmt_assign *stmt, Bitmapset *local_dnos) +{ + PLpgSQL_expr *expr = stmt->expr; + + /* + * If the assignment target is a plain DTYPE_VAR datum, mark it as local + * or not. (If it's not a VAR, we don't care.) + */ + if (expr->target_param >= 0) + expr->target_is_local = bms_is_member(expr->target_param, local_dnos); +} + +static void +mark_if(PLpgSQL_stmt_if *stmt, Bitmapset *local_dnos) +{ + ListCell *l; + + /* stmt->cond cannot be an assignment source */ + mark_stmts(stmt->then_body, local_dnos); + foreach(l, stmt->elsif_list) + { + PLpgSQL_if_elsif *elif = (PLpgSQL_if_elsif *) lfirst(l); + + /* elif->cond cannot be an assignment source */ + mark_stmts(elif->stmts, local_dnos); + } + mark_stmts(stmt->else_body, local_dnos); +} + +static void +mark_case(PLpgSQL_stmt_case *stmt, Bitmapset *local_dnos) +{ + ListCell *l; + + /* stmt->t_expr cannot be an assignment source */ + foreach(l, stmt->case_when_list) + { + PLpgSQL_case_when *cwt = (PLpgSQL_case_when *) lfirst(l); + + /* cwt->expr cannot be an assignment source */ + mark_stmts(cwt->stmts, local_dnos); + } + mark_stmts(stmt->else_stmts, local_dnos); +} + +static void +mark_loop(PLpgSQL_stmt_loop *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_while(PLpgSQL_stmt_while *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_fori(PLpgSQL_stmt_fori *stmt, Bitmapset *local_dnos) +{ + /* stmt->lower, upper, step cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_fors(PLpgSQL_stmt_fors *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->query cannot be an assignment source */ +} + +static void +mark_forc(PLpgSQL_stmt_forc *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->argquery cannot be an assignment source */ +} + +static void +mark_foreach_a(PLpgSQL_stmt_foreach_a *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_open(PLpgSQL_stmt_open *stmt, Bitmapset *local_dnos) +{ + /* stmt->argquery, query, dynquery cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_fetch(PLpgSQL_stmt_fetch *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_close(PLpgSQL_stmt_close *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_perform(PLpgSQL_stmt_perform *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_call(PLpgSQL_stmt_call *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_commit(PLpgSQL_stmt_commit *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_rollback(PLpgSQL_stmt_rollback *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_exit(PLpgSQL_stmt_exit *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond cannot be an assignment source */ +} + +static void +mark_return(PLpgSQL_stmt_return *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_return_next(PLpgSQL_stmt_return_next *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_return_query(PLpgSQL_stmt_return_query *stmt, Bitmapset *local_dnos) +{ + /* stmt->query, dynquery cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_raise(PLpgSQL_stmt_raise *stmt, Bitmapset *local_dnos) +{ + /* stmt->params cannot contain an assignment source */ + /* stmt->options cannot contain an assignment source */ +} + +static void +mark_assert(PLpgSQL_stmt_assert *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond, message cannot be an assignment source */ +} + +static void +mark_execsql(PLpgSQL_stmt_execsql *stmt, Bitmapset *local_dnos) +{ + /* stmt->sqlstmt cannot be an assignment source */ +} + +static void +mark_dynexecute(PLpgSQL_stmt_dynexecute *stmt, Bitmapset *local_dnos) +{ + /* stmt->query cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_dynfors(PLpgSQL_stmt_dynfors *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->query cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_getdiag(PLpgSQL_stmt_getdiag *stmt, Bitmapset *local_dnos) +{ +} + +void +plpgsql_mark_local_assignment_targets(PLpgSQL_function *func) +{ + Bitmapset *local_dnos; + + /* Function parameters can be treated as local targets at outer level */ + local_dnos = NULL; + for (int i = 0; i < func->fn_nargs; i++) + local_dnos = bms_add_member(local_dnos, func->fn_argvarnos[i]); + if (func->action) + mark_block(func->action, local_dnos); + bms_free(local_dnos); +} + + /********************************************************************** * Release memory when a PL/pgSQL function is no longer needed * @@ -1594,6 +1989,9 @@ static void dump_expr(PLpgSQL_expr *expr) { printf("'%s'", expr->query); + if (expr->target_param >= 0) + printf(" target %d%s", expr->target_param, + expr->target_is_local ? " (local)" : ""); } void diff --git a/src/pl/plpgsql/src/pl_gram.y b/src/pl/plpgsql/src/pl_gram.y index 5431977d69..ddbfda8388 100644 --- a/src/pl/plpgsql/src/pl_gram.y +++ b/src/pl/plpgsql/src/pl_gram.y @@ -2314,6 +2314,8 @@ exception_sect : PLpgSQL_exception_block *new = palloc(sizeof(PLpgSQL_exception_block)); PLpgSQL_variable *var; + plpgsql_curr_compile->has_exception_block = true; + var = plpgsql_build_variable("sqlstate", lineno, plpgsql_build_datatype(TEXTOID, -1, @@ -2659,6 +2661,7 @@ make_plpgsql_expr(const char *query, expr->ns = plpgsql_ns_top(); /* might get changed later during parsing: */ expr->target_param = -1; + expr->target_is_local = false; /* other fields are left as zeroes until first execution */ return expr; } @@ -2673,9 +2676,21 @@ expr_is_assignment_source(PLpgSQL_expr *expr, PLpgSQL_datum *target) * other DTYPEs. */ if (target->dtype == PLPGSQL_DTYPE_VAR) + { expr->target_param = target->dno; + + /* + * For now, assume the target is local to the nearest enclosing + * exception block. That's correct if the function contains no + * exception blocks; otherwise we'll update this later. + */ + expr->target_is_local = true; + } else + { expr->target_param = -1; /* should be that already */ + expr->target_is_local = false; /* ditto */ + } } /* Convenience routine to read an expression with one possible terminator */ diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index fbb6000caa..c6fadc5660 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -225,9 +225,12 @@ typedef struct PLpgSQL_expr /* * These fields are used to help optimize assignments to expanded-datum * variables. If this expression is the source of an assignment to a - * simple variable, target_param holds that variable's dno (else it's -1). + * simple variable, target_param holds that variable's dno (else it's -1), + * and target_is_local indicates whether the target is declared inside the + * closest exception block containing the assignment. */ int target_param; /* dno of assign target, or -1 if none */ + bool target_is_local; /* is it within nearest exception block? */ /* * Fields above are set during plpgsql parsing. Remaining fields are left @@ -1014,6 +1017,7 @@ typedef struct PLpgSQL_function /* data derived while parsing body */ unsigned int nstatements; /* counter for assigning stmtids */ bool requires_procedure_resowner; /* contains CALL or DO? */ + bool has_exception_block; /* contains BEGIN...EXCEPTION? */ /* these fields change when the function is used */ struct PLpgSQL_execstate *cur_estate; @@ -1314,6 +1318,7 @@ extern PLpgSQL_nsitem *plpgsql_ns_find_nearest_loop(PLpgSQL_nsitem *ns_cur); */ extern PGDLLEXPORT const char *plpgsql_stmt_typename(PLpgSQL_stmt *stmt); extern const char *plpgsql_getdiag_kindname(PLpgSQL_getdiag_kind kind); +extern void plpgsql_mark_local_assignment_targets(PLpgSQL_function *func); extern void plpgsql_free_function_memory(PLpgSQL_function *func); extern void plpgsql_dumptree(PLpgSQL_function *func); -- 2.43.5 From 36cb6bb2dd860c2e3e02cdf871acde9efded97e1 Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Fri, 1 Nov 2024 16:50:25 -0400 Subject: [PATCH v1 3/3] Implement new optimization rule for updates of expanded variables. If a read/write expanded variable is declared locally to the assignment statement that is updating it, and it is referenced exactly once in the assignment RHS, then we can optimize the operation as a direct update of the expanded value, whether or not the function(s) operating on it can be trusted not to modify the value before throwing an error. This works because if an error does get thrown, we no longer care what value the variable has. In cases where that doesn't work, fall back to the previous rule that checks for safety of the top-level function. In any case, postpone determination of whether these optimizations are feasible until we are executing a Param referencing the target variable and that variable holds a R/W expanded object. While the previous incarnation of exec_check_rw_parameter was pretty cheap, this is a bit less so, and our plan to invoke support functions will make it even less so. So avoiding the check for variables where it couldn't be useful should be a win. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/include/executor/execExpr.h | 1 + src/pl/plpgsql/src/expected/plpgsql_array.out | 9 + src/pl/plpgsql/src/pl_exec.c | 377 +++++++++++++++--- src/pl/plpgsql/src/plpgsql.h | 22 +- src/pl/plpgsql/src/sql/plpgsql_array.sql | 9 + src/tools/pgindent/typedefs.list | 2 + 6 files changed, 358 insertions(+), 62 deletions(-) diff --git a/src/include/executor/execExpr.h b/src/include/executor/execExpr.h index cd97dfa062..b5fa05fc63 100644 --- a/src/include/executor/execExpr.h +++ b/src/include/executor/execExpr.h @@ -405,6 +405,7 @@ typedef struct ExprEvalStep { ExecEvalSubroutine paramfunc; /* add-on evaluation subroutine */ void *paramarg; /* private data for same */ + void *paramarg2; /* more private data for same */ int paramid; /* numeric ID for parameter */ Oid paramtype; /* OID of parameter's datatype */ } cparam; diff --git a/src/pl/plpgsql/src/expected/plpgsql_array.out b/src/pl/plpgsql/src/expected/plpgsql_array.out index ad60e0e8be..e5db6d6087 100644 --- a/src/pl/plpgsql/src/expected/plpgsql_array.out +++ b/src/pl/plpgsql/src/expected/plpgsql_array.out @@ -52,6 +52,15 @@ NOTICE: a = ("{""(,11)""}",), a.c1[1].i = 11 do $$ declare a int[]; begin a := array_agg(x) from (values(1),(2),(3)) v(x); raise notice 'a = %', a; end$$; NOTICE: a = {1,2,3} +do $$ declare a int[] := array[1,2,3]; +begin + -- test scenarios for optimization of updates of R/W expanded objects + a := array_append(a, 42); -- optimizable using "transfer" method + a := a || a[3]; -- optimizable using "inplace" method + a := a || a; -- not optimizable + raise notice 'a = %', a; +end$$; +NOTICE: a = {1,2,3,42,3,1,2,3,42,3} create temp table onecol as select array[1,2] as f1; do $$ declare a int[]; begin a := f1 from onecol; raise notice 'a = %', a; end$$; diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index ce7cdb6458..c29de7a30f 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -251,6 +251,15 @@ static HTAB *shared_cast_hash = NULL; else \ Assert(rc == PLPGSQL_RC_OK) +/* State struct for count_param_references */ +typedef struct count_param_references_context +{ + int paramid; + int count; + Param *last_param; +} count_param_references_context; + + /************************************************************ * Local function forward declarations ************************************************************/ @@ -336,7 +345,9 @@ static void exec_prepare_plan(PLpgSQL_execstate *estate, static void exec_simple_check_plan(PLpgSQL_execstate *estate, PLpgSQL_expr *expr); static bool exec_is_simple_query(PLpgSQL_expr *expr); static void exec_save_simple_expr(PLpgSQL_expr *expr, CachedPlan *cplan); -static void exec_check_rw_parameter(PLpgSQL_expr *expr); +static void exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid); +static bool count_param_references(Node *node, + count_param_references_context *context); static void exec_check_assignable(PLpgSQL_execstate *estate, int dno); static bool exec_eval_simple_expr(PLpgSQL_execstate *estate, PLpgSQL_expr *expr, @@ -384,6 +395,10 @@ static ParamExternData *plpgsql_param_fetch(ParamListInfo params, static void plpgsql_param_compile(ParamListInfo params, Param *param, ExprState *state, Datum *resv, bool *resnull); +static void plpgsql_param_eval_var_check(ExprState *state, ExprEvalStep *op, + ExprContext *econtext); +static void plpgsql_param_eval_var_transfer(ExprState *state, ExprEvalStep *op, + ExprContext *econtext); static void plpgsql_param_eval_var(ExprState *state, ExprEvalStep *op, ExprContext *econtext); static void plpgsql_param_eval_var_ro(ExprState *state, ExprEvalStep *op, @@ -6078,10 +6093,13 @@ exec_eval_simple_expr(PLpgSQL_execstate *estate, /* * Reset to "not simple" to leave sane state (with no dangling - * pointers) in case we fail while replanning. expr_simple_plansource - * can be left alone however, as that cannot move. + * pointers) in case we fail while replanning. We'll need to + * re-determine simplicity and R/W optimizability anyway, since those + * could change with the new plan. expr_simple_plansource can be left + * alone however, as that cannot move. */ expr->expr_simple_expr = NULL; + expr->expr_rwopt = PLPGSQL_RWOPT_UNKNOWN; expr->expr_rw_param = NULL; expr->expr_simple_plan = NULL; expr->expr_simple_plan_lxid = InvalidLocalTransactionId; @@ -6439,16 +6457,27 @@ plpgsql_param_compile(ParamListInfo params, Param *param, scratch.resnull = resnull; /* - * Select appropriate eval function. It seems worth special-casing - * DTYPE_VAR and DTYPE_RECFIELD for performance. Also, we can determine - * in advance whether MakeExpandedObjectReadOnly() will be required. - * Currently, only VAR/PROMISE and REC datums could contain read/write - * expanded objects. + * Select appropriate eval function. + * + * First, if this Param references the same varlena-type DTYPE_VAR datum + * that is the target of the assignment containing this simple expression, + * then it's possible we will be able to optimize handling of R/W expanded + * datums. We don't want to do the work needed to determine that unless + * we actually see a R/W expanded datum at runtime, so install a checking + * function that will figure that out when needed. + * + * Otherwise, it seems worth special-casing DTYPE_VAR and DTYPE_RECFIELD + * for performance. Also, we can determine in advance whether + * MakeExpandedObjectReadOnly() will be required. Currently, only + * VAR/PROMISE and REC datums could contain read/write expanded objects. */ if (datum->dtype == PLPGSQL_DTYPE_VAR) { - if (param != expr->expr_rw_param && - ((PLpgSQL_var *) datum)->datatype->typlen == -1) + bool isvarlena = (((PLpgSQL_var *) datum)->datatype->typlen == -1); + + if (isvarlena && dno == expr->target_param && expr->expr_simple_expr) + scratch.d.cparam.paramfunc = plpgsql_param_eval_var_check; + else if (isvarlena) scratch.d.cparam.paramfunc = plpgsql_param_eval_var_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_var; @@ -6457,14 +6486,12 @@ plpgsql_param_compile(ParamListInfo params, Param *param, scratch.d.cparam.paramfunc = plpgsql_param_eval_recfield; else if (datum->dtype == PLPGSQL_DTYPE_PROMISE) { - if (param != expr->expr_rw_param && - ((PLpgSQL_var *) datum)->datatype->typlen == -1) + if (((PLpgSQL_var *) datum)->datatype->typlen == -1) scratch.d.cparam.paramfunc = plpgsql_param_eval_generic_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_generic; } - else if (datum->dtype == PLPGSQL_DTYPE_REC && - param != expr->expr_rw_param) + else if (datum->dtype == PLPGSQL_DTYPE_REC) scratch.d.cparam.paramfunc = plpgsql_param_eval_generic_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_generic; @@ -6473,14 +6500,170 @@ plpgsql_param_compile(ParamListInfo params, Param *param, * Note: it's tempting to use paramarg to store the estate pointer and * thereby save an indirection or two in the eval functions. But that * doesn't work because the compiled expression might be used with - * different estates for the same PL/pgSQL function. + * different estates for the same PL/pgSQL function. Instead, store + * pointers to the PLpgSQL_expr as well as this specific Param, to support + * plpgsql_param_eval_var_check(). */ - scratch.d.cparam.paramarg = NULL; + scratch.d.cparam.paramarg = expr; + scratch.d.cparam.paramarg2 = param; scratch.d.cparam.paramid = param->paramid; scratch.d.cparam.paramtype = param->paramtype; ExprEvalPushStep(state, &scratch); } +/* + * plpgsql_param_eval_var_check evaluation of EEOP_PARAM_CALLBACK step + * + * This is specialized to the case of DTYPE_VAR variables for which + * we may need to determine the applicability of a read/write optimization, + * but we've not done that yet. + */ +static void +plpgsql_param_eval_var_check(ExprState *state, ExprEvalStep *op, + ExprContext *econtext) +{ + ParamListInfo params; + PLpgSQL_execstate *estate; + int dno = op->d.cparam.paramid - 1; + PLpgSQL_var *var; + + /* fetch back the hook data */ + params = econtext->ecxt_param_list_info; + estate = (PLpgSQL_execstate *) params->paramFetchArg; + Assert(dno >= 0 && dno < estate->ndatums); + + /* now we can access the target datum */ + var = (PLpgSQL_var *) estate->datums[dno]; + Assert(var->dtype == PLPGSQL_DTYPE_VAR); + + /* + * If the variable's current value is a R/W expanded object, it's time to + * decide whether/how to optimize the assignment. + */ + if (!var->isnull && + VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value))) + { + PLpgSQL_expr *expr = (PLpgSQL_expr *) op->d.cparam.paramarg; + Param *param = (Param *) op->d.cparam.paramarg2; + + /* + * We might have already figured this out while evaluating some other + * Param referencing the same variable. + */ + if (expr->expr_rwopt == PLPGSQL_RWOPT_UNKNOWN) + exec_check_rw_parameter(expr, op->d.cparam.paramid); + + /* + * Update the callback pointer to match what we decided to do, and + * pass off this execution to the selected function. + */ + switch (expr->expr_rwopt) + { + case PLPGSQL_RWOPT_UNKNOWN: + Assert(false); + break; + case PLPGSQL_RWOPT_NOPE: + /* Force the value to read-only in all future executions */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_ro; + plpgsql_param_eval_var_ro(state, op, econtext); + break; + case PLPGSQL_RWOPT_TRANSFER: + /* There can be only one matching Param in this case */ + Assert(param == expr->expr_rw_param); + /* When the value is read/write, transfer to exec context */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_transfer; + plpgsql_param_eval_var_transfer(state, op, econtext); + break; + case PLPGSQL_RWOPT_INPLACE: + if (param == expr->expr_rw_param) + { + /* When the value is read/write, deliver it as-is */ + op->d.cparam.paramfunc = plpgsql_param_eval_var; + plpgsql_param_eval_var(state, op, econtext); + } + else + { + /* Not the optimizable reference, so force to read-only */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_ro; + plpgsql_param_eval_var_ro(state, op, econtext); + } + break; + } + return; + } + + /* + * Otherwise, continue to postpone that decision, and execute an inlined + * version of exec_eval_datum(). Although this value could potentially + * need MakeExpandedObjectReadOnly, we know it doesn't right now. + */ + *op->resvalue = var->value; + *op->resnull = var->isnull; + + /* safety check -- an assertion should be sufficient */ + Assert(var->datatype->typoid == op->d.cparam.paramtype); +} + +/* + * plpgsql_param_eval_var_transfer evaluation of EEOP_PARAM_CALLBACK step + * + * This is specialized to the case of DTYPE_VAR variables for which + * we have determined that a read/write expanded value can be handed off + * into execution of the expression (and then possibly returned to our + * function's ownership afterwards). We have to test though, because the + * variable might not contain a read/write expanded value during this + * execution. + */ +static void +plpgsql_param_eval_var_transfer(ExprState *state, ExprEvalStep *op, + ExprContext *econtext) +{ + ParamListInfo params; + PLpgSQL_execstate *estate; + int dno = op->d.cparam.paramid - 1; + PLpgSQL_var *var; + + /* fetch back the hook data */ + params = econtext->ecxt_param_list_info; + estate = (PLpgSQL_execstate *) params->paramFetchArg; + Assert(dno >= 0 && dno < estate->ndatums); + + /* now we can access the target datum */ + var = (PLpgSQL_var *) estate->datums[dno]; + Assert(var->dtype == PLPGSQL_DTYPE_VAR); + + /* + * If the variable's current value is a R/W expanded object, transfer its + * ownership into the expression execution context, then drop our own + * reference to the value by setting the variable to NULL. That'll be + * overwritten (perhaps with this same object) when control comes back + * from the expression. + */ + if (!var->isnull && + VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value))) + { + *op->resvalue = TransferExpandedObject(var->value, + get_eval_mcontext(estate)); + *op->resnull = false; + + var->value = (Datum) 0; + var->isnull = true; + var->freeval = false; + } + else + { + /* + * Otherwise we can pass the variable's value directly; we now know + * that MakeExpandedObjectReadOnly isn't needed. + */ + *op->resvalue = var->value; + *op->resnull = var->isnull; + } + + /* safety check -- an assertion should be sufficient */ + Assert(var->datatype->typoid == op->d.cparam.paramtype); +} + /* * plpgsql_param_eval_var evaluation of EEOP_PARAM_CALLBACK step * @@ -7957,9 +8140,10 @@ exec_simple_check_plan(PLpgSQL_execstate *estate, PLpgSQL_expr *expr) MemoryContext oldcontext; /* - * Initialize to "not simple". + * Initialize to "not simple", and reset R/W optimizability. */ expr->expr_simple_expr = NULL; + expr->expr_rwopt = PLPGSQL_RWOPT_UNKNOWN; expr->expr_rw_param = NULL; /* @@ -8164,88 +8348,133 @@ exec_save_simple_expr(PLpgSQL_expr *expr, CachedPlan *cplan) expr->expr_simple_typmod = exprTypmod((Node *) tle_expr); /* We also want to remember if it is immutable or not */ expr->expr_simple_mutable = contain_mutable_functions((Node *) tle_expr); - - /* - * Lastly, check to see if there's a possibility of optimizing a - * read/write parameter. - */ - exec_check_rw_parameter(expr); } /* * exec_check_rw_parameter --- can we pass expanded object as read/write param? * - * If we have an assignment like "x := array_append(x, foo)" in which the + * There are two separate cases in which we can optimize an update to a + * variable that has a read/write expanded value by letting the called + * expression operate directly on the expanded value. In both cases we + * are considering assignments like "var := array_append(var, foo)" where + * the assignment target is also an input to the RHS expression. + * + * Case 1 (RWOPT_TRANSFER rule): if the variable is "local" in the sense that + * its declaration is not outside any BEGIN...EXCEPTION block surrounding the + * assignment, then we do not need to worry about preserving its value if the + * RHS expression throws an error. If in addition the variable is referenced + * exactly once in the RHS expression, then we can optimize by converting the + * read/write expanded value into a transient value within the expression + * evaluation context, and then setting the variable's recorded value to NULL + * to prevent double-free attempts. This works regardless of any other + * details of the RHS expression. If the expression eventually returns that + * same expanded object (possibly modified) then the variable will re-acquire + * ownership; while if it returns something else or throws an error, the + * expanded object will be discarded as part of cleanup of the evaluation + * context. + * + * Case 2 (RWOPT_INPLACE rule): if we have a non-local assignment or if + * it looks like "var := array_append(var, var[1])" with multiple references + * to the target variable, then we can't use case 1. Nonetheless, if the * top-level function is trusted not to corrupt its argument in case of an - * error, then when x has an expanded object as value, it is safe to pass the - * value as a read/write pointer and let the function modify the value - * in-place. + * error, then when the var has an expanded object as value, it is safe to + * pass the value as a read/write pointer to the top-level function and let + * the function modify the value in-place. (Any other references have to be + * passed as read-only pointers as usual.) Only the top-level function has to + * be trusted, since if anything further down fails, the object hasn't been + * modified yet. * - * This function checks for a safe expression, and sets expr->expr_rw_param - * to the address of any Param within the expression that can be passed as - * read/write (there can be only one); or to NULL when there is no safe Param. + * This function checks to see if the assignment is optimizable according + * to either rule, and updates expr->expr_rwopt accordingly. In addition, + * it sets expr->expr_rw_param to the address of the Param within the + * expression that can be passed as read/write (there can be only one); + * or to NULL when there is no safe Param. * - * Note that this mechanism intentionally applies the safety labeling to just - * one Param; the expression could contain other Params referencing the target - * variable, but those must still be treated as read-only. + * Note that this mechanism intentionally allows just one Param to emit a + * read/write pointer; in case 2, the expression could contain other Params + * referencing the target variable, but those must be treated as read-only. * * Also note that we only apply this optimization within simple expressions. * There's no point in it for non-simple expressions, because the * exec_run_select code path will flatten any expanded result anyway. - * Also, it's safe to assume that an expr_simple_expr tree won't get copied - * somewhere before it gets compiled, so that looking for pointer equality - * to expr_rw_param will work for matching the target Param. That'd be much - * shakier in the general case. */ static void -exec_check_rw_parameter(PLpgSQL_expr *expr) +exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid) { - int target_dno; + Expr *sexpr = expr->expr_simple_expr; Oid funcid; List *fargs; ListCell *lc; /* Assume unsafe */ + expr->expr_rwopt = PLPGSQL_RWOPT_NOPE; expr->expr_rw_param = NULL; - /* Done if expression isn't an assignment source */ - target_dno = expr->target_param; - if (target_dno < 0) - return; + /* Shouldn't be here for non-simple expression */ + Assert(sexpr != NULL); + + /* Param should match the expression's assignment target, too */ + Assert(paramid == expr->target_param + 1); /* - * If target variable isn't referenced by expression, no need to look - * further. + * If the assignment is to a "local" variable (one whose value won't + * matter anymore if expression evaluation fails), and this Param is the + * only reference to that variable in the expression, then we can + * unconditionally optimize using the "transfer" method. */ - if (!bms_is_member(target_dno, expr->paramnos)) - return; + if (expr->target_is_local) + { + count_param_references_context context; - /* Shouldn't be here for non-simple expression */ - Assert(expr->expr_simple_expr != NULL); + /* See how many references there are, and find one of them */ + context.paramid = paramid; + context.count = 0; + context.last_param = NULL; + (void) count_param_references((Node *) sexpr, &context); + + /* If we're here, the expr must contain some reference to the var */ + Assert(context.count > 0); + + /* If exactly one reference, success! */ + if (context.count == 1) + { + expr->expr_rwopt = PLPGSQL_RWOPT_TRANSFER; + expr->expr_rw_param = context.last_param; + return; + } + } /* + * Otherwise, see if we can trust the expression's top-level function to + * apply the "inplace" method. + * * Top level of expression must be a simple FuncExpr, OpExpr, or - * SubscriptingRef, else we can't optimize. + * SubscriptingRef, else we can't identify which function is relevant. But + * it's okay to look through any RelabelType above that, since that can't + * fail. */ - if (IsA(expr->expr_simple_expr, FuncExpr)) + if (IsA(sexpr, RelabelType)) + sexpr = ((RelabelType *) sexpr)->arg; + if (IsA(sexpr, FuncExpr)) { - FuncExpr *fexpr = (FuncExpr *) expr->expr_simple_expr; + FuncExpr *fexpr = (FuncExpr *) sexpr; funcid = fexpr->funcid; fargs = fexpr->args; } - else if (IsA(expr->expr_simple_expr, OpExpr)) + else if (IsA(sexpr, OpExpr)) { - OpExpr *opexpr = (OpExpr *) expr->expr_simple_expr; + OpExpr *opexpr = (OpExpr *) sexpr; funcid = opexpr->opfuncid; fargs = opexpr->args; } - else if (IsA(expr->expr_simple_expr, SubscriptingRef)) + else if (IsA(sexpr, SubscriptingRef)) { - SubscriptingRef *sbsref = (SubscriptingRef *) expr->expr_simple_expr; + SubscriptingRef *sbsref = (SubscriptingRef *) sexpr; /* We only trust standard varlena arrays to be safe */ + /* TODO: install some extensibility here */ if (get_typsubscript(sbsref->refcontainertype, NULL) != F_ARRAY_SUBSCRIPT_HANDLER) return; @@ -8256,9 +8485,10 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) Param *param = (Param *) sbsref->refexpr; if (param->paramkind == PARAM_EXTERN && - param->paramid == target_dno + 1) + param->paramid == paramid) { /* Found the Param we want to pass as read/write */ + expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; expr->expr_rw_param = param; return; } @@ -8293,9 +8523,10 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) Param *param = (Param *) arg; if (param->paramkind == PARAM_EXTERN && - param->paramid == target_dno + 1) + param->paramid == paramid) { /* Found the Param we want to pass as read/write */ + expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; expr->expr_rw_param = param; return; } @@ -8303,6 +8534,36 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) } } +/* + * Count Params referencing the specified paramid, and return one of them + * if there are any. + * + * We actually only need to distinguish 0, 1, and N references; so we can + * abort the tree traversal as soon as we've found two. + */ +static bool +count_param_references(Node *node, count_param_references_context *context) +{ + if (node == NULL) + return false; + else if (IsA(node, Param)) + { + Param *param = (Param *) node; + + if (param->paramkind == PARAM_EXTERN && + param->paramid == context->paramid) + { + context->last_param = param; + if (++(context->count) > 1) + return true; /* abort tree traversal */ + } + return false; + } + else + return expression_tree_walker(node, count_param_references, + (void *) context); +} + /* * exec_check_assignable --- is it OK to assign to the indicated datum? * diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index c6fadc5660..3bafeea28b 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -187,6 +187,17 @@ typedef enum PLpgSQL_resolve_option PLPGSQL_RESOLVE_COLUMN, /* prefer table column to plpgsql var */ } PLpgSQL_resolve_option; +/* + * Status of optimization of assignment to a read/write expanded object + */ +typedef enum PLpgSQL_rwopt +{ + PLPGSQL_RWOPT_UNKNOWN = 0, /* applicability not determined yet */ + PLPGSQL_RWOPT_NOPE, /* cannot do any optimization */ + PLPGSQL_RWOPT_TRANSFER, /* transfer the old value into expr state */ + PLPGSQL_RWOPT_INPLACE, /* pass value as R/W to top-level function */ +} PLpgSQL_rwopt; + /********************************************************************** * Node and structure definitions @@ -246,11 +257,14 @@ typedef struct PLpgSQL_expr bool expr_simple_mutable; /* true if simple expr is mutable */ /* - * If we match a Param within expr_simple_expr to the variable identified - * by target_param, that Param's address is stored in expr_rw_param; then - * expression code generation will allow the value for that Param to be - * passed as a read/write expanded-object pointer. + * expr_rwopt tracks whether we have determined that assignment to a + * read/write expanded object (stored in the target_param datum) can be + * optimized by passing it to the expr as a read/write expanded-object + * pointer. If so, expr_rw_param identifies the specific Param that + * should emit a read/write pointer; any others will emit read-only + * pointers. */ + PLpgSQL_rwopt expr_rwopt; /* can we apply R/W optimization? */ Param *expr_rw_param; /* read/write Param within expr, if any */ /* diff --git a/src/pl/plpgsql/src/sql/plpgsql_array.sql b/src/pl/plpgsql/src/sql/plpgsql_array.sql index 4b9ff51594..4a346203dc 100644 --- a/src/pl/plpgsql/src/sql/plpgsql_array.sql +++ b/src/pl/plpgsql/src/sql/plpgsql_array.sql @@ -48,6 +48,15 @@ begin a.c1[1].i := 11; raise notice 'a = %, a.c1[1].i = %', a, a.c1[1].i; end$$; do $$ declare a int[]; begin a := array_agg(x) from (values(1),(2),(3)) v(x); raise notice 'a = %', a; end$$; +do $$ declare a int[] := array[1,2,3]; +begin + -- test scenarios for optimization of updates of R/W expanded objects + a := array_append(a, 42); -- optimizable using "transfer" method + a := a || a[3]; -- optimizable using "inplace" method + a := a || a; -- not optimizable + raise notice 'a = %', a; +end$$; + create temp table onecol as select array[1,2] as f1; do $$ declare a int[]; diff --git a/src/tools/pgindent/typedefs.list b/src/tools/pgindent/typedefs.list index 171a7dd5d2..ea1735386f 100644 --- a/src/tools/pgindent/typedefs.list +++ b/src/tools/pgindent/typedefs.list @@ -1866,6 +1866,7 @@ PLpgSQL_rec PLpgSQL_recfield PLpgSQL_resolve_option PLpgSQL_row +PLpgSQL_rwopt PLpgSQL_stmt PLpgSQL_stmt_assert PLpgSQL_stmt_assign @@ -3392,6 +3393,7 @@ core_yy_extra_type core_yyscan_t corrupt_items cost_qual_eval_context +count_param_references_context cp_hash_func create_upper_paths_hook_type createdb_failure_params -- 2.43.5
On Fri, Nov 1, 2024 at 3:20 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
Well, you shouldn't be using PERFORM. Not only does it not do the
right thing, but it's not optimized for expanded objects at all:
they'll get flattened both on the way into the statement and on
the way out. Try it with
graph := set_element(graph, 1, 1, 1);
RETURN nvals(graph);
Ah my bad, you mentioned that already and I missed it, here's the two backtraces with the assignment:
regards, tom lane
On Fri, Nov 1, 2024 at 3:27 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
Here is a v1 patch series that does the first part of what we've been
talking about, which is to implement the new optimization rule for
the case of a single RHS reference to the target variable. I'm
curious to know if it helps you at all by itself. You'll definitely
also need commit 534d0ea6c, so probably applying these on our git
master branch would be the place to start.
I'll apply these tonight and get back to you asap. There are many functions in my API that take only one expanded RHS argument, so I'll look for some cases where your changes reduce expansions when I run my tests.
Thank you!
-Michel
On Fri, Nov 1, 2024 at 5:53 PM Michel Pelletier <pelletier.michel@gmail.com> wrote:
On Fri, Nov 1, 2024 at 3:27 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:Michel Pelletier <pelletier.michel@gmail.com> writes:
Here is a v1 patch series that does the first part of what we've been
talking about, which is to implement the new optimization rule for
the case of a single RHS reference to the target variable. I'm
curious to know if it helps you at all by itself. You'll definitely
also need commit 534d0ea6c, so probably applying these on our git
master branch would be the place to start.I'll apply these tonight and get back to you asap. There are many functions in my API that take only one expanded RHS argument, so I'll look for some cases where your changes reduce expansions when I run my tests.
create or replace function test_expand(graph matrix) returns matrix language plpgsql as
$$
declare
nvals bigint = nvals(graph);
begin
return graph;
end;
$$;
$$
declare
nvals bigint = nvals(graph);
begin
return graph;
end;
$$;
postgres=# select test_expand(a) from test_fixture ;
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_out
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_out
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
The second expansion in matrix_out happens outside the function, so inside there is only the one expansion for both matrix_nvals and the assignment. Thank you! All my tests continue to pass and the change seems to work well.
Looking forward to helping test the support function idea, let me know if there's anything else I can do to validate the idea.
-Michel
The second expansion in matrix_out happens outside the function, so inside there is only the one expansion for both matrix_nvals and the assignment. Thank you! All my tests continue to pass and the change seems to work well.
Hmm, well I spoke to soon looking at the wrong function without an assignment, I've been down in gdb a bit too much and I got mixed up, here's another function that wraps it and does an assignment, and it looks like there is another expansion happening after the nvals but before the assignment:
create or replace function test_expand_expand(graph matrix) returns matrix language plpgsql as
$$
declare
nvals bigint = nvals(graph);
begin
graph = test_expand(graph);
return test_expand(graph);
end;
$$;
$$
declare
nvals bigint = nvals(graph);
begin
graph = test_expand(graph);
return test_expand(graph);
end;
$$;
postgres=# select test_expand_expand(a) from test_fixture ;
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_out
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
DEBUG: matrix_out
DEBUG: DatumGetMatrix
DEBUG: expand_matrix
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
So going back on the assumption I'm somehow not returning the right pointer, but digging into the array code I'm pretty sure I'm following the same pattern and tracing my code path is calling EOHPGetRWDatum when I return the object, I can't get it to stop on DeleteExpandedObject, so I don't think plpgsql is deleting it, I'm going to keep investigating, sorry for the noise.
-Michel
So going back on the assumption I'm somehow not returning the right pointer, but digging into the array code I'm pretty sure I'm following the same pattern and tracing my code path is calling EOHPGetRWDatum when I return the object, I can't get it to stop on DeleteExpandedObject, so I don't think plpgsql is deleting it, I'm going to keep investigating, sorry for the noise.
Pavel Stehule indicated this might be a good example to put into contrib:
I've updated the repo to include points from our current discussion wrt multiple expansions in a plpgsql function, and added some test functions. The new pgexpanded type just keeps track of the number of times it's been expanded starting from an initialized value. While it only stores a flattened integer, it follows the typical pattern of pallocing the value and pfreeing it with a memory context callback, so it should be covering most of the boilerplate needed to start a new expanded type.
This would also be a good place to showcase and test the support function feature you've described to me.
It occurs to me that including this example in contrib would be an easier way for us to collaborate on my existing issue instead of punting back and forth on the list and would benefit all future expanded object developers. Do you think that's a good idea? If this were in contrib you could access the code I'm working with directly and I can just follow along in my project.
-Michel
út 19. 11. 2024 v 18:51 odesílatel Michel Pelletier <pelletier.michel@gmail.com> napsal:
So going back on the assumption I'm somehow not returning the right pointer, but digging into the array code I'm pretty sure I'm following the same pattern and tracing my code path is calling EOHPGetRWDatum when I return the object, I can't get it to stop on DeleteExpandedObject, so I don't think plpgsql is deleting it, I'm going to keep investigating, sorry for the noise.A couple years ago I tried to compress what I learned about expanded objects into a dummy extension that just provides the necessary boilerplate. It wasn't great but a start:Pavel Stehule indicated this might be a good example to put into contrib:I've updated the repo to include points from our current discussion wrt multiple expansions in a plpgsql function, and added some test functions. The new pgexpanded type just keeps track of the number of times it's been expanded starting from an initialized value. While it only stores a flattened integer, it follows the typical pattern of pallocing the value and pfreeing it with a memory context callback, so it should be covering most of the boilerplate needed to start a new expanded type.This would also be a good place to showcase and test the support function feature you've described to me.It occurs to me that including this example in contrib would be an easier way for us to collaborate on my existing issue instead of punting back and forth on the list and would benefit all future expanded object developers. Do you think that's a good idea? If this were in contrib you could access the code I'm working with directly and I can just follow along in my project.
another position can be src/test/modules - I think so your example is "similar" to plsample
Regards
Pavel
-Michel
Pavel Stehule <pavel.stehule@gmail.com> writes: > út 19. 11. 2024 v 18:51 odesílatel Michel Pelletier < > pelletier.michel@gmail.com> napsal: >> A couple years ago I tried to compress what I learned about expanded >> objects into a dummy extension that just provides the necessary >> boilerplate. It wasn't great but a start: >> https://github.com/michelp/pgexpanded >> Pavel Stehule indicated this might be a good example to put into contrib: > another position can be src/test/modules - I think so your example is > "similar" to plsample Yeah. I think we've largely adopted the position that contrib should contain installable modules that do something potentially useful to end-users. A pure skeleton wouldn't be that, but if it's fleshed out enough to be test code for some core features then src/test/modules could be a reasonable home. regards, tom lane
On Tue, Nov 19, 2024 at 11:45 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Pavel Stehule <pavel.stehule@gmail.com> writes:
> út 19. 11. 2024 v 18:51 odesílatel Michel Pelletier <
> pelletier.michel@gmail.com> napsal:
>> A couple years ago I tried to compress what I learned about expanded
>> objects into a dummy extension that just provides the necessary
>> boilerplate. It wasn't great but a start:
>> https://github.com/michelp/pgexpanded
>> Pavel Stehule indicated this might be a good example to put into contrib:
> another position can be src/test/modules - I think so your example is
> "similar" to plsample
Yeah. I think we've largely adopted the position that contrib should
contain installable modules that do something potentially useful to
end-users. A pure skeleton wouldn't be that, but if it's fleshed out
enough to be test code for some core features then src/test/modules
could be a reasonable home.
Great! I'll put a patch together that adds the skeleton object to src/test/modules and I'll write some expected tests that run the expansion through its paces, when the support function feature happens I'll update it to include tests for that.
Should I include Tom's patch changes on top of mine or keep those separate? I'm not entirely clear on the best practice to carry those forward as well.
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > Here's a WIP patch for a pgexpanded example in src/test/modules. I didn't look at your patch yet, but in the meantime here's an update that takes the next step towards what I promised. 0001-0003 are the same as before, with a couple of trivial changes to rebase them up to current HEAD. 0004 adds a support function request to allow extension functions to perform in-place updates. You should be able to use that to improve what your extension is doing. The new comments in supportnodes.h explain how to use it (plus see the built-in examples, though they are quite simple). regards, tom lane From 2b5c421f608d77994c1a898c9860b8a28bc2f46f Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Tue, 3 Dec 2024 14:31:04 -0500 Subject: [PATCH v2 1/4] Preliminary refactoring. This short and boring patch simply moves the responsibility for initializing PLpgSQL_expr.target_param into plpgsql parsing, rather than doing it at first execution of the expr as before. This doesn't save anything in terms of runtime, since the work was trivial and done only once per expr anyway. But it makes the info available during parsing, which will be useful for the next step. Likewise set PLpgSQL_expr.func during parsing. According to the comments, this was once impossible; but it's certainly possible since we invented the plpgsql_curr_compile variable. Again, this saves little runtime, but it seems far cleaner conceptually. While at it, I reordered stuff in struct PLpgSQL_expr to make it clearer which fields are filled when, and merged some duplicative code in pl_gram.y. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/pl/plpgsql/src/pl_exec.c | 27 --------------- src/pl/plpgsql/src/pl_gram.y | 65 ++++++++++++++++++++++++------------ src/pl/plpgsql/src/plpgsql.h | 31 +++++++++-------- 3 files changed, 62 insertions(+), 61 deletions(-) diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index e31206e7f4..1a9c010205 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -4174,12 +4174,6 @@ exec_prepare_plan(PLpgSQL_execstate *estate, SPIPlanPtr plan; SPIPrepareOptions options; - /* - * The grammar can't conveniently set expr->func while building the parse - * tree, so make sure it's set before parser hooks need it. - */ - expr->func = estate->func; - /* * Generate and save the plan */ @@ -5016,21 +5010,7 @@ exec_assign_expr(PLpgSQL_execstate *estate, PLpgSQL_datum *target, * If first time through, create a plan for this expression. */ if (expr->plan == NULL) - { - /* - * Mark the expression as being an assignment source, if target is a - * simple variable. (This is a bit messy, but it seems cleaner than - * modifying the API of exec_prepare_plan for the purpose. We need to - * stash the target dno into the expr anyway, so that it will be - * available if we have to replan.) - */ - if (target->dtype == PLPGSQL_DTYPE_VAR) - expr->target_param = target->dno; - else - expr->target_param = -1; /* should be that already */ - exec_prepare_plan(estate, expr, 0); - } value = exec_eval_expr(estate, expr, &isnull, &valtype, &valtypmod); exec_assign_value(estate, target, value, isnull, valtype, valtypmod); @@ -6282,13 +6262,6 @@ setup_param_list(PLpgSQL_execstate *estate, PLpgSQL_expr *expr) * that they are interrupting an active use of parameters. */ paramLI->parserSetupArg = expr; - - /* - * Also make sure this is set before parser hooks need it. There is - * no need to save and restore, since the value is always correct once - * set. (Should be set already, but let's be sure.) - */ - expr->func = estate->func; } else { diff --git a/src/pl/plpgsql/src/pl_gram.y b/src/pl/plpgsql/src/pl_gram.y index 8182ce28aa..5431977d69 100644 --- a/src/pl/plpgsql/src/pl_gram.y +++ b/src/pl/plpgsql/src/pl_gram.y @@ -64,6 +64,10 @@ static bool tok_is_keyword(int token, union YYSTYPE *lval, static void word_is_not_variable(PLword *word, int location); static void cword_is_not_variable(PLcword *cword, int location); static void current_token_is_not_variable(int tok); +static PLpgSQL_expr *make_plpgsql_expr(const char *query, + RawParseMode parsemode); +static void expr_is_assignment_source(PLpgSQL_expr *expr, + PLpgSQL_datum *target); static PLpgSQL_expr *read_sql_construct(int until, int until2, int until3, @@ -529,6 +533,10 @@ decl_statement : decl_varname decl_const decl_datatype decl_collate decl_notnull errmsg("variable \"%s\" must have a default value, since it's declared NOT NULL", var->refname), parser_errposition(@5))); + + if (var->default_val != NULL) + expr_is_assignment_source(var->default_val, + (PLpgSQL_datum *) var); } | decl_varname K_ALIAS K_FOR decl_aliasitem ';' { @@ -987,6 +995,7 @@ stmt_assign : T_DATUM pmode, false, true, NULL, NULL); + expr_is_assignment_source(new->expr, $1.datum); $$ = (PLpgSQL_stmt *) new; } @@ -2637,6 +2646,38 @@ current_token_is_not_variable(int tok) yyerror("syntax error"); } +/* Convenience routine to construct a PLpgSQL_expr struct */ +static PLpgSQL_expr * +make_plpgsql_expr(const char *query, + RawParseMode parsemode) +{ + PLpgSQL_expr *expr = palloc0(sizeof(PLpgSQL_expr)); + + expr->query = pstrdup(query); + expr->parseMode = parsemode; + expr->func = plpgsql_curr_compile; + expr->ns = plpgsql_ns_top(); + /* might get changed later during parsing: */ + expr->target_param = -1; + /* other fields are left as zeroes until first execution */ + return expr; +} + +/* Mark a PLpgSQL_expr as being the source of an assignment to target */ +static void +expr_is_assignment_source(PLpgSQL_expr *expr, PLpgSQL_datum *target) +{ + /* + * Mark the expression as being an assignment source, if target is a + * simple variable. We don't currently support optimized assignments to + * other DTYPEs. + */ + if (target->dtype == PLPGSQL_DTYPE_VAR) + expr->target_param = target->dno; + else + expr->target_param = -1; /* should be that already */ +} + /* Convenience routine to read an expression with one possible terminator */ static PLpgSQL_expr * read_sql_expression(int until, const char *expected) @@ -2774,13 +2815,7 @@ read_sql_construct(int until, */ plpgsql_append_source_text(&ds, startlocation, endlocation); - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = parsemode; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, parsemode); pfree(ds.data); if (valid_sql) @@ -3102,13 +3137,7 @@ make_execsql_stmt(int firsttoken, int location, PLword *word) while (ds.len > 0 && scanner_isspace(ds.data[ds.len - 1])) ds.data[--ds.len] = '\0'; - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = RAW_PARSE_DEFAULT; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, RAW_PARSE_DEFAULT); pfree(ds.data); check_sql_expr(expr->query, expr->parseMode, location); @@ -3980,13 +4009,7 @@ read_cursor_args(PLpgSQL_var *cursor, int until) appendStringInfoString(&ds, ", "); } - expr = palloc0(sizeof(PLpgSQL_expr)); - expr->query = pstrdup(ds.data); - expr->parseMode = RAW_PARSE_PLPGSQL_EXPR; - expr->plan = NULL; - expr->paramnos = NULL; - expr->target_param = -1; - expr->ns = plpgsql_ns_top(); + expr = make_plpgsql_expr(ds.data, RAW_PARSE_PLPGSQL_EXPR); pfree(ds.data); /* Next we'd better find the until token */ diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index 50c3b28472..fbb6000caa 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -219,14 +219,22 @@ typedef struct PLpgSQL_expr { char *query; /* query string, verbatim from function body */ RawParseMode parseMode; /* raw_parser() mode to use */ - SPIPlanPtr plan; /* plan, or NULL if not made yet */ - Bitmapset *paramnos; /* all dnos referenced by this query */ + struct PLpgSQL_function *func; /* function containing this expr */ + struct PLpgSQL_nsitem *ns; /* namespace chain visible to this expr */ - /* function containing this expr (not set until we first parse query) */ - struct PLpgSQL_function *func; + /* + * These fields are used to help optimize assignments to expanded-datum + * variables. If this expression is the source of an assignment to a + * simple variable, target_param holds that variable's dno (else it's -1). + */ + int target_param; /* dno of assign target, or -1 if none */ - /* namespace chain visible to this expr */ - struct PLpgSQL_nsitem *ns; + /* + * Fields above are set during plpgsql parsing. Remaining fields are left + * as zeroes/NULLs until we first parse/plan the query. + */ + SPIPlanPtr plan; /* plan, or NULL if not made yet */ + Bitmapset *paramnos; /* all dnos referenced by this query */ /* fields for "simple expression" fast-path execution: */ Expr *expr_simple_expr; /* NULL means not a simple expr */ @@ -235,14 +243,11 @@ typedef struct PLpgSQL_expr bool expr_simple_mutable; /* true if simple expr is mutable */ /* - * These fields are used to optimize assignments to expanded-datum - * variables. If this expression is the source of an assignment to a - * simple variable, target_param holds that variable's dno; else it's -1. - * If we match a Param within expr_simple_expr to such a variable, that - * Param's address is stored in expr_rw_param; then expression code - * generation will allow the value for that Param to be passed read/write. + * If we match a Param within expr_simple_expr to the variable identified + * by target_param, that Param's address is stored in expr_rw_param; then + * expression code generation will allow the value for that Param to be + * passed as a read/write expanded-object pointer. */ - int target_param; /* dno of assign target, or -1 if none */ Param *expr_rw_param; /* read/write Param within expr, if any */ /* -- 2.43.5 From 0e8a5e90ff546353391a57849413e802ec746153 Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Tue, 3 Dec 2024 14:32:13 -0500 Subject: [PATCH v2 2/4] Detect whether plpgsql assignment targets are "local" variables. Mark whether the target of a potentially optimizable assignment is "local", in the sense of being declared inside any exception block that could trap an error thrown from the assignment. (This implies that we needn't preserve the variable's value in case of an error.) Normally, this requires a post-parsing scan of the function's parse tree, since we don't know while parsing a BEGIN ... construct whether we will find EXCEPTION at its end. However, if there are no BEGIN ... EXCEPTION blocks in the function at all, then all assignments are local, even those to variables representing function arguments. We optimize that common case by initializing the target_is_local flags to "true", and fixing them up with a post-scan only if we found EXCEPTION. The scan is implemented by code that's largely copied-and-pasted from the nearby code to scan a plpgsql parse tree for deletion. It's a bit annoying to have three copies of that now, but I'm not seeing a way to refactor it that would save much code on net. Note that variables' default-value expressions are never interesting for expanded-variable optimization, since they couldn't contain a reference to the target variable anyway. But the code is set up to compute their target_param and target_is_local correctly anyway, for consistency and in case someone thinks of a use for that data. I added a bit of plpgsql_dumptree support to help verify that this code sets the flags as expected. I'm not set on keeping that, but I do want to keep the addition of a plpgsql_dumptree call in plpgsql_compile_inline. It's at best an oversight that "#option dump" doesn't work in a DO block. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/pl/plpgsql/src/pl_comp.c | 12 + src/pl/plpgsql/src/pl_funcs.c | 398 ++++++++++++++++++++++++++++++++++ src/pl/plpgsql/src/pl_gram.y | 15 ++ src/pl/plpgsql/src/plpgsql.h | 7 +- 4 files changed, 431 insertions(+), 1 deletion(-) diff --git a/src/pl/plpgsql/src/pl_comp.c b/src/pl/plpgsql/src/pl_comp.c index 6255a86d75..ed9845d85a 100644 --- a/src/pl/plpgsql/src/pl_comp.c +++ b/src/pl/plpgsql/src/pl_comp.c @@ -365,6 +365,7 @@ do_compile(FunctionCallInfo fcinfo, function->nstatements = 0; function->requires_procedure_resowner = false; + function->has_exception_block = false; /* * Initialize the compiler, particularly the namespace stack. The @@ -806,6 +807,9 @@ do_compile(FunctionCallInfo fcinfo, plpgsql_finish_datums(function); + if (function->has_exception_block) + plpgsql_mark_local_assignment_targets(function); + /* Debug dump for completed functions */ if (plpgsql_DumpExecTree) plpgsql_dumptree(function); @@ -899,6 +903,7 @@ plpgsql_compile_inline(char *proc_source) function->nstatements = 0; function->requires_procedure_resowner = false; + function->has_exception_block = false; plpgsql_ns_init(); plpgsql_ns_push(func_name, PLPGSQL_LABEL_BLOCK); @@ -956,6 +961,13 @@ plpgsql_compile_inline(char *proc_source) plpgsql_finish_datums(function); + if (function->has_exception_block) + plpgsql_mark_local_assignment_targets(function); + + /* Debug dump for completed functions */ + if (plpgsql_DumpExecTree) + plpgsql_dumptree(function); + /* * Pop the error context stack */ diff --git a/src/pl/plpgsql/src/pl_funcs.c b/src/pl/plpgsql/src/pl_funcs.c index eeb7c4d7c0..889377fc9a 100644 --- a/src/pl/plpgsql/src/pl_funcs.c +++ b/src/pl/plpgsql/src/pl_funcs.c @@ -333,6 +333,401 @@ plpgsql_getdiag_kindname(PLpgSQL_getdiag_kind kind) } +/********************************************************************** + * Mark assignment source expressions that have local target variables, + * that is, variables declared within the exception block most closely + * containing the assignment itself. (Such target variables need not be + * preserved if the assignment's source expression raises an error, + * allowing better optimization.) + * + * This code need not be called if the plpgsql function contains no exception + * blocks, because expr_is_assignment_source() will have set all the flags + * to true already. Also, we need not examine default-value expressions for + * variables, because variable declarations are necessarily within the nearest + * exception block. (In DECLARE ... BEGIN ... EXCEPTION ... END, the variable + * initializations are done before entering the exception scope.) So it's + * sufficient to find assignment statements. + * + * Within the recursion, local_dnos is a Bitmapset of dnos of variables + * known to be declared within the current exception level. + **********************************************************************/ +static void mark_stmt(PLpgSQL_stmt *stmt, Bitmapset *local_dnos); +static void mark_block(PLpgSQL_stmt_block *block, Bitmapset *local_dnos); +static void mark_assign(PLpgSQL_stmt_assign *stmt, Bitmapset *local_dnos); +static void mark_if(PLpgSQL_stmt_if *stmt, Bitmapset *local_dnos); +static void mark_case(PLpgSQL_stmt_case *stmt, Bitmapset *local_dnos); +static void mark_loop(PLpgSQL_stmt_loop *stmt, Bitmapset *local_dnos); +static void mark_while(PLpgSQL_stmt_while *stmt, Bitmapset *local_dnos); +static void mark_fori(PLpgSQL_stmt_fori *stmt, Bitmapset *local_dnos); +static void mark_fors(PLpgSQL_stmt_fors *stmt, Bitmapset *local_dnos); +static void mark_forc(PLpgSQL_stmt_forc *stmt, Bitmapset *local_dnos); +static void mark_foreach_a(PLpgSQL_stmt_foreach_a *stmt, Bitmapset *local_dnos); +static void mark_exit(PLpgSQL_stmt_exit *stmt, Bitmapset *local_dnos); +static void mark_return(PLpgSQL_stmt_return *stmt, Bitmapset *local_dnos); +static void mark_return_next(PLpgSQL_stmt_return_next *stmt, Bitmapset *local_dnos); +static void mark_return_query(PLpgSQL_stmt_return_query *stmt, Bitmapset *local_dnos); +static void mark_raise(PLpgSQL_stmt_raise *stmt, Bitmapset *local_dnos); +static void mark_assert(PLpgSQL_stmt_assert *stmt, Bitmapset *local_dnos); +static void mark_execsql(PLpgSQL_stmt_execsql *stmt, Bitmapset *local_dnos); +static void mark_dynexecute(PLpgSQL_stmt_dynexecute *stmt, Bitmapset *local_dnos); +static void mark_dynfors(PLpgSQL_stmt_dynfors *stmt, Bitmapset *local_dnos); +static void mark_getdiag(PLpgSQL_stmt_getdiag *stmt, Bitmapset *local_dnos); +static void mark_open(PLpgSQL_stmt_open *stmt, Bitmapset *local_dnos); +static void mark_fetch(PLpgSQL_stmt_fetch *stmt, Bitmapset *local_dnos); +static void mark_close(PLpgSQL_stmt_close *stmt, Bitmapset *local_dnos); +static void mark_perform(PLpgSQL_stmt_perform *stmt, Bitmapset *local_dnos); +static void mark_call(PLpgSQL_stmt_call *stmt, Bitmapset *local_dnos); +static void mark_commit(PLpgSQL_stmt_commit *stmt, Bitmapset *local_dnos); +static void mark_rollback(PLpgSQL_stmt_rollback *stmt, Bitmapset *local_dnos); + + +static void +mark_stmt(PLpgSQL_stmt *stmt, Bitmapset *local_dnos) +{ + switch (stmt->cmd_type) + { + case PLPGSQL_STMT_BLOCK: + mark_block((PLpgSQL_stmt_block *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ASSIGN: + mark_assign((PLpgSQL_stmt_assign *) stmt, local_dnos); + break; + case PLPGSQL_STMT_IF: + mark_if((PLpgSQL_stmt_if *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CASE: + mark_case((PLpgSQL_stmt_case *) stmt, local_dnos); + break; + case PLPGSQL_STMT_LOOP: + mark_loop((PLpgSQL_stmt_loop *) stmt, local_dnos); + break; + case PLPGSQL_STMT_WHILE: + mark_while((PLpgSQL_stmt_while *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORI: + mark_fori((PLpgSQL_stmt_fori *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORS: + mark_fors((PLpgSQL_stmt_fors *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FORC: + mark_forc((PLpgSQL_stmt_forc *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FOREACH_A: + mark_foreach_a((PLpgSQL_stmt_foreach_a *) stmt, local_dnos); + break; + case PLPGSQL_STMT_EXIT: + mark_exit((PLpgSQL_stmt_exit *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN: + mark_return((PLpgSQL_stmt_return *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN_NEXT: + mark_return_next((PLpgSQL_stmt_return_next *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RETURN_QUERY: + mark_return_query((PLpgSQL_stmt_return_query *) stmt, local_dnos); + break; + case PLPGSQL_STMT_RAISE: + mark_raise((PLpgSQL_stmt_raise *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ASSERT: + mark_assert((PLpgSQL_stmt_assert *) stmt, local_dnos); + break; + case PLPGSQL_STMT_EXECSQL: + mark_execsql((PLpgSQL_stmt_execsql *) stmt, local_dnos); + break; + case PLPGSQL_STMT_DYNEXECUTE: + mark_dynexecute((PLpgSQL_stmt_dynexecute *) stmt, local_dnos); + break; + case PLPGSQL_STMT_DYNFORS: + mark_dynfors((PLpgSQL_stmt_dynfors *) stmt, local_dnos); + break; + case PLPGSQL_STMT_GETDIAG: + mark_getdiag((PLpgSQL_stmt_getdiag *) stmt, local_dnos); + break; + case PLPGSQL_STMT_OPEN: + mark_open((PLpgSQL_stmt_open *) stmt, local_dnos); + break; + case PLPGSQL_STMT_FETCH: + mark_fetch((PLpgSQL_stmt_fetch *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CLOSE: + mark_close((PLpgSQL_stmt_close *) stmt, local_dnos); + break; + case PLPGSQL_STMT_PERFORM: + mark_perform((PLpgSQL_stmt_perform *) stmt, local_dnos); + break; + case PLPGSQL_STMT_CALL: + mark_call((PLpgSQL_stmt_call *) stmt, local_dnos); + break; + case PLPGSQL_STMT_COMMIT: + mark_commit((PLpgSQL_stmt_commit *) stmt, local_dnos); + break; + case PLPGSQL_STMT_ROLLBACK: + mark_rollback((PLpgSQL_stmt_rollback *) stmt, local_dnos); + break; + default: + elog(ERROR, "unrecognized cmd_type: %d", stmt->cmd_type); + break; + } +} + +static void +mark_stmts(List *stmts, Bitmapset *local_dnos) +{ + ListCell *s; + + foreach(s, stmts) + { + mark_stmt((PLpgSQL_stmt *) lfirst(s), local_dnos); + } +} + +static void +mark_block(PLpgSQL_stmt_block *block, Bitmapset *local_dnos) +{ + if (block->exceptions) + { + ListCell *e; + + /* + * The block creates a new exception scope, so variables declared at + * outer levels are nonlocal. For that matter, so are any variables + * declared in the block's DECLARE section. Hence, we must pass down + * empty local_dnos. + */ + mark_stmts(block->body, NULL); + + foreach(e, block->exceptions->exc_list) + { + PLpgSQL_exception *exc = (PLpgSQL_exception *) lfirst(e); + + mark_stmts(exc->action, NULL); + } + } + else + { + /* + * Otherwise, the block does not create a new exception scope, and any + * variables it declares can also be considered local within it. Note + * that only initializable datum types (VAR, REC) are included in + * initvarnos; but that's sufficient for our purposes. + */ + local_dnos = bms_copy(local_dnos); + for (int i = 0; i < block->n_initvars; i++) + local_dnos = bms_add_member(local_dnos, block->initvarnos[i]); + mark_stmts(block->body, local_dnos); + bms_free(local_dnos); + } +} + +static void +mark_assign(PLpgSQL_stmt_assign *stmt, Bitmapset *local_dnos) +{ + PLpgSQL_expr *expr = stmt->expr; + + /* + * If the assignment target is a plain DTYPE_VAR datum, mark it as local + * or not. (If it's not a VAR, we don't care.) + */ + if (expr->target_param >= 0) + expr->target_is_local = bms_is_member(expr->target_param, local_dnos); +} + +static void +mark_if(PLpgSQL_stmt_if *stmt, Bitmapset *local_dnos) +{ + ListCell *l; + + /* stmt->cond cannot be an assignment source */ + mark_stmts(stmt->then_body, local_dnos); + foreach(l, stmt->elsif_list) + { + PLpgSQL_if_elsif *elif = (PLpgSQL_if_elsif *) lfirst(l); + + /* elif->cond cannot be an assignment source */ + mark_stmts(elif->stmts, local_dnos); + } + mark_stmts(stmt->else_body, local_dnos); +} + +static void +mark_case(PLpgSQL_stmt_case *stmt, Bitmapset *local_dnos) +{ + ListCell *l; + + /* stmt->t_expr cannot be an assignment source */ + foreach(l, stmt->case_when_list) + { + PLpgSQL_case_when *cwt = (PLpgSQL_case_when *) lfirst(l); + + /* cwt->expr cannot be an assignment source */ + mark_stmts(cwt->stmts, local_dnos); + } + mark_stmts(stmt->else_stmts, local_dnos); +} + +static void +mark_loop(PLpgSQL_stmt_loop *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_while(PLpgSQL_stmt_while *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_fori(PLpgSQL_stmt_fori *stmt, Bitmapset *local_dnos) +{ + /* stmt->lower, upper, step cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_fors(PLpgSQL_stmt_fors *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->query cannot be an assignment source */ +} + +static void +mark_forc(PLpgSQL_stmt_forc *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->argquery cannot be an assignment source */ +} + +static void +mark_foreach_a(PLpgSQL_stmt_foreach_a *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ + mark_stmts(stmt->body, local_dnos); +} + +static void +mark_open(PLpgSQL_stmt_open *stmt, Bitmapset *local_dnos) +{ + /* stmt->argquery, query, dynquery cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_fetch(PLpgSQL_stmt_fetch *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_close(PLpgSQL_stmt_close *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_perform(PLpgSQL_stmt_perform *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_call(PLpgSQL_stmt_call *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_commit(PLpgSQL_stmt_commit *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_rollback(PLpgSQL_stmt_rollback *stmt, Bitmapset *local_dnos) +{ +} + +static void +mark_exit(PLpgSQL_stmt_exit *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond cannot be an assignment source */ +} + +static void +mark_return(PLpgSQL_stmt_return *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_return_next(PLpgSQL_stmt_return_next *stmt, Bitmapset *local_dnos) +{ + /* stmt->expr cannot be an assignment source */ +} + +static void +mark_return_query(PLpgSQL_stmt_return_query *stmt, Bitmapset *local_dnos) +{ + /* stmt->query, dynquery cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_raise(PLpgSQL_stmt_raise *stmt, Bitmapset *local_dnos) +{ + /* stmt->params cannot contain an assignment source */ + /* stmt->options cannot contain an assignment source */ +} + +static void +mark_assert(PLpgSQL_stmt_assert *stmt, Bitmapset *local_dnos) +{ + /* stmt->cond, message cannot be an assignment source */ +} + +static void +mark_execsql(PLpgSQL_stmt_execsql *stmt, Bitmapset *local_dnos) +{ + /* stmt->sqlstmt cannot be an assignment source */ +} + +static void +mark_dynexecute(PLpgSQL_stmt_dynexecute *stmt, Bitmapset *local_dnos) +{ + /* stmt->query cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_dynfors(PLpgSQL_stmt_dynfors *stmt, Bitmapset *local_dnos) +{ + mark_stmts(stmt->body, local_dnos); + /* stmt->query cannot be an assignment source */ + /* stmt->params cannot contain an assignment source */ +} + +static void +mark_getdiag(PLpgSQL_stmt_getdiag *stmt, Bitmapset *local_dnos) +{ +} + +void +plpgsql_mark_local_assignment_targets(PLpgSQL_function *func) +{ + Bitmapset *local_dnos; + + /* Function parameters can be treated as local targets at outer level */ + local_dnos = NULL; + for (int i = 0; i < func->fn_nargs; i++) + local_dnos = bms_add_member(local_dnos, func->fn_argvarnos[i]); + if (func->action) + mark_block(func->action, local_dnos); + bms_free(local_dnos); +} + + /********************************************************************** * Release memory when a PL/pgSQL function is no longer needed * @@ -1594,6 +1989,9 @@ static void dump_expr(PLpgSQL_expr *expr) { printf("'%s'", expr->query); + if (expr->target_param >= 0) + printf(" target %d%s", expr->target_param, + expr->target_is_local ? " (local)" : ""); } void diff --git a/src/pl/plpgsql/src/pl_gram.y b/src/pl/plpgsql/src/pl_gram.y index 5431977d69..ddbfda8388 100644 --- a/src/pl/plpgsql/src/pl_gram.y +++ b/src/pl/plpgsql/src/pl_gram.y @@ -2314,6 +2314,8 @@ exception_sect : PLpgSQL_exception_block *new = palloc(sizeof(PLpgSQL_exception_block)); PLpgSQL_variable *var; + plpgsql_curr_compile->has_exception_block = true; + var = plpgsql_build_variable("sqlstate", lineno, plpgsql_build_datatype(TEXTOID, -1, @@ -2659,6 +2661,7 @@ make_plpgsql_expr(const char *query, expr->ns = plpgsql_ns_top(); /* might get changed later during parsing: */ expr->target_param = -1; + expr->target_is_local = false; /* other fields are left as zeroes until first execution */ return expr; } @@ -2673,9 +2676,21 @@ expr_is_assignment_source(PLpgSQL_expr *expr, PLpgSQL_datum *target) * other DTYPEs. */ if (target->dtype == PLPGSQL_DTYPE_VAR) + { expr->target_param = target->dno; + + /* + * For now, assume the target is local to the nearest enclosing + * exception block. That's correct if the function contains no + * exception blocks; otherwise we'll update this later. + */ + expr->target_is_local = true; + } else + { expr->target_param = -1; /* should be that already */ + expr->target_is_local = false; /* ditto */ + } } /* Convenience routine to read an expression with one possible terminator */ diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index fbb6000caa..c6fadc5660 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -225,9 +225,12 @@ typedef struct PLpgSQL_expr /* * These fields are used to help optimize assignments to expanded-datum * variables. If this expression is the source of an assignment to a - * simple variable, target_param holds that variable's dno (else it's -1). + * simple variable, target_param holds that variable's dno (else it's -1), + * and target_is_local indicates whether the target is declared inside the + * closest exception block containing the assignment. */ int target_param; /* dno of assign target, or -1 if none */ + bool target_is_local; /* is it within nearest exception block? */ /* * Fields above are set during plpgsql parsing. Remaining fields are left @@ -1014,6 +1017,7 @@ typedef struct PLpgSQL_function /* data derived while parsing body */ unsigned int nstatements; /* counter for assigning stmtids */ bool requires_procedure_resowner; /* contains CALL or DO? */ + bool has_exception_block; /* contains BEGIN...EXCEPTION? */ /* these fields change when the function is used */ struct PLpgSQL_execstate *cur_estate; @@ -1314,6 +1318,7 @@ extern PLpgSQL_nsitem *plpgsql_ns_find_nearest_loop(PLpgSQL_nsitem *ns_cur); */ extern PGDLLEXPORT const char *plpgsql_stmt_typename(PLpgSQL_stmt *stmt); extern const char *plpgsql_getdiag_kindname(PLpgSQL_getdiag_kind kind); +extern void plpgsql_mark_local_assignment_targets(PLpgSQL_function *func); extern void plpgsql_free_function_memory(PLpgSQL_function *func); extern void plpgsql_dumptree(PLpgSQL_function *func); -- 2.43.5 From 4666ac2ba3433574cd3024f49f58b290a58b1f86 Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Tue, 3 Dec 2024 14:43:33 -0500 Subject: [PATCH v2 3/4] Implement new optimization rule for updates of expanded variables. If a read/write expanded variable is declared locally to the assignment statement that is updating it, and it is referenced exactly once in the assignment RHS, then we can optimize the operation as a direct update of the expanded value, whether or not the function(s) operating on it can be trusted not to modify the value before throwing an error. This works because if an error does get thrown, we no longer care what value the variable has. In cases where that doesn't work, fall back to the previous rule that checks for safety of the top-level function. In any case, postpone determination of whether these optimizations are feasible until we are executing a Param referencing the target variable and that variable holds a R/W expanded object. While the previous incarnation of exec_check_rw_parameter was pretty cheap, this is a bit less so, and our plan to invoke support functions will make it even less so. So avoiding the check for variables where it couldn't be useful should be a win. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/include/executor/execExpr.h | 1 + src/pl/plpgsql/src/expected/plpgsql_array.out | 9 + src/pl/plpgsql/src/pl_exec.c | 376 +++++++++++++++--- src/pl/plpgsql/src/plpgsql.h | 22 +- src/pl/plpgsql/src/sql/plpgsql_array.sql | 9 + src/tools/pgindent/typedefs.list | 2 + 6 files changed, 357 insertions(+), 62 deletions(-) diff --git a/src/include/executor/execExpr.h b/src/include/executor/execExpr.h index 56fb0d0adb..7d58b3dc9c 100644 --- a/src/include/executor/execExpr.h +++ b/src/include/executor/execExpr.h @@ -406,6 +406,7 @@ typedef struct ExprEvalStep { ExecEvalSubroutine paramfunc; /* add-on evaluation subroutine */ void *paramarg; /* private data for same */ + void *paramarg2; /* more private data for same */ int paramid; /* numeric ID for parameter */ Oid paramtype; /* OID of parameter's datatype */ } cparam; diff --git a/src/pl/plpgsql/src/expected/plpgsql_array.out b/src/pl/plpgsql/src/expected/plpgsql_array.out index ad60e0e8be..e5db6d6087 100644 --- a/src/pl/plpgsql/src/expected/plpgsql_array.out +++ b/src/pl/plpgsql/src/expected/plpgsql_array.out @@ -52,6 +52,15 @@ NOTICE: a = ("{""(,11)""}",), a.c1[1].i = 11 do $$ declare a int[]; begin a := array_agg(x) from (values(1),(2),(3)) v(x); raise notice 'a = %', a; end$$; NOTICE: a = {1,2,3} +do $$ declare a int[] := array[1,2,3]; +begin + -- test scenarios for optimization of updates of R/W expanded objects + a := array_append(a, 42); -- optimizable using "transfer" method + a := a || a[3]; -- optimizable using "inplace" method + a := a || a; -- not optimizable + raise notice 'a = %', a; +end$$; +NOTICE: a = {1,2,3,42,3,1,2,3,42,3} create temp table onecol as select array[1,2] as f1; do $$ declare a int[]; begin a := f1 from onecol; raise notice 'a = %', a; end$$; diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index 1a9c010205..ae878782b8 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -251,6 +251,15 @@ static HTAB *shared_cast_hash = NULL; else \ Assert(rc == PLPGSQL_RC_OK) +/* State struct for count_param_references */ +typedef struct count_param_references_context +{ + int paramid; + int count; + Param *last_param; +} count_param_references_context; + + /************************************************************ * Local function forward declarations ************************************************************/ @@ -336,7 +345,9 @@ static void exec_prepare_plan(PLpgSQL_execstate *estate, static void exec_simple_check_plan(PLpgSQL_execstate *estate, PLpgSQL_expr *expr); static bool exec_is_simple_query(PLpgSQL_expr *expr); static void exec_save_simple_expr(PLpgSQL_expr *expr, CachedPlan *cplan); -static void exec_check_rw_parameter(PLpgSQL_expr *expr); +static void exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid); +static bool count_param_references(Node *node, + count_param_references_context *context); static void exec_check_assignable(PLpgSQL_execstate *estate, int dno); static bool exec_eval_simple_expr(PLpgSQL_execstate *estate, PLpgSQL_expr *expr, @@ -384,6 +395,10 @@ static ParamExternData *plpgsql_param_fetch(ParamListInfo params, static void plpgsql_param_compile(ParamListInfo params, Param *param, ExprState *state, Datum *resv, bool *resnull); +static void plpgsql_param_eval_var_check(ExprState *state, ExprEvalStep *op, + ExprContext *econtext); +static void plpgsql_param_eval_var_transfer(ExprState *state, ExprEvalStep *op, + ExprContext *econtext); static void plpgsql_param_eval_var(ExprState *state, ExprEvalStep *op, ExprContext *econtext); static void plpgsql_param_eval_var_ro(ExprState *state, ExprEvalStep *op, @@ -6078,10 +6093,13 @@ exec_eval_simple_expr(PLpgSQL_execstate *estate, /* * Reset to "not simple" to leave sane state (with no dangling - * pointers) in case we fail while replanning. expr_simple_plansource - * can be left alone however, as that cannot move. + * pointers) in case we fail while replanning. We'll need to + * re-determine simplicity and R/W optimizability anyway, since those + * could change with the new plan. expr_simple_plansource can be left + * alone however, as that cannot move. */ expr->expr_simple_expr = NULL; + expr->expr_rwopt = PLPGSQL_RWOPT_UNKNOWN; expr->expr_rw_param = NULL; expr->expr_simple_plan = NULL; expr->expr_simple_plan_lxid = InvalidLocalTransactionId; @@ -6439,16 +6457,27 @@ plpgsql_param_compile(ParamListInfo params, Param *param, scratch.resnull = resnull; /* - * Select appropriate eval function. It seems worth special-casing - * DTYPE_VAR and DTYPE_RECFIELD for performance. Also, we can determine - * in advance whether MakeExpandedObjectReadOnly() will be required. - * Currently, only VAR/PROMISE and REC datums could contain read/write - * expanded objects. + * Select appropriate eval function. + * + * First, if this Param references the same varlena-type DTYPE_VAR datum + * that is the target of the assignment containing this simple expression, + * then it's possible we will be able to optimize handling of R/W expanded + * datums. We don't want to do the work needed to determine that unless + * we actually see a R/W expanded datum at runtime, so install a checking + * function that will figure that out when needed. + * + * Otherwise, it seems worth special-casing DTYPE_VAR and DTYPE_RECFIELD + * for performance. Also, we can determine in advance whether + * MakeExpandedObjectReadOnly() will be required. Currently, only + * VAR/PROMISE and REC datums could contain read/write expanded objects. */ if (datum->dtype == PLPGSQL_DTYPE_VAR) { - if (param != expr->expr_rw_param && - ((PLpgSQL_var *) datum)->datatype->typlen == -1) + bool isvarlena = (((PLpgSQL_var *) datum)->datatype->typlen == -1); + + if (isvarlena && dno == expr->target_param && expr->expr_simple_expr) + scratch.d.cparam.paramfunc = plpgsql_param_eval_var_check; + else if (isvarlena) scratch.d.cparam.paramfunc = plpgsql_param_eval_var_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_var; @@ -6457,14 +6486,12 @@ plpgsql_param_compile(ParamListInfo params, Param *param, scratch.d.cparam.paramfunc = plpgsql_param_eval_recfield; else if (datum->dtype == PLPGSQL_DTYPE_PROMISE) { - if (param != expr->expr_rw_param && - ((PLpgSQL_var *) datum)->datatype->typlen == -1) + if (((PLpgSQL_var *) datum)->datatype->typlen == -1) scratch.d.cparam.paramfunc = plpgsql_param_eval_generic_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_generic; } - else if (datum->dtype == PLPGSQL_DTYPE_REC && - param != expr->expr_rw_param) + else if (datum->dtype == PLPGSQL_DTYPE_REC) scratch.d.cparam.paramfunc = plpgsql_param_eval_generic_ro; else scratch.d.cparam.paramfunc = plpgsql_param_eval_generic; @@ -6473,14 +6500,170 @@ plpgsql_param_compile(ParamListInfo params, Param *param, * Note: it's tempting to use paramarg to store the estate pointer and * thereby save an indirection or two in the eval functions. But that * doesn't work because the compiled expression might be used with - * different estates for the same PL/pgSQL function. + * different estates for the same PL/pgSQL function. Instead, store + * pointers to the PLpgSQL_expr as well as this specific Param, to support + * plpgsql_param_eval_var_check(). */ - scratch.d.cparam.paramarg = NULL; + scratch.d.cparam.paramarg = expr; + scratch.d.cparam.paramarg2 = param; scratch.d.cparam.paramid = param->paramid; scratch.d.cparam.paramtype = param->paramtype; ExprEvalPushStep(state, &scratch); } +/* + * plpgsql_param_eval_var_check evaluation of EEOP_PARAM_CALLBACK step + * + * This is specialized to the case of DTYPE_VAR variables for which + * we may need to determine the applicability of a read/write optimization, + * but we've not done that yet. + */ +static void +plpgsql_param_eval_var_check(ExprState *state, ExprEvalStep *op, + ExprContext *econtext) +{ + ParamListInfo params; + PLpgSQL_execstate *estate; + int dno = op->d.cparam.paramid - 1; + PLpgSQL_var *var; + + /* fetch back the hook data */ + params = econtext->ecxt_param_list_info; + estate = (PLpgSQL_execstate *) params->paramFetchArg; + Assert(dno >= 0 && dno < estate->ndatums); + + /* now we can access the target datum */ + var = (PLpgSQL_var *) estate->datums[dno]; + Assert(var->dtype == PLPGSQL_DTYPE_VAR); + + /* + * If the variable's current value is a R/W expanded object, it's time to + * decide whether/how to optimize the assignment. + */ + if (!var->isnull && + VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value))) + { + PLpgSQL_expr *expr = (PLpgSQL_expr *) op->d.cparam.paramarg; + Param *param = (Param *) op->d.cparam.paramarg2; + + /* + * We might have already figured this out while evaluating some other + * Param referencing the same variable. + */ + if (expr->expr_rwopt == PLPGSQL_RWOPT_UNKNOWN) + exec_check_rw_parameter(expr, op->d.cparam.paramid); + + /* + * Update the callback pointer to match what we decided to do, and + * pass off this execution to the selected function. + */ + switch (expr->expr_rwopt) + { + case PLPGSQL_RWOPT_UNKNOWN: + Assert(false); + break; + case PLPGSQL_RWOPT_NOPE: + /* Force the value to read-only in all future executions */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_ro; + plpgsql_param_eval_var_ro(state, op, econtext); + break; + case PLPGSQL_RWOPT_TRANSFER: + /* There can be only one matching Param in this case */ + Assert(param == expr->expr_rw_param); + /* When the value is read/write, transfer to exec context */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_transfer; + plpgsql_param_eval_var_transfer(state, op, econtext); + break; + case PLPGSQL_RWOPT_INPLACE: + if (param == expr->expr_rw_param) + { + /* When the value is read/write, deliver it as-is */ + op->d.cparam.paramfunc = plpgsql_param_eval_var; + plpgsql_param_eval_var(state, op, econtext); + } + else + { + /* Not the optimizable reference, so force to read-only */ + op->d.cparam.paramfunc = plpgsql_param_eval_var_ro; + plpgsql_param_eval_var_ro(state, op, econtext); + } + break; + } + return; + } + + /* + * Otherwise, continue to postpone that decision, and execute an inlined + * version of exec_eval_datum(). Although this value could potentially + * need MakeExpandedObjectReadOnly, we know it doesn't right now. + */ + *op->resvalue = var->value; + *op->resnull = var->isnull; + + /* safety check -- an assertion should be sufficient */ + Assert(var->datatype->typoid == op->d.cparam.paramtype); +} + +/* + * plpgsql_param_eval_var_transfer evaluation of EEOP_PARAM_CALLBACK step + * + * This is specialized to the case of DTYPE_VAR variables for which + * we have determined that a read/write expanded value can be handed off + * into execution of the expression (and then possibly returned to our + * function's ownership afterwards). We have to test though, because the + * variable might not contain a read/write expanded value during this + * execution. + */ +static void +plpgsql_param_eval_var_transfer(ExprState *state, ExprEvalStep *op, + ExprContext *econtext) +{ + ParamListInfo params; + PLpgSQL_execstate *estate; + int dno = op->d.cparam.paramid - 1; + PLpgSQL_var *var; + + /* fetch back the hook data */ + params = econtext->ecxt_param_list_info; + estate = (PLpgSQL_execstate *) params->paramFetchArg; + Assert(dno >= 0 && dno < estate->ndatums); + + /* now we can access the target datum */ + var = (PLpgSQL_var *) estate->datums[dno]; + Assert(var->dtype == PLPGSQL_DTYPE_VAR); + + /* + * If the variable's current value is a R/W expanded object, transfer its + * ownership into the expression execution context, then drop our own + * reference to the value by setting the variable to NULL. That'll be + * overwritten (perhaps with this same object) when control comes back + * from the expression. + */ + if (!var->isnull && + VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value))) + { + *op->resvalue = TransferExpandedObject(var->value, + get_eval_mcontext(estate)); + *op->resnull = false; + + var->value = (Datum) 0; + var->isnull = true; + var->freeval = false; + } + else + { + /* + * Otherwise we can pass the variable's value directly; we now know + * that MakeExpandedObjectReadOnly isn't needed. + */ + *op->resvalue = var->value; + *op->resnull = var->isnull; + } + + /* safety check -- an assertion should be sufficient */ + Assert(var->datatype->typoid == op->d.cparam.paramtype); +} + /* * plpgsql_param_eval_var evaluation of EEOP_PARAM_CALLBACK step * @@ -7957,9 +8140,10 @@ exec_simple_check_plan(PLpgSQL_execstate *estate, PLpgSQL_expr *expr) MemoryContext oldcontext; /* - * Initialize to "not simple". + * Initialize to "not simple", and reset R/W optimizability. */ expr->expr_simple_expr = NULL; + expr->expr_rwopt = PLPGSQL_RWOPT_UNKNOWN; expr->expr_rw_param = NULL; /* @@ -8164,88 +8348,133 @@ exec_save_simple_expr(PLpgSQL_expr *expr, CachedPlan *cplan) expr->expr_simple_typmod = exprTypmod((Node *) tle_expr); /* We also want to remember if it is immutable or not */ expr->expr_simple_mutable = contain_mutable_functions((Node *) tle_expr); - - /* - * Lastly, check to see if there's a possibility of optimizing a - * read/write parameter. - */ - exec_check_rw_parameter(expr); } /* * exec_check_rw_parameter --- can we pass expanded object as read/write param? * - * If we have an assignment like "x := array_append(x, foo)" in which the + * There are two separate cases in which we can optimize an update to a + * variable that has a read/write expanded value by letting the called + * expression operate directly on the expanded value. In both cases we + * are considering assignments like "var := array_append(var, foo)" where + * the assignment target is also an input to the RHS expression. + * + * Case 1 (RWOPT_TRANSFER rule): if the variable is "local" in the sense that + * its declaration is not outside any BEGIN...EXCEPTION block surrounding the + * assignment, then we do not need to worry about preserving its value if the + * RHS expression throws an error. If in addition the variable is referenced + * exactly once in the RHS expression, then we can optimize by converting the + * read/write expanded value into a transient value within the expression + * evaluation context, and then setting the variable's recorded value to NULL + * to prevent double-free attempts. This works regardless of any other + * details of the RHS expression. If the expression eventually returns that + * same expanded object (possibly modified) then the variable will re-acquire + * ownership; while if it returns something else or throws an error, the + * expanded object will be discarded as part of cleanup of the evaluation + * context. + * + * Case 2 (RWOPT_INPLACE rule): if we have a non-local assignment or if + * it looks like "var := array_append(var, var[1])" with multiple references + * to the target variable, then we can't use case 1. Nonetheless, if the * top-level function is trusted not to corrupt its argument in case of an - * error, then when x has an expanded object as value, it is safe to pass the - * value as a read/write pointer and let the function modify the value - * in-place. + * error, then when the var has an expanded object as value, it is safe to + * pass the value as a read/write pointer to the top-level function and let + * the function modify the value in-place. (Any other references have to be + * passed as read-only pointers as usual.) Only the top-level function has to + * be trusted, since if anything further down fails, the object hasn't been + * modified yet. * - * This function checks for a safe expression, and sets expr->expr_rw_param - * to the address of any Param within the expression that can be passed as - * read/write (there can be only one); or to NULL when there is no safe Param. + * This function checks to see if the assignment is optimizable according + * to either rule, and updates expr->expr_rwopt accordingly. In addition, + * it sets expr->expr_rw_param to the address of the Param within the + * expression that can be passed as read/write (there can be only one); + * or to NULL when there is no safe Param. * - * Note that this mechanism intentionally applies the safety labeling to just - * one Param; the expression could contain other Params referencing the target - * variable, but those must still be treated as read-only. + * Note that this mechanism intentionally allows just one Param to emit a + * read/write pointer; in case 2, the expression could contain other Params + * referencing the target variable, but those must be treated as read-only. * * Also note that we only apply this optimization within simple expressions. * There's no point in it for non-simple expressions, because the * exec_run_select code path will flatten any expanded result anyway. - * Also, it's safe to assume that an expr_simple_expr tree won't get copied - * somewhere before it gets compiled, so that looking for pointer equality - * to expr_rw_param will work for matching the target Param. That'd be much - * shakier in the general case. */ static void -exec_check_rw_parameter(PLpgSQL_expr *expr) +exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid) { - int target_dno; + Expr *sexpr = expr->expr_simple_expr; Oid funcid; List *fargs; ListCell *lc; /* Assume unsafe */ + expr->expr_rwopt = PLPGSQL_RWOPT_NOPE; expr->expr_rw_param = NULL; - /* Done if expression isn't an assignment source */ - target_dno = expr->target_param; - if (target_dno < 0) - return; + /* Shouldn't be here for non-simple expression */ + Assert(sexpr != NULL); + + /* Param should match the expression's assignment target, too */ + Assert(paramid == expr->target_param + 1); /* - * If target variable isn't referenced by expression, no need to look - * further. + * If the assignment is to a "local" variable (one whose value won't + * matter anymore if expression evaluation fails), and this Param is the + * only reference to that variable in the expression, then we can + * unconditionally optimize using the "transfer" method. */ - if (!bms_is_member(target_dno, expr->paramnos)) - return; + if (expr->target_is_local) + { + count_param_references_context context; - /* Shouldn't be here for non-simple expression */ - Assert(expr->expr_simple_expr != NULL); + /* See how many references there are, and find one of them */ + context.paramid = paramid; + context.count = 0; + context.last_param = NULL; + (void) count_param_references((Node *) sexpr, &context); + + /* If we're here, the expr must contain some reference to the var */ + Assert(context.count > 0); + + /* If exactly one reference, success! */ + if (context.count == 1) + { + expr->expr_rwopt = PLPGSQL_RWOPT_TRANSFER; + expr->expr_rw_param = context.last_param; + return; + } + } /* + * Otherwise, see if we can trust the expression's top-level function to + * apply the "inplace" method. + * * Top level of expression must be a simple FuncExpr, OpExpr, or - * SubscriptingRef, else we can't optimize. + * SubscriptingRef, else we can't identify which function is relevant. But + * it's okay to look through any RelabelType above that, since that can't + * fail. */ - if (IsA(expr->expr_simple_expr, FuncExpr)) + if (IsA(sexpr, RelabelType)) + sexpr = ((RelabelType *) sexpr)->arg; + if (IsA(sexpr, FuncExpr)) { - FuncExpr *fexpr = (FuncExpr *) expr->expr_simple_expr; + FuncExpr *fexpr = (FuncExpr *) sexpr; funcid = fexpr->funcid; fargs = fexpr->args; } - else if (IsA(expr->expr_simple_expr, OpExpr)) + else if (IsA(sexpr, OpExpr)) { - OpExpr *opexpr = (OpExpr *) expr->expr_simple_expr; + OpExpr *opexpr = (OpExpr *) sexpr; funcid = opexpr->opfuncid; fargs = opexpr->args; } - else if (IsA(expr->expr_simple_expr, SubscriptingRef)) + else if (IsA(sexpr, SubscriptingRef)) { - SubscriptingRef *sbsref = (SubscriptingRef *) expr->expr_simple_expr; + SubscriptingRef *sbsref = (SubscriptingRef *) sexpr; /* We only trust standard varlena arrays to be safe */ + /* TODO: install some extensibility here */ if (get_typsubscript(sbsref->refcontainertype, NULL) != F_ARRAY_SUBSCRIPT_HANDLER) return; @@ -8256,9 +8485,10 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) Param *param = (Param *) sbsref->refexpr; if (param->paramkind == PARAM_EXTERN && - param->paramid == target_dno + 1) + param->paramid == paramid) { /* Found the Param we want to pass as read/write */ + expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; expr->expr_rw_param = param; return; } @@ -8293,9 +8523,10 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) Param *param = (Param *) arg; if (param->paramkind == PARAM_EXTERN && - param->paramid == target_dno + 1) + param->paramid == paramid) { /* Found the Param we want to pass as read/write */ + expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; expr->expr_rw_param = param; return; } @@ -8303,6 +8534,35 @@ exec_check_rw_parameter(PLpgSQL_expr *expr) } } +/* + * Count Params referencing the specified paramid, and return one of them + * if there are any. + * + * We actually only need to distinguish 0, 1, and N references; so we can + * abort the tree traversal as soon as we've found two. + */ +static bool +count_param_references(Node *node, count_param_references_context *context) +{ + if (node == NULL) + return false; + else if (IsA(node, Param)) + { + Param *param = (Param *) node; + + if (param->paramkind == PARAM_EXTERN && + param->paramid == context->paramid) + { + context->last_param = param; + if (++(context->count) > 1) + return true; /* abort tree traversal */ + } + return false; + } + else + return expression_tree_walker(node, count_param_references, context); +} + /* * exec_check_assignable --- is it OK to assign to the indicated datum? * diff --git a/src/pl/plpgsql/src/plpgsql.h b/src/pl/plpgsql/src/plpgsql.h index c6fadc5660..3bafeea28b 100644 --- a/src/pl/plpgsql/src/plpgsql.h +++ b/src/pl/plpgsql/src/plpgsql.h @@ -187,6 +187,17 @@ typedef enum PLpgSQL_resolve_option PLPGSQL_RESOLVE_COLUMN, /* prefer table column to plpgsql var */ } PLpgSQL_resolve_option; +/* + * Status of optimization of assignment to a read/write expanded object + */ +typedef enum PLpgSQL_rwopt +{ + PLPGSQL_RWOPT_UNKNOWN = 0, /* applicability not determined yet */ + PLPGSQL_RWOPT_NOPE, /* cannot do any optimization */ + PLPGSQL_RWOPT_TRANSFER, /* transfer the old value into expr state */ + PLPGSQL_RWOPT_INPLACE, /* pass value as R/W to top-level function */ +} PLpgSQL_rwopt; + /********************************************************************** * Node and structure definitions @@ -246,11 +257,14 @@ typedef struct PLpgSQL_expr bool expr_simple_mutable; /* true if simple expr is mutable */ /* - * If we match a Param within expr_simple_expr to the variable identified - * by target_param, that Param's address is stored in expr_rw_param; then - * expression code generation will allow the value for that Param to be - * passed as a read/write expanded-object pointer. + * expr_rwopt tracks whether we have determined that assignment to a + * read/write expanded object (stored in the target_param datum) can be + * optimized by passing it to the expr as a read/write expanded-object + * pointer. If so, expr_rw_param identifies the specific Param that + * should emit a read/write pointer; any others will emit read-only + * pointers. */ + PLpgSQL_rwopt expr_rwopt; /* can we apply R/W optimization? */ Param *expr_rw_param; /* read/write Param within expr, if any */ /* diff --git a/src/pl/plpgsql/src/sql/plpgsql_array.sql b/src/pl/plpgsql/src/sql/plpgsql_array.sql index 4b9ff51594..4a346203dc 100644 --- a/src/pl/plpgsql/src/sql/plpgsql_array.sql +++ b/src/pl/plpgsql/src/sql/plpgsql_array.sql @@ -48,6 +48,15 @@ begin a.c1[1].i := 11; raise notice 'a = %, a.c1[1].i = %', a, a.c1[1].i; end$$; do $$ declare a int[]; begin a := array_agg(x) from (values(1),(2),(3)) v(x); raise notice 'a = %', a; end$$; +do $$ declare a int[] := array[1,2,3]; +begin + -- test scenarios for optimization of updates of R/W expanded objects + a := array_append(a, 42); -- optimizable using "transfer" method + a := a || a[3]; -- optimizable using "inplace" method + a := a || a; -- not optimizable + raise notice 'a = %', a; +end$$; + create temp table onecol as select array[1,2] as f1; do $$ declare a int[]; diff --git a/src/tools/pgindent/typedefs.list b/src/tools/pgindent/typedefs.list index 2d4c870423..80328115a1 100644 --- a/src/tools/pgindent/typedefs.list +++ b/src/tools/pgindent/typedefs.list @@ -1867,6 +1867,7 @@ PLpgSQL_rec PLpgSQL_recfield PLpgSQL_resolve_option PLpgSQL_row +PLpgSQL_rwopt PLpgSQL_stmt PLpgSQL_stmt_assert PLpgSQL_stmt_assign @@ -3392,6 +3393,7 @@ core_yy_extra_type core_yyscan_t corrupt_items cost_qual_eval_context +count_param_references_context cp_hash_func create_upper_paths_hook_type createdb_failure_params -- 2.43.5 From 62828464b0b74a46da56d110ce2c831641f0b2bc Mon Sep 17 00:00:00 2001 From: Tom Lane <tgl@sss.pgh.pa.us> Date: Tue, 3 Dec 2024 19:25:22 -0500 Subject: [PATCH v2 4/4] Allow extension functions to participate in in-place updates. Commit 1dc5ebc90 allowed PL/pgSQL to perform in-place updates of expanded-object variables that are being updated with assignments like "x := f(x, ...)". However this was allowed only for a hard-wired list of functions f(), since we need to be sure that f() will not modify the variable if it fails. It was always envisioned that we should make that extensible, but at the time we didn't have a good way to do so. Since then we've invented the idea of "support functions" to allow attaching specialized optimization knowledge to functions, and that is a perfect mechanism for doing this. Hence, adjust PL/pgSQL to use a support function request instead of hard-wired logic to decide if in-place update is safe. Replace the previous behavior by creating support functions for the three functions that were previously hard-wired. Discussion: https://postgr.es/m/CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com --- src/backend/utils/adt/array_userfuncs.c | 61 +++++++++++++ src/backend/utils/adt/arraysubs.c | 34 ++++++++ src/include/catalog/pg_proc.dat | 20 +++-- src/include/nodes/supportnodes.h | 55 +++++++++++- src/pl/plpgsql/src/expected/plpgsql_array.out | 3 +- src/pl/plpgsql/src/pl_exec.c | 86 ++++++++----------- src/pl/plpgsql/src/sql/plpgsql_array.sql | 1 + src/tools/pgindent/typedefs.list | 1 + 8 files changed, 202 insertions(+), 59 deletions(-) diff --git a/src/backend/utils/adt/array_userfuncs.c b/src/backend/utils/adt/array_userfuncs.c index 304a93112e..bddacd4802 100644 --- a/src/backend/utils/adt/array_userfuncs.c +++ b/src/backend/utils/adt/array_userfuncs.c @@ -16,6 +16,7 @@ #include "common/int.h" #include "common/pg_prng.h" #include "libpq/pqformat.h" +#include "nodes/supportnodes.h" #include "port/pg_bitutils.h" #include "utils/array.h" #include "utils/builtins.h" @@ -167,6 +168,36 @@ array_append(PG_FUNCTION_ARGS) PG_RETURN_DATUM(result); } +/* + * array_append_support() + * + * Planner support function for array_append() + */ +Datum +array_append_support(PG_FUNCTION_ARGS) +{ + Node *rawreq = (Node *) PG_GETARG_POINTER(0); + Node *ret = NULL; + + if (IsA(rawreq, SupportRequestModifyInPlace)) + { + /* + * We can optimize in-place appends if the function's array argument + * is the array being assigned to. We don't need to worry about array + * references within the other argument. + */ + SupportRequestModifyInPlace *req = (SupportRequestModifyInPlace *) rawreq; + Param *arg = (Param *) linitial(req->args); + + if (arg && IsA(arg, Param) && + arg->paramkind == PARAM_EXTERN && + arg->paramid == req->paramid) + ret = (Node *) arg; + } + + PG_RETURN_POINTER(ret); +} + /*----------------------------------------------------------------------------- * array_prepend : * push an element onto the front of a one-dimensional array @@ -230,6 +261,36 @@ array_prepend(PG_FUNCTION_ARGS) PG_RETURN_DATUM(result); } +/* + * array_prepend_support() + * + * Planner support function for array_prepend() + */ +Datum +array_prepend_support(PG_FUNCTION_ARGS) +{ + Node *rawreq = (Node *) PG_GETARG_POINTER(0); + Node *ret = NULL; + + if (IsA(rawreq, SupportRequestModifyInPlace)) + { + /* + * We can optimize in-place prepends if the function's array argument + * is the array being assigned to. We don't need to worry about array + * references within the other argument. + */ + SupportRequestModifyInPlace *req = (SupportRequestModifyInPlace *) rawreq; + Param *arg = (Param *) lsecond(req->args); + + if (arg && IsA(arg, Param) && + arg->paramkind == PARAM_EXTERN && + arg->paramid == req->paramid) + ret = (Node *) arg; + } + + PG_RETURN_POINTER(ret); +} + /*----------------------------------------------------------------------------- * array_cat : * concatenate two nD arrays to form an nD array, or diff --git a/src/backend/utils/adt/arraysubs.c b/src/backend/utils/adt/arraysubs.c index 6f68dfa5b2..3c4f1664d3 100644 --- a/src/backend/utils/adt/arraysubs.c +++ b/src/backend/utils/adt/arraysubs.c @@ -18,6 +18,7 @@ #include "nodes/makefuncs.h" #include "nodes/nodeFuncs.h" #include "nodes/subscripting.h" +#include "nodes/supportnodes.h" #include "parser/parse_coerce.h" #include "parser/parse_expr.h" #include "utils/array.h" @@ -575,3 +576,36 @@ raw_array_subscript_handler(PG_FUNCTION_ARGS) PG_RETURN_POINTER(&sbsroutines); } + +/* + * array_subscript_handler_support() + * + * Planner support function for array_subscript_handler() + */ +Datum +array_subscript_handler_support(PG_FUNCTION_ARGS) +{ + Node *rawreq = (Node *) PG_GETARG_POINTER(0); + Node *ret = NULL; + + if (IsA(rawreq, SupportRequestModifyInPlace)) + { + /* + * We can optimize in-place subscripted assignment if the refexpr is + * the array being assigned to. We don't need to worry about array + * references within the refassgnexpr or the subscripts; however, if + * there's no refassgnexpr then it's a fetch which there's no need to + * optimize. + */ + SupportRequestModifyInPlace *req = (SupportRequestModifyInPlace *) rawreq; + Param *refexpr = (Param *) linitial(req->args); + + if (refexpr && IsA(refexpr, Param) && + refexpr->paramkind == PARAM_EXTERN && + refexpr->paramid == req->paramid && + lsecond(req->args) != NULL) + ret = (Node *) refexpr; + } + + PG_RETURN_POINTER(ret); +} diff --git a/src/include/catalog/pg_proc.dat b/src/include/catalog/pg_proc.dat index 9575524007..79a9dc383a 100644 --- a/src/include/catalog/pg_proc.dat +++ b/src/include/catalog/pg_proc.dat @@ -1598,14 +1598,20 @@ proname => 'cardinality', prorettype => 'int4', proargtypes => 'anyarray', prosrc => 'array_cardinality' }, { oid => '378', descr => 'append element onto end of array', - proname => 'array_append', proisstrict => 'f', - prorettype => 'anycompatiblearray', + proname => 'array_append', prosupport => 'array_append_support', + proisstrict => 'f', prorettype => 'anycompatiblearray', proargtypes => 'anycompatiblearray anycompatible', prosrc => 'array_append' }, +{ oid => '8680', descr => 'planner support for array_append', + proname => 'array_append_support', prorettype => 'internal', + proargtypes => 'internal', prosrc => 'array_append_support' }, { oid => '379', descr => 'prepend element onto front of array', - proname => 'array_prepend', proisstrict => 'f', - prorettype => 'anycompatiblearray', + proname => 'array_prepend', prosupport => 'array_prepend_support', + proisstrict => 'f', prorettype => 'anycompatiblearray', proargtypes => 'anycompatible anycompatiblearray', prosrc => 'array_prepend' }, +{ oid => '8681', descr => 'planner support for array_prepend', + proname => 'array_prepend_support', prorettype => 'internal', + proargtypes => 'internal', prosrc => 'array_prepend_support' }, { oid => '383', proname => 'array_cat', proisstrict => 'f', prorettype => 'anycompatiblearray', @@ -12160,8 +12166,12 @@ # subscripting support for built-in types { oid => '6179', descr => 'standard array subscripting support', - proname => 'array_subscript_handler', prorettype => 'internal', + proname => 'array_subscript_handler', + prosupport => 'array_subscript_handler_support', prorettype => 'internal', proargtypes => 'internal', prosrc => 'array_subscript_handler' }, +{ oid => '8682', descr => 'planner support for array_subscript_handler', + proname => 'array_subscript_handler_support', prorettype => 'internal', + proargtypes => 'internal', prosrc => 'array_subscript_handler_support' }, { oid => '6180', descr => 'raw array subscripting support', proname => 'raw_array_subscript_handler', prorettype => 'internal', proargtypes => 'internal', prosrc => 'raw_array_subscript_handler' }, diff --git a/src/include/nodes/supportnodes.h b/src/include/nodes/supportnodes.h index 5f7bcde891..1ca6e477ac 100644 --- a/src/include/nodes/supportnodes.h +++ b/src/include/nodes/supportnodes.h @@ -6,10 +6,10 @@ * This file defines the API for "planner support functions", which * are SQL functions (normally written in C) that can be attached to * another "target" function to give the system additional knowledge - * about the target function. All the current capabilities have to do - * with planning queries that use the target function, though it is - * possible that future extensions will add functionality to be invoked - * by the parser or executor. + * about the target function. The name is now something of a misnomer, + * since some of the call sites are in the executor not the planner, + * but "function support function" would be a confusing name so we + * stick with "planner support function". * * A support function must have the SQL signature * supportfn(internal) returns internal @@ -343,4 +343,51 @@ typedef struct SupportRequestOptimizeWindowClause * optimizations are possible. */ } SupportRequestOptimizeWindowClause; +/* + * The ModifyInPlace request allows the support function to detect whether + * a call to its target function can be allowed to modify a read/write + * expanded object in-place. The context is that we are considering a + * PL/pgSQL (or similar PL) assignment of the form "x := f(x, ...)" where + * the variable x is of a type that can be represented as an expanded object + * (see utils/expandeddatum.h). If f() can usefully optimize by modifying + * the passed-in object in-place, then this request can be implemented to + * instruct PL/pgSQL to pass a read-write expanded pointer to the variable's + * value. (Note that there is no guarantee that later calls to f() will + * actually do so. If f() receives a read-only pointer, or a pointer to a + * non-expanded object, it must follow the usual convention of not modifying + * the pointed-to object.) There are two requirements that must be met + * to make this safe: + * 1. f() must guarantee that it will not have modified the object if it + * fails. Otherwise the variable's value might change unexpectedly. + * 2. If the other arguments to f() ("..." in the above example) contain + * references to x, f() must be able to cope with that; or if that's not + * safe, the support function must scan the other arguments to verify that + * there are no other references to x. An example of the concern here is + * that in "arr := array_append(arr, arr[1])", if the array element type + * is pass-by-reference then array_append would receive a second argument + * that points into the array object it intends to modify. array_append is + * coded to make that safe, but other functions might not be able to cope. + * + * "args" is a node tree list representing the function's arguments. + * One or more nodes within the node tree will be PARAM_EXTERN Params + * with ID "paramid", which represent the assignment target variable. + * (Note that such references are not necessarily at top level in the list, + * for example we might have "x := f(x, g(x))". Generally it's only safe + * to optimize a reference that is at top level, else we're making promises + * about the behavior of g() as well as f().) + * + * If modify-in-place is safe, the support function should return the + * address of the Param node that is to return a read-write pointer. + * (At most one of the references is allowed to do so.) Otherwise, + * return NULL. + */ +typedef struct SupportRequestModifyInPlace +{ + NodeTag type; + + Oid funcid; /* PG_PROC OID of the target function */ + List *args; /* Arguments to the function */ + int paramid; /* ID of Param(s) representing variable */ +} SupportRequestModifyInPlace; + #endif /* SUPPORTNODES_H */ diff --git a/src/pl/plpgsql/src/expected/plpgsql_array.out b/src/pl/plpgsql/src/expected/plpgsql_array.out index e5db6d6087..4c6b3ce998 100644 --- a/src/pl/plpgsql/src/expected/plpgsql_array.out +++ b/src/pl/plpgsql/src/expected/plpgsql_array.out @@ -57,10 +57,11 @@ begin -- test scenarios for optimization of updates of R/W expanded objects a := array_append(a, 42); -- optimizable using "transfer" method a := a || a[3]; -- optimizable using "inplace" method + a := a[1] || a; -- ditto, but let's test array_prepend a := a || a; -- not optimizable raise notice 'a = %', a; end$$; -NOTICE: a = {1,2,3,42,3,1,2,3,42,3} +NOTICE: a = {1,1,2,3,42,3,1,1,2,3,42,3} create temp table onecol as select array[1,2] as f1; do $$ declare a int[]; begin a := f1 from onecol; raise notice 'a = %', a; end$$; diff --git a/src/pl/plpgsql/src/pl_exec.c b/src/pl/plpgsql/src/pl_exec.c index ae878782b8..e3643fc7a8 100644 --- a/src/pl/plpgsql/src/pl_exec.c +++ b/src/pl/plpgsql/src/pl_exec.c @@ -29,6 +29,7 @@ #include "mb/stringinfo_mb.h" #include "miscadmin.h" #include "nodes/nodeFuncs.h" +#include "nodes/supportnodes.h" #include "optimizer/optimizer.h" #include "parser/parse_coerce.h" #include "parser/parse_type.h" @@ -8404,7 +8405,7 @@ exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid) Expr *sexpr = expr->expr_simple_expr; Oid funcid; List *fargs; - ListCell *lc; + Oid prosupport; /* Assume unsafe */ expr->expr_rwopt = PLPGSQL_RWOPT_NOPE; @@ -8473,64 +8474,51 @@ exec_check_rw_parameter(PLpgSQL_expr *expr, int paramid) { SubscriptingRef *sbsref = (SubscriptingRef *) sexpr; - /* We only trust standard varlena arrays to be safe */ - /* TODO: install some extensibility here */ - if (get_typsubscript(sbsref->refcontainertype, NULL) != - F_ARRAY_SUBSCRIPT_HANDLER) - return; - - /* We can optimize the refexpr if it's the target, otherwise not */ - if (sbsref->refexpr && IsA(sbsref->refexpr, Param)) - { - Param *param = (Param *) sbsref->refexpr; + funcid = get_typsubscript(sbsref->refcontainertype, NULL); - if (param->paramkind == PARAM_EXTERN && - param->paramid == paramid) - { - /* Found the Param we want to pass as read/write */ - expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; - expr->expr_rw_param = param; - return; - } - } - - return; + /* + * We assume that only the refexpr and refassgnexpr (if any) are + * relevant to the support function's decision. If that turns out to + * be a bad idea, we could incorporate the subscript expressions into + * the fargs list somehow. + */ + fargs = list_make2(sbsref->refexpr, sbsref->refassgnexpr); } else return; /* - * The top-level function must be one that we trust to be "safe". - * Currently we hard-wire the list, but it would be very desirable to - * allow extensions to mark their functions as safe ... + * The top-level function must be one that can handle in-place update + * safely. We allow functions to declare their ability to do that via a + * support function request. */ - if (!(funcid == F_ARRAY_APPEND || - funcid == F_ARRAY_PREPEND)) - return; - - /* - * The target variable (in the form of a Param) must appear as a direct - * argument of the top-level function. References further down in the - * tree can't be optimized; but on the other hand, they don't invalidate - * optimizing the top-level call, since that will be executed last. - */ - foreach(lc, fargs) + prosupport = get_func_support(funcid); + if (OidIsValid(prosupport)) { - Node *arg = (Node *) lfirst(lc); + SupportRequestModifyInPlace req; + Param *param; - if (arg && IsA(arg, Param)) - { - Param *param = (Param *) arg; + req.type = T_SupportRequestModifyInPlace; + req.funcid = funcid; + req.args = fargs; + req.paramid = paramid; - if (param->paramkind == PARAM_EXTERN && - param->paramid == paramid) - { - /* Found the Param we want to pass as read/write */ - expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; - expr->expr_rw_param = param; - return; - } - } + param = (Param *) + DatumGetPointer(OidFunctionCall1(prosupport, + PointerGetDatum(&req))); + + if (param == NULL) + return; /* support function fails */ + + /* Verify support function followed the API */ + Assert(IsA(param, Param)); + Assert(param->paramkind == PARAM_EXTERN); + Assert(param->paramid == paramid); + + /* Found the Param we want to pass as read/write */ + expr->expr_rwopt = PLPGSQL_RWOPT_INPLACE; + expr->expr_rw_param = param; + return; } } diff --git a/src/pl/plpgsql/src/sql/plpgsql_array.sql b/src/pl/plpgsql/src/sql/plpgsql_array.sql index 4a346203dc..da984a9941 100644 --- a/src/pl/plpgsql/src/sql/plpgsql_array.sql +++ b/src/pl/plpgsql/src/sql/plpgsql_array.sql @@ -53,6 +53,7 @@ begin -- test scenarios for optimization of updates of R/W expanded objects a := array_append(a, 42); -- optimizable using "transfer" method a := a || a[3]; -- optimizable using "inplace" method + a := a[1] || a; -- ditto, but let's test array_prepend a := a || a; -- not optimizable raise notice 'a = %', a; end$$; diff --git a/src/tools/pgindent/typedefs.list b/src/tools/pgindent/typedefs.list index 80328115a1..b6d3f9f659 100644 --- a/src/tools/pgindent/typedefs.list +++ b/src/tools/pgindent/typedefs.list @@ -2787,6 +2787,7 @@ SubscriptionRelState SummarizerReadLocalXLogPrivate SupportRequestCost SupportRequestIndexCondition +SupportRequestModifyInPlace SupportRequestOptimizeWindowClause SupportRequestRows SupportRequestSelectivity -- 2.43.5
Michel Pelletier <pelletier.michel@gmail.com> writes: > Here's a WIP patch for a pgexpanded example in src/test/modules. The > object is very simple and starts with an integer and increments that value > every time it is expanded. I added some regression tests that test two sql > functions that replicate the expansion issue that I'm seeing with my > extension. I took a look through this and have a few comments: * I really don't like the "increments when expanded" behavior. That's just insane from a semantic viewpoint: expanding and then flattening a value should not change it, at least not in any user-visible way. I get that you want the tests to exhibit how many times expansion happened, but the LOGF() debug printouts seem to serve that need just fine. How about we just make the objects store a palloc'd string, or some such? * context_callback_exobj_free() seems bizarre. I can see the usefulness of the LOGF() printout for the test's purposes, but there's no need for the pfree() because the whole context is about to go away. I'm concerned that people using this as a skeleton might think they need to do something equivalent, when they don't. * Maybe the answer to the above objection is to make the expanded object hold a malloc'd not palloc'd string, so that it's actually necessary to have an explicit free() to avoid a permanent memory leak. This'd be silly in a real use-case, and should be documented as such; but it seems good to have a callback function to demonstrate how to clean up resources outside the expanded object itself. * BTW, it might be good to make LOGF() print more than just the function's name; some indication of what it's been called on could be very valuable. * If we believe that this is a skeleton other people might use as a starting point, it'd be good to have more comments explaining what each bit is doing and why. For instance, I think it's important to note that a context callback function isn't required if deleting the memory context is sufficient to clean up the object. * It seems like test_expand() and test_expand_expand() belong to the pgexpanded.sql test script and should be defined there, rather than being part of the extension. * As the patch stands, the module would never be invoked by "make check", other than by manually invoking that in the new subdirectory. You need to hook it into the parent directory's Makefile. And you need meson infrastructure too, ie a meson.build file. (Should be able to mostly crib that from one of the sibling test modules.) * This does not seem like a great idea to me: +SET client_min_messages = 'debug1'; It's pure luck if the test output doesn't get messed up by somebody else's unrelated debug output, because there's elog(DEBUG1) in a lot of places and more could show up any day. I'd be inclined to make LOGF() emit messages at NOTICE level, and then the test doesn't have to mess with client_min_messages at all. * There's a whole bunch of minor ways in which this doesn't conform to project style: - We don't use markdown (.md) for per-directory README files. (There's been discussion of that, but we're not there today.) I would not use a URL to cross-reference our docs, either. - "create or replace function" isn't great style in test scripts, much less extension scripts. We're not expecting conflicting functions to exist, and if one does it's better to error out than silently overwrite it. - We put '#include "postgres.h"' in .c files, never in .h files. - At least the .c and .h files should carry standard header comments with a PGDG copyright notice. - Code layout and comments are frequently not per style. You can mostly fix this by running the .c and .h files through pgindent, but it might mangle your comments; usually best to make those look like project standard first. See https://www.postgresql.org/docs/devel/source-format.html - Declaring static functions in a .h file doesn't seem like a great plan. If the thing is only meant to be included in one .c file, why bother with a separate .h file at all? "static const ExpandedObjectMethods exobj_methods" belongs there even less, since you'd end with one copy per including file. - We don't put "/* Local Variables: */" comments into code files; we generally expect the source files to be agnostic about what editors people use on them. To move forward, I'd suggest dealing with these concerns first, and then as a separate patch start adding things like in-place-modification support. It'd be cool to see the results of that optimization manifest as fewer expansions visible in the test's output. regards, tom lane
On Tue, Dec 3, 2024 at 4:42 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> Here's a WIP patch for a pgexpanded example in src/test/modules.
I didn't look at your patch yet, but in the meantime here's an update
that takes the next step towards what I promised.
Awesome! I made a support function for my set_element(matrix, i, j, value) function and it works great, but I do have a couple questions about how to move forward with some of my other methods. For reference, here's the set_element test function, and a trace of function calls, after implementing the support function for set_element, the expand_matrix function is called twice, first by nvals(), and later by the first set_element, but not the subsequent sets, so the true clause of the PLPGSQL_RWOPT_INPLACE case in plpgsql_param_eval_var_check works great and as expected:
postgres=# create or replace function test_se(graph matrix) returns matrix language plpgsql as
$$
declare
nvals bigint;
begin
graph = wait(graph);
nvals = nvals(graph);
raise notice 'nvals: %', nvals;
graph = set_element(graph, 4, 2, 42);
graph = set_element(graph, 4, 3, 43);
graph = set_element(graph, 4, 4, 44);
return graph;
end;
$$;
CREATE FUNCTION
postgres=# select nvals(test_se('int32'::matrix));
DEBUG: new_matrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: matrix_wait
DEBUG: DatumGetMatrix
DEBUG: expand_matrix <- wait expands with support function
DEBUG: new_matrix
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- nvals() reexpands
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
NOTICE: nvals: 0
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix < set_element does not reexpand, yay!
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
┌───────┐
│ nvals │
├───────┤
│ 3 │
└───────┘
$$
declare
nvals bigint;
begin
graph = wait(graph);
nvals = nvals(graph);
raise notice 'nvals: %', nvals;
graph = set_element(graph, 4, 2, 42);
graph = set_element(graph, 4, 3, 43);
graph = set_element(graph, 4, 4, 44);
return graph;
end;
$$;
CREATE FUNCTION
postgres=# select nvals(test_se('int32'::matrix));
DEBUG: new_matrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: matrix_wait
DEBUG: DatumGetMatrix
DEBUG: expand_matrix <- wait expands with support function
DEBUG: new_matrix
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- nvals() reexpands
DEBUG: new_matrix
DEBUG: context_callback_matrix_free
NOTICE: nvals: 0
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix < set_element does not reexpand, yay!
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: scalar_int32
DEBUG: new_scalar
DEBUG: matrix_set_element
DEBUG: DatumGetMatrix
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: matrix_nvals
DEBUG: DatumGetMatrix
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
┌───────┐
│ nvals │
├───────┤
│ 3 │
└───────┘
My question is about nvals = nvals(graph) in that function above, the object is flattened and then rexpanded, even after the object was expanded and returned by wait() [1] into a r/w pointer. set_element honors the expanded object from wait(), but nvals does not. It seems like I want to be able to pass the argument as a RO pointer, but I'm not sure how to trigger the "else" clause in the PLPGSQL_RWOPT_INPLACE case. I can see how the support function triggers the if, but I don't see a similar way to trigger the else. I'm almost certainly missing something.
[1](Due to the asynchronous nature of the GraphBLAS API, there is a GrB_wait() function to wait until an object is "complete" computationally, but since this object was just expanded, there is no pending work, so the wait() is essentially a noop but a handy way for me to return a r/w pointer for subsequent operations).
set_element and wait are the simple case where there is only one reference to the r/w object in the argument list. As we've discussed I have many functions that can possibly have multiple references to the same object. The core operation of the library is matrix multiplication, and all other complex functions follow a very similar pattern, so I'll just focus on that one here:
CREATE FUNCTION mxm(
a matrix,
b matrix,
op semiring default null,
inout c matrix default null,
mask matrix default null,
accum binaryop default null,
descr descriptor default null
)
RETURNS matrix
a matrix,
b matrix,
op semiring default null,
inout c matrix default null,
mask matrix default null,
accum binaryop default null,
descr descriptor default null
)
RETURNS matrix
The order of arguments mostly follows the order in the C API. a and b are the left and right matrix operands and the matrix product is the return value. If c is not null, then it is a pre-created return value which may contain partial results already from some previous operations, otherwise mxm creates a new matrix of the correct dimensions and returns that. I think the inout is meaningless as it doesn't seem to change anything, but I'm using it as a visual indication in code that c can be the return value if it's not null.
Here's an example of doing Triangle Counting using Burkhardt's method [2], where a, b, and the mask are all the same adjacency matrix (here the Newman/karate graph. The 'plus_pair' semiring is optimized for structural counting, and the descriptor 's' tells suitesparse to only use the structure of the mask and not to consider the values):
CREATE OR REPLACE FUNCTION public.tcount_burkhardt(graph matrix)
RETURNS bigint
LANGUAGE plpgsql
AS $$
begin
graph = wait(graph);
graph = mxm(graph, graph, 'plus_pair_int32', mask=>graph, descr=>'s');
return reduce_scalar(graph) / 6;
end;
$$;
RETURNS bigint
LANGUAGE plpgsql
AS $$
begin
graph = wait(graph);
graph = mxm(graph, graph, 'plus_pair_int32', mask=>graph, descr=>'s');
return reduce_scalar(graph) / 6;
end;
$$;
postgres=# select tcount_burkhardt(graph) from karateg;
DEBUG: matrix_wait
DEBUG: DatumGetMatrix
DEBUG: expand_matrix <- wait expands and returns r/w pointer with support function
DEBUG: new_matrix
DEBUG: matrix_mxm <- mxm starts here
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding left operand again
DEBUG: new_matrix
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding right operand again
DEBUG: new_matrix
DEBUG: DatumGetSemiring
DEBUG: expand_semiring
DEBUG: new_semiring
DEBUG: new_matrix
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding mask argument again
DEBUG: new_matrix
DEBUG: DatumGetDescriptor
DEBUG: expand_descriptor
DEBUG: new_descriptor
DEBUG: context_callback_matrix_free
DEBUG: context_callback_descriptor_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_semiring_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
DEBUG: matrix_reduce_scalar
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- reduce also re-expands matrix
DEBUG: new_matrix
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: DatumGetScalar
DEBUG: new_scalar
DEBUG: cast_scalar_int64
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: context_callback_scalar_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
┌──────────────────┐
│ tcount_burkhardt │
├──────────────────┤
│ 45 │
└──────────────────┘
DEBUG: matrix_wait
DEBUG: DatumGetMatrix
DEBUG: expand_matrix <- wait expands and returns r/w pointer with support function
DEBUG: new_matrix
DEBUG: matrix_mxm <- mxm starts here
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding left operand again
DEBUG: new_matrix
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding right operand again
DEBUG: new_matrix
DEBUG: DatumGetSemiring
DEBUG: expand_semiring
DEBUG: new_semiring
DEBUG: new_matrix
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- expanding mask argument again
DEBUG: new_matrix
DEBUG: DatumGetDescriptor
DEBUG: expand_descriptor
DEBUG: new_descriptor
DEBUG: context_callback_matrix_free
DEBUG: context_callback_descriptor_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_semiring_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
DEBUG: matrix_reduce_scalar
DEBUG: DatumGetMatrix
DEBUG: matrix_get_flat_size
DEBUG: flatten_matrix
DEBUG: expand_matrix <- reduce also re-expands matrix
DEBUG: new_matrix
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: DatumGetScalar
DEBUG: new_scalar
DEBUG: cast_scalar_int64
DEBUG: DatumGetScalar
DEBUG: context_callback_scalar_free
DEBUG: context_callback_scalar_free
DEBUG: context_callback_matrix_free
DEBUG: context_callback_matrix_free
┌──────────────────┐
│ tcount_burkhardt │
├──────────────────┤
│ 45 │
└──────────────────┘
mxm calls expand_matrix three times for each of the three arguments. Ideally I'd like the already expanded rw pointer from wait() to be honored by mxm so that it doesn't re-expand the object three times but, like set_element, not at all.
Hope that question makes sense, still going on the main theory that I'm not understanding the support function, and maybe the c argument thing being optional throws a wrench in the plan, and I'm happy to try and find a workaround for that. Maybe always requiring the result to be pre-constructed and then making c required and reassigning back to the same input argument is the right approach?
Some good news I always like seeing is that 45 is the right answer. Very close to having optimal sparse linear algebra in Postgres!
Thanks for your help! I'll move onto your comments on the test module next.
-Michel
On Fri, Dec 6, 2024 at 4:51 PM Michel Pelletier <pelletier.michel@gmail.com> wrote:
My question is about nvals = nvals(graph) in that function above, the object is flattened and then rexpanded, even after the object was expanded and returned by wait() [1] into a r/w pointer.
...
mxm calls expand_matrix three times for each of the three arguments. Ideally I'd like the already expanded rw pointer from wait() to be honored by mxm so that it doesn't re-expand the object three times but, like set_element, not at all.
My bad, sorry for the long confusing email, I figured out that I was calling the wrong macro when getting my matrix datum and inadvertently expanding RO pointers as well, I've fixed that issue, and everything is working great! No extra expansions and my support functions are working well, I need to go through a few more places in the API to add more support but otherwise the fixes Tom has put into plpgsql have worked perfectly and the library now appears to be behaving optimally! I can get down to doing some benchmarks and head-to-head with the C and Python bindings to compare against.
Thanks for your help Tom, I'm looking forward to the changes being released soon! In the meanwhile I'll keep a locally patched version for ongoing testing purposes.
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > My bad, sorry for the long confusing email, I figured out that I was > calling the wrong macro when getting my matrix datum and inadvertently > expanding RO pointers as well, I've fixed that issue, and everything is > working great! No extra expansions and my support functions are working > well, I need to go through a few more places in the API to add more support > but otherwise the fixes Tom has put into plpgsql have worked perfectly and > the library now appears to be behaving optimally! I can get down to doing > some benchmarks and head-to-head with the C and Python bindings to compare > against. So, just to clarify where we're at: you are satisfied that the current patch-set does what you need? The other task we'd talked about was generalizing the existing heuristics in exec_assign_value() and plpgsql_exec_function() that say that array-type values should be forced into expanded R/W form when being assigned to an array-type PL/pgSQL variable. The argument for that is that the PL/pgSQL function might subsequently do a lot of subscripted accesses to the array (which'd benefit from working with an expanded array) while never doing another assignment and thus not having any opportunity to revisit the decision. The counter-argument is that it might *not* do such accesses, so that the expansion was just a waste of cycles. So this is squishy enough that I'd prefer to have some solid use-cases to look at before trying to generalize it. It's sounding to me like you're going to end up in a place where all your values are passed around in expanded form already and so you have little need for that optimization. If so, I'd prefer not to go any further than the present patch-set for now. Adding "type support" hooks as discussed would be a substantial amount of work, so I'd like to have a more compelling case for it before doing that. regards, tom lane
On Wed, Dec 18, 2024 at 12:22 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> My bad, sorry for the long confusing email, I figured out that I was
> calling the wrong macro when getting my matrix datum and inadvertently
> expanding RO pointers as well, I've fixed that issue, and everything is
> working great! No extra expansions and my support functions are working
> well, I need to go through a few more places in the API to add more support
> but otherwise the fixes Tom has put into plpgsql have worked perfectly and
> the library now appears to be behaving optimally! I can get down to doing
> some benchmarks and head-to-head with the C and Python bindings to compare
> against.
So, just to clarify where we're at: you are satisfied that the current
patch-set does what you need?
I have some updates on this thread based on some graph algorithms I've ported from the Python/C graphblas libraries.
All of the plpgsql expanded object optimizations so far are working well, I can minimize object expansion in most cases, there are a couple I haven't been able to work around but I'm still getting excellent benchmarking numbers on some large test graphs:
LiveJournal Orkut
Nodes 3,997,962 3,072,441
Edges 34,681,185 117,185,037
Triangles 177,820,130 627,583,972
Seconds Edges/S Seconds Edges/S
Tri Count LL 2.80s 12,386,138 32.03s 3,658,602
Tri Count LU 1.91s 18,157,688 16.38s 7,156,338
Tri Centrality 1.55s 22,374,958 12.22s 9,589,610
Page Rank 8.10s 4,281,628 23.14s 5,064,176
Nodes 3,997,962 3,072,441
Edges 34,681,185 117,185,037
Triangles 177,820,130 627,583,972
Seconds Edges/S Seconds Edges/S
Tri Count LL 2.80s 12,386,138 32.03s 3,658,602
Tri Count LU 1.91s 18,157,688 16.38s 7,156,338
Tri Centrality 1.55s 22,374,958 12.22s 9,589,610
Page Rank 8.10s 4,281,628 23.14s 5,064,176
That's on a 2020 era 4 core economy laptop and is in line with what the C/Python/Julia bindings get on similar hardware.
There are a few cases where I have to force an expansion, I work around this by calling a `wait()` function, which expands the datum, calls GrB_wait() on it (a nop in this case) and returns a r/w pointer. You can see this in the following Triangle Counting function which is a matrix multiplication of a graph to itself, using itself as a mask. This matrix reduces to the triangle count (times six):
create or replace function tcount_b(graph matrix) returns bigint language plpgsql as
$$
begin
graph = wait(graph);
graph = mxm(graph, graph, 'plus_pair_int32', mask=>graph, descr=>'s');
return reduce_scalar(graph) / 6;
end;
$$;
$$
begin
graph = wait(graph);
graph = mxm(graph, graph, 'plus_pair_int32', mask=>graph, descr=>'s');
return reduce_scalar(graph) / 6;
end;
$$;
DEBUG: new_matrix
DEBUG: flatten_matrix
DEBUG: matrix_wait
DEBUG: expand_matrix -- expansion happens here in wait()
DEBUG: new_matrix
DEBUG: matrix_mxm -- mxm does not re-expand the object, good!
DEBUG: expand_semiring
DEBUG: new_semiring
DEBUG: new_matrix
DEBUG: expand_descriptor
DEBUG: new_descriptor
DEBUG: matrix_reduce_scalar -- neither does reduce, good!
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: new_scalar
DEBUG: cast_scalar_int64
DEBUG: flatten_matrix
DEBUG: matrix_wait
DEBUG: expand_matrix -- expansion happens here in wait()
DEBUG: new_matrix
DEBUG: matrix_mxm -- mxm does not re-expand the object, good!
DEBUG: expand_semiring
DEBUG: new_semiring
DEBUG: new_matrix
DEBUG: expand_descriptor
DEBUG: new_descriptor
DEBUG: matrix_reduce_scalar -- neither does reduce, good!
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: new_scalar
DEBUG: cast_scalar_int64
If I take out the call to wait(), then mxm calls expand_matrix 3 times as it did before your optimizations.
The other task we'd talked about was generalizing the existing
heuristics in exec_assign_value() and plpgsql_exec_function() that
say that array-type values should be forced into expanded R/W form
when being assigned to an array-type PL/pgSQL variable. The argument
for that is that the PL/pgSQL function might subsequently do a lot of
subscripted accesses to the array (which'd benefit from working with
an expanded array) while never doing another assignment and thus not
having any opportunity to revisit the decision. The counter-argument
is that it might *not* do such accesses, so that the expansion was
just a waste of cycles. So this is squishy enough that I'd prefer to
have some solid use-cases to look at before trying to generalize it.
It's sounding to me like you're going to end up in a place where all
your values are passed around in expanded form already and so you have
little need for that optimization.
If so, I'd prefer not to go any
further than the present patch-set for now. Adding "type support"
hooks as discussed would be a substantial amount of work, so I'd
like to have a more compelling case for it before doing that.
I agree it makes sense to have more use cases before making deeper changes. I only work with expanded forms, but need to call wait() to pre-expand the object to avoid multiple expansions in functions that can take the same object in multiple parameters. This is a pretty common pattern in GraphBLAS (and linear algebra in general) where (many) matrices are commutable to themselves in several ways like multiplication, element-wise operations, and element masking.
I'm not sure if eliminating wait() is a good enough use case, it would definitely be nice to get rid of but I can document it pretty thoroughly and it's relatively easy to catch.
-Michel
Michel Pelletier <pelletier.michel@gmail.com> writes: > On Wed, Dec 18, 2024 at 12:22 PM Tom Lane <tgl@sss.pgh.pa.us> wrote: >> So, just to clarify where we're at: you are satisfied that the current >> patch-set does what you need? > There are a few cases where I have to force an expansion, I work around > this by calling a `wait()` function, which expands the datum, calls > GrB_wait() on it (a nop in this case) and returns a r/w pointer. You can > see this in the following Triangle Counting function which is a matrix > multiplication of a graph to itself, using itself as a mask. This matrix > reduces to the triangle count (times six): > create or replace function tcount_b(graph matrix) returns bigint language > plpgsql as > $$ > begin > graph = wait(graph); > graph = mxm(graph, graph, 'plus_pair_int32', mask=>graph, > descr=>'s'); > return reduce_scalar(graph) / 6; > end; > $$; > ... > I agree it makes sense to have more use cases before making deeper > changes. I only work with expanded forms, but need to call wait() to > pre-expand the object to avoid multiple expansions in functions that can > take the same object in multiple parameters. Hmm. I agree that the wait() call is a bit ugly, but there are at least two things that seem worth looking into before we go so far as inventing type-support infrastructure: 1. Why isn't the incoming "graph" object already expanded? It often would be read-only, but that seems like it might be enough given your description of GraphBLAS' behavior. 2. If the problem is primarily with passing the same object to multiple parameters of a function, couldn't you detect and optimize that within the function? It would be messier than just blindly applying DatumGetWhatever() to each parameter position; but with a bit of thought I bet you could create some support logic that would hide most of the mess. regards, tom lane
On Mon, Dec 23, 2024 at 8:26 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Michel Pelletier <pelletier.michel@gmail.com> writes:
> ...
> I agree it makes sense to have more use cases before making deeper
> changes. I only work with expanded forms, but need to call wait() to
> pre-expand the object to avoid multiple expansions in functions that can
> take the same object in multiple parameters.
Hmm. I agree that the wait() call is a bit ugly, but there are at
least two things that seem worth looking into before we go so far
as inventing type-support infrastructure:
2. If the problem is primarily with passing the same object to
multiple parameters of a function, couldn't you detect and optimize
that within the function? It would be messier than just blindly
applying DatumGetWhatever() to each parameter position; but with a
bit of thought I bet you could create some support logic that would
hide most of the mess.
Ah that's a great idea, and it works beautifully! Now I can do an efficient triangle count without even needing a function, see below expand_matrix is only called once:
postgres=# select reduce_scalar(mxm(graph, graph, mask=>graph, c=>graph)) / 6 as tcount from vlivejournals ;
DEBUG: matrix_mxm
DEBUG: DatumGetMatrix
DEBUG: expand_matrix -- only called once!
DEBUG: new_matrix
DEBUG: DatumGetMatrixMaybeA
DEBUG: DatumGetMatrixMaybeAB
DEBUG: DatumGetMatrixMaybeABC
DEBUG: matrix_reduce_scalar
DEBUG: DatumGetMatrix
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: new_scalar
DEBUG: scalar_out
┌─────────────────┐
│ tcount │
├─────────────────┤
│ int32:177820130 │
└─────────────────┘
DEBUG: matrix_mxm
DEBUG: DatumGetMatrix
DEBUG: expand_matrix -- only called once!
DEBUG: new_matrix
DEBUG: DatumGetMatrixMaybeA
DEBUG: DatumGetMatrixMaybeAB
DEBUG: DatumGetMatrixMaybeABC
DEBUG: matrix_reduce_scalar
DEBUG: DatumGetMatrix
DEBUG: new_scalar
DEBUG: scalar_div_int32
DEBUG: new_scalar
DEBUG: scalar_out
┌─────────────────┐
│ tcount │
├─────────────────┤
│ int32:177820130 │
└─────────────────┘
What a wonderful Christmas present to me, thank you Tom!
That pretty much resolves my main issues. I'm still in an exploratory phase but I think this gets me pretty far. Is this something that has to wait for 18 to be released? Also do you need any further testing or code reviewing from me?
-Michel