Thread: making EXPLAIN extensible

making EXPLAIN extensible

From
Robert Haas
Date:
Prior to PostgreSQL 10, EXPLAIN had just 2 options: VACUUM and
ANALYZE. Now, we're up to 12 options, which is already quite a lot,
and there's plenty more things that somebody might like to do.
However, not all of those things necessarily need to be part of the
core code. My original reason for wanting to extend EXPLAIN was that I
was thinking about an extension that would want to do a bunch of
things and one of those things would be to add some information to the
EXPLAIN output. It wouldn't make sense for core to have an EXPLAIN
option whose whole purpose is to cater to the needs of some extension,
so that made me think of providing some extensibility infrastructure.

However, there are other use cases, too, basically any of the normal
reasons why extensibility is useful and desirable. You might need to
get some information out a query plan that 99% of people don't care
about. You could come up with your own way of formatting a query plan,
but that's a big pain. It's a lot nicer if you can just add the detail
that you care about to the EXPLAIN output without needing to modify
PostgreSQL itself. Even if you think of something that really ought to
be included in the EXPLAIN output by PostgreSQL, you can roll an
extension out much quicker than you can get a change upstreamed and
released. So I think EXPLAIN extensibility is, as a general concept,
useful.

So here are some patches.

0001 allows a loadable module to register new EXPLAIN options.
Currently, EXPLAIN (FUNGUS) will error out, but if you want to make it
work, this patch is for you. This patch also allows you to stash some
state related to your new option, or options, in the ExplainState.
Core options have hard-coded structure members; e.g. EXPLAIN (BUFFERS)
sets es->buffers. If you add EXPLAIN (FUNGUS), there won't be an
es->fungus, but you can get about the same effect using the new
facilities provided here.

0002 provides hooks that you can use to make your new EXPLAIN options
actually do something. In particular, this adds a new hook that is
called once per PlanState node, and a new nook that is called once per
PlannedStmt. Each is called at an appropriate point for you to tack on
more output after what EXPLAIN would already produce.

0003 adds a new contrib module called pg_overexplain, which adds
EXPLAIN (DEBUG) and EXPLAIN (RANGE_TABLE). I actually think this is
quite useful for planner hacking, and maybe a few more options would
be, too. Right now, if you want to see stuff that EXPLAIN doesn't
clearly show, you have to use SET debug_print_plan = true, and that
output is so verbose that finding the parts you actually want to see
is quite difficult. Assuming it gives you the details you need,
EXPLAIN (RANGE_TABLE) looks way, way better to me, and if we end up
committing these patches I anticipate using this semi-regularly.

There are plenty of debatable things in this patch set, and I mention
some of them in the commit messages. The hook design in 0002 is a bit
simplistic and could be made more complex; there's lots of stuff that
could be added to or removed from 0003, much of which comes down to
what somebody hacking on the planner would actually want to see. I'm
happy to bikeshed all of that stuff; this is all quite preliminary and
I'm not committed to the details. The only thing that would disappoint
me is if somebody said "this whole idea of making EXPLAIN extensible
is stupid and pointless and we shouldn't ever do it." I will argue
against that vociferously. I think even what I have here is enough to
disprove that hypothesis, but I have a bunch of ideas about how to do
more. Some of those require additional infrastructure and are best
proposed with that other infrastructure; some can be done with just
this, but I ran out of time to code up examples so here is what I have
got so far.

Hope you like it, sorry if you don't.

-- 
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Thom Brown
Date:
On Fri, 28 Feb 2025 at 19:26, Robert Haas <robertmhaas@gmail.com> wrote:
>
> Prior to PostgreSQL 10, EXPLAIN had just 2 options: VACUUM and
> ANALYZE. Now, we're up to 12 options, which is already quite a lot,
> and there's plenty more things that somebody might like to do.
> However, not all of those things necessarily need to be part of the
> core code. My original reason for wanting to extend EXPLAIN was that I
> was thinking about an extension that would want to do a bunch of
> things and one of those things would be to add some information to the
> EXPLAIN output. It wouldn't make sense for core to have an EXPLAIN
> option whose whole purpose is to cater to the needs of some extension,
> so that made me think of providing some extensibility infrastructure.
>
> However, there are other use cases, too, basically any of the normal
> reasons why extensibility is useful and desirable. You might need to
> get some information out a query plan that 99% of people don't care
> about. You could come up with your own way of formatting a query plan,
> but that's a big pain. It's a lot nicer if you can just add the detail
> that you care about to the EXPLAIN output without needing to modify
> PostgreSQL itself. Even if you think of something that really ought to
> be included in the EXPLAIN output by PostgreSQL, you can roll an
> extension out much quicker than you can get a change upstreamed and
> released. So I think EXPLAIN extensibility is, as a general concept,
> useful.
>
> So here are some patches.
>
> 0001 allows a loadable module to register new EXPLAIN options.
> Currently, EXPLAIN (FUNGUS) will error out, but if you want to make it
> work, this patch is for you. This patch also allows you to stash some
> state related to your new option, or options, in the ExplainState.
> Core options have hard-coded structure members; e.g. EXPLAIN (BUFFERS)
> sets es->buffers. If you add EXPLAIN (FUNGUS), there won't be an
> es->fungus, but you can get about the same effect using the new
> facilities provided here.
>
> 0002 provides hooks that you can use to make your new EXPLAIN options
> actually do something. In particular, this adds a new hook that is
> called once per PlanState node, and a new nook that is called once per
> PlannedStmt. Each is called at an appropriate point for you to tack on
> more output after what EXPLAIN would already produce.
>
> 0003 adds a new contrib module called pg_overexplain, which adds
> EXPLAIN (DEBUG) and EXPLAIN (RANGE_TABLE). I actually think this is
> quite useful for planner hacking, and maybe a few more options would
> be, too. Right now, if you want to see stuff that EXPLAIN doesn't
> clearly show, you have to use SET debug_print_plan = true, and that
> output is so verbose that finding the parts you actually want to see
> is quite difficult. Assuming it gives you the details you need,
> EXPLAIN (RANGE_TABLE) looks way, way better to me, and if we end up
> committing these patches I anticipate using this semi-regularly.
>
> There are plenty of debatable things in this patch set, and I mention
> some of them in the commit messages. The hook design in 0002 is a bit
> simplistic and could be made more complex; there's lots of stuff that
> could be added to or removed from 0003, much of which comes down to
> what somebody hacking on the planner would actually want to see. I'm
> happy to bikeshed all of that stuff; this is all quite preliminary and
> I'm not committed to the details. The only thing that would disappoint
> me is if somebody said "this whole idea of making EXPLAIN extensible
> is stupid and pointless and we shouldn't ever do it." I will argue
> against that vociferously. I think even what I have here is enough to
> disprove that hypothesis, but I have a bunch of ideas about how to do
> more. Some of those require additional infrastructure and are best
> proposed with that other infrastructure; some can be done with just
> this, but I ran out of time to code up examples so here is what I have
> got so far.
>
> Hope you like it, sorry if you don't.

"pg_overexplain"? I love this name! And the idea sounds like a natural
evolution, so +1.

Some questions:

One thing I am wondering is whether extensions should be required to
prefix their EXPLAIN option with the extension name to avoid
collisions.

If two extensions happen to choose the same name, it won't be possible
to use both simultaneously.

In what order would the options be applied? Would it be deterministic,
or weighted within the extension's configuration, or based on the
order of the options in the list?

Would explain extensions be capable of modifying pre-existing core
option output, or just append to output?

Should there be a way of determining which lines are output by which
option? An extension may output similar output to core output, making
it difficult or impossible to discern which is which.

Does there need to be any security considerations so that things like
RLS don't inadvertently become leaky?

Thom



Re: making EXPLAIN extensible

From
Isaac Morland
Date:
On Fri, 28 Feb 2025 at 15:09, Thom Brown <thom@linux.com> wrote:
 
One thing I am wondering is whether extensions should be required to
prefix their EXPLAIN option with the extension name to avoid
collisions.

If two extensions happen to choose the same name, it won't be possible
to use both simultaneously.

Could the call that processes the registration automatically prepend the extension name to the supplied explain option name? So if extension X registers option O it would be registered as X_O rather than returning an error if O doesn't follow the proper pattern.

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Fri, Feb 28, 2025 at 3:09 PM Thom Brown <thom@linux.com> wrote:
> "pg_overexplain"? I love this name! And the idea sounds like a natural
> evolution, so +1.

Thanks. I thought about things like pg_hyperexplain or
pg_explain_debug, but in the end I didn't like any of them better than
overexplain. :-)

> One thing I am wondering is whether extensions should be required to
> prefix their EXPLAIN option with the extension name to avoid
> collisions.

I considered that. One advantage of doing that is that you could
support autoloading. Right now, you have to LOAD 'pg_overexplain' or
put it in session_preload_libraries or shared_preload_libraries in
order to use it. If you required people to type EXPLAIN
(pg_overexplain.range_table) instead of just EXPLAIN (range_table),
then you could react to not finding any such option by trying to
autoload a .so with the part of the name before the dot.

But you can probably see that this idea has a couple of pretty serious
weaknesses:

1. It is much more verbose. I theorize that people will be unhappy
about having to type EXPLAIN (pg_overexplain.range_table) rather than
just EXPLAIN (range_table). One could try to address this by renaming
the extension to something shorter, like just 'oe'. Having to type
EXPLAIN (oe.range_table) wouldn't be nearly as annoying. However, this
seems like a pretty clear case of letting the tail wag the dog.

2. autoloading could have security concerns. This is probably fixable,
but we'd need to be sure that providing a new way to trigger loading a
module didn't open up any security holes.

> If two extensions happen to choose the same name, it won't be possible
> to use both simultaneously.

That's true. Of course, a lot depends on whether we end up with 3 or 5
or 8 EXPLAIN extensions or more like 30 or 50 or 80. In the former
case, the people writing those extensions will probably mostly know
about each other and can just use different names. In the latter case
it's a problem. My guess is it's the former case.

> In what order would the options be applied? Would it be deterministic,
> or weighted within the extension's configuration, or based on the
> order of the options in the list?

I'm not entirely sure I know which question you're asking here. If
you're asking what happens if two modules try to register the same
EXPLAIN option name and then a user uses it, one of the registrations
will win and the other will lose. I think the second one wins. As I
say above, I assume we'll find a way to not try to do that. However, I
think more likely you're asking: if you load pg_fingernailexplain and
pg_toenailexplain and then do EXPLAIN (toenail, fingernail) SELECT
..., in what order will the options take effect? For the answer to
that question, see the commit message for 0002.

> Would explain extensions be capable of modifying pre-existing core
> option output, or just append to output?

The interfaces we have are really only going to work for appending.
Modifying would be cool, but I think it's mostly impractical. We have
a framework for emitting stuff into EXPLAIN output in a way that takes
into account whether you're in text mode or json or yaml or whatever,
and this patch just builds on that existing framework to allow you to
make extra calls to those emit-some-output functions at useful places.
As a result, the patch is small and simple. If we had an existing
framework for modifying stuff, then we could perhaps provide suitable
places to call those functions, too. But they don't exist, and it's
not easy to see how they could be created. I think you would need some
kind of major redesign of explain.c, and I don't know how to do that
without making it bloated, slow, and unmaintainable.

If somebody comes up with a way of allowing certain limited types of
modifications to EXPLAIN output with small, elegant-looking code
changes, and if those changes seem like useful things for an extension
to want to do, I'm totally on board. But I currently don't have an
idea like that.

> Should there be a way of determining which lines are output by which
> option? An extension may output similar output to core output, making
> it difficult or impossible to discern which is which.

I don't think this is really going to be a problem.

> Does there need to be any security considerations so that things like
> RLS don't inadvertently become leaky?

It's possible that there may be some security considerations, and
that's worth thinking about. However, RLS disclaims support for
side-channel attacks, so it's already understood to be (very) leaky.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> EXPLAIN output. It wouldn't make sense for core to have an EXPLAIN
> option whose whole purpose is to cater to the needs of some extension,
> so that made me think of providing some extensibility infrastructure.

Making EXPLAIN extensible sounds like a good idea.. FWIW, There is a
discussion [0]
for showing FDW remote plans ( postgres_fdw specifically), and I think
we  will need to
add some new options to EXPLAIN to make that possible.

Have not looked at your patches, but I will do so now.


Regards,

Sami Imseih
Amazon Web Services (AWS)


[0] https://www.postgresql.org/message-id/CAA5RZ0vXiOiodrNQ-Va4FCAkXMpGA%3DGZDeKjFBRgRvHGuW7s7Q%40mail.gmail.com



Re: making EXPLAIN extensible

From
Srinath Reddy
Date:
Hi Robert,
thanks for working on this and +1 for the idea.
i have reviewed 1,2 patches using 3rd patch(pg_overexplain module) and they LGTM,will review more the 3rd patch.

Regards,
Srinath Reddy Sadipiralla
EDB:http://www.enterprisedb.com

Re: making EXPLAIN extensible

From
Alena Rybakina
Date:
Hi! I agree with your idea to expand the explain.

On 28.02.2025 22:26, Robert Haas wrote:
0002 provides hooks that you can use to make your new EXPLAIN options
actually do something. In particular, this adds a new hook that is
called once per PlanState node, and a new nook that is called once per
PlannedStmt. Each is called at an appropriate point for you to tack on
more output after what EXPLAIN would already produce.

0003 adds a new contrib module called pg_overexplain, which adds
EXPLAIN (DEBUG) and EXPLAIN (RANGE_TABLE). I actually think this is
quite useful for planner hacking, and maybe a few more options would
be, too. Right now, if you want to see stuff that EXPLAIN doesn't
clearly show, you have to use SET debug_print_plan = true, and that
output is so verbose that finding the parts you actually want to see
is quite difficult. Assuming it gives you the details you need,
EXPLAIN (RANGE_TABLE) looks way, way better to me, and if we end up
committing these patches I anticipate using this semi-regularly.


There are plenty of debatable things in this patch set, and I mention
some of them in the commit messages. The hook design in 0002 is a bit
simplistic and could be made more complex; there's lots of stuff that
could be added to or removed from 0003, much of which comes down to
what somebody hacking on the planner would actually want to see. I'm
happy to bikeshed all of that stuff; this is all quite preliminary and
I'm not committed to the details. The only thing that would disappoint
me is if somebody said "this whole idea of making EXPLAIN extensible
is stupid and pointless and we shouldn't ever do it." I will argue
against that vociferously. I think even what I have here is enough to
disprove that hypothesis, but I have a bunch of ideas about how to do
more. Some of those require additional infrastructure and are best
proposed with that other infrastructure; some can be done with just
this, but I ran out of time to code up examples so here is what I have
got so far.

Hope you like it, sorry if you don't.

while writing the AQO extension [0] we were just adding a hook (we called it ExplainOnePlan_hook) similar to this one to add a description for the node used in the plan, in particular its identifier and the cardinality that is used during query planning. We also added a guc that allows user to disable this debug information, since it can only be useful when analyzing a problematic query, but not all the time. Therefore, I think this work is necessary and needed to provide the necessary output of additional information about the plan, which may be necessary for extensions like this.

[0] https://github.com/postgrespro/aqo

-- 
Regards,
Alena Rybakina
Postgres Professional

Re: making EXPLAIN extensible

From
Matheus Alcantara
Date:
Hi, +1 for the idea. I Haven't reviewed the patches yet, but I would like to
share some thoughts.

On Fri, Feb 28, 2025 at 5:32 PM Robert Haas <robertmhaas@gmail.com> wrote:
>
> > One thing I am wondering is whether extensions should be required to
> > prefix their EXPLAIN option with the extension name to avoid
> > collisions.
>
> I considered that. One advantage of doing that is that you could
> support autoloading. Right now, you have to LOAD 'pg_overexplain' or
> put it in session_preload_libraries or shared_preload_libraries in
> order to use it. If you required people to type EXPLAIN
> (pg_overexplain.range_table) instead of just EXPLAIN (range_table),
> then you could react to not finding any such option by trying to
> autoload a .so with the part of the name before the dot.
>
> But you can probably see that this idea has a couple of pretty serious
> weaknesses:
>
> 1. It is much more verbose. I theorize that people will be unhappy
> about having to type EXPLAIN (pg_overexplain.range_table) rather than
> just EXPLAIN (range_table). One could try to address this by renaming
> the extension to something shorter, like just 'oe'. Having to type
> EXPLAIN (oe.range_table) wouldn't be nearly as annoying. However, this
> seems like a pretty clear case of letting the tail wag the dog.
>
> 2. autoloading could have security concerns. This is probably fixable,
> but we'd need to be sure that providing a new way to trigger loading a
> module didn't open up any security holes.
>
> > If two extensions happen to choose the same name, it won't be possible
> > to use both simultaneously.
>
> That's true. Of course, a lot depends on whether we end up with 3 or 5
> or 8 EXPLAIN extensions or more like 30 or 50 or 80. In the former
> case, the people writing those extensions will probably mostly know
> about each other and can just use different names. In the latter case
> it's a problem. My guess is it's the former case.
>
It would make sense (or possible) to have some kind of validation that returns
an error when using an option that is ambiguous? If a option is unique for all
options registered via extensions than the extension name prefix is not needed,
otherwise an error is returned forcing the user to specify the extension name.

This is a similar behaviour when, e.g we have a where clause that is
referencing a column that is present in multiple tables used in the query:
ERROR:  42702: column reference "b" is ambiguous LINE 1: select * from t inner
join t2 on t.a = t2.a where b = 10;

--
Matheus Alcantara



Re: making EXPLAIN extensible

From
Matthias van de Meent
Date:
On Fri, 28 Feb 2025 at 20:26, Robert Haas <robertmhaas@gmail.com> wrote:
>
> Prior to PostgreSQL 10, EXPLAIN had just 2 options: VACUUM and
> ANALYZE.

I think you meant "some time prior to PostgreSQL 10".
PostgreSQL 9.0 had 5 options, of which COSTS, BUFFERS, and FORMAT were
newly added, so only before 9.0 we had 2 options.
PostgreSQL 9.2 then added TIMING on top of that, for a total of 6
options prior to PostgreSQL 10.

> It wouldn't make sense for core to have an EXPLAIN
> option whose whole purpose is to cater to the needs of some extension,
> so that made me think of providing some extensibility infrastructure.

+1, Neon would greatly appreciate infrastructure to allow extending EXPLAIN.

> 0001 allows a loadable module to register new EXPLAIN options.
> Currently, EXPLAIN (FUNGUS) will error out, but if you want to make it
> work, this patch is for you. This patch also allows you to stash some
> state related to your new option, or options, in the ExplainState.
> Core options have hard-coded structure members; e.g. EXPLAIN (BUFFERS)
> sets es->buffers. If you add EXPLAIN (FUNGUS), there won't be an
> es->fungus, but you can get about the same effect using the new
> facilities provided here.

Does this work with parallel workers' stats?
I can't seem to figure out whether or where parallel workers would
pass through their extended explain statistics, mostly because of the
per-backend nature of ID generation making the pointers of
ExplainState->extension_state unshareable.

Kind regards,

Matthias van de Meent



Re: making EXPLAIN extensible

From
Robert Haas
Date:
First, thanks to all who have said they like this idea. I wasn't
expecting this much enthusiasm, to be honest. Woohoo!

On Mon, Mar 3, 2025 at 8:27 AM Matheus Alcantara
<matheusssilv97@gmail.com> wrote:
> It would make sense (or possible) to have some kind of validation that returns
> an error when using an option that is ambiguous? If a option is unique for all
> options registered via extensions than the extension name prefix is not needed,
> otherwise an error is returned forcing the user to specify the extension name.

I'm not saying that this couldn't be done -- it definitely could --
but to me it seems somewhat pointless. I mean, you'd just get the
error when you try to load the second of the two extensions, and then
what are you supposed to do about it at that point? Really, it's
incumbent on EXPLAIN-extension developers to avoid picking names that
conflict with other EXPLAIN extensions that the same users might want
to use. If they don't do that, everything sucks. By jiggering things
around you can cause the result to be either (a) some options are
ignored (current behavior) or (b) errors occur during extension
loading (your proposal) or (c) errors occur at runtime or even (d)
suddenly the conflicting options need to be spelled in a very verbose
way with a module name prefix instead of using them normally. But I
think no matter which of those things happen, life is pretty bad for
the user. The only real solution, AFAICS, is for the extension
developers to be thoughtful about the option names that they pick.

I'm not saying that's definitely going to happen, and I can be
convinced to add something to the patch to cater to it. But only if we
all agree on exactly what to add and exactly why that's going to be
better than doing nothing. My current view - as explained above - is
that anything we add is just going to be increasing the amount of code
without really making the user experience any better. If that's
correct, it's better to just keep it simple, as I have done.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Mon, Mar 3, 2025 at 9:14 AM Matthias van de Meent
<boekewurm+postgres@gmail.com> wrote:
> I think you meant "some time prior to PostgreSQL 10".
> PostgreSQL 9.0 had 5 options, of which COSTS, BUFFERS, and FORMAT were
> newly added, so only before 9.0 we had 2 options.
> PostgreSQL 9.2 then added TIMING on top of that, for a total of 6
> options prior to PostgreSQL 10.

Probably I meant 9 rather than 10, then.

> +1, Neon would greatly appreciate infrastructure to allow extending EXPLAIN.

Cool.

> Does this work with parallel workers' stats?
> I can't seem to figure out whether or where parallel workers would
> pass through their extended explain statistics, mostly because of the
> per-backend nature of ID generation making the pointers of
> ExplainState->extension_state unshareable.

I don't fully understand what your question is. I think there are a
couple of separate points to consider here.

First, I don't think we ever store an ExplainState in DSM. If we do,
then the per-backend nature of ID generation is a fundamental design
issue and needs to be rethought. Otherwise, I don't see why it
matters.

Second, I did not add a hook to allow an extension to add data to a
"Worker N" section. I'm open to suggestions.

Third, regardless of parallel query, there is a general problem with
this infrastructure if what you want to do is print out some
instrumentation data. Sure, the hooks might allow you to get control
at a point where you can print some stuff, but how are you supposed to
get the stuff you want to print? planduration, bufusage, and
mem_counters are passed down to ExplainOnePlan(); and there's other
stuff in struct Instrumentation that is used in ExplainNode(), but
those approaches don't seem to scale nicely to arbitrary new things
that somebody might want to measure. While I welcome ideas about how
to fix that, my current view is that it's a job for a separate patch
set.

In general, it's expected that each parallel-aware node may register a
shm_toc entry using the plan_node_id as the key. So if you wanted
per-worker instrumentation of any sort of some particular node, you
could possibly add it to that chunk of memory. This would work well,
for example, for a custom scan, or any other case where the node is
under the control over the same code that is trying to instrument
stuff. A patch to core could extend both the node's DSM footprint and
the explain.c code that prints data from it. However, if you want to
do something like "for every executor node, count the number of flying
spaghetti monster tendrils that pass through the computer during the
execution of that node," there's not really any great way of doing
that today, with or without this patch, and with or without parallel
query. I mean, you can patch core, but that's it; there's no
extensibility here.

I'm not sure if any of this is responsive to your actual question; if
not, please help me get on the right track.

Thanks,

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Guillaume Lelarge
Date:
On 28/02/2025 20:26, Robert Haas wrote:
> Prior to PostgreSQL 10, EXPLAIN had just 2 options: VACUUM and
> ANALYZE. Now, we're up to 12 options, which is already quite a lot,
> and there's plenty more things that somebody might like to do.
> However, not all of those things necessarily need to be part of the
> core code. My original reason for wanting to extend EXPLAIN was that I
> was thinking about an extension that would want to do a bunch of
> things and one of those things would be to add some information to the
> EXPLAIN output. It wouldn't make sense for core to have an EXPLAIN
> option whose whole purpose is to cater to the needs of some extension,
> so that made me think of providing some extensibility infrastructure.
> 
> However, there are other use cases, too, basically any of the normal
> reasons why extensibility is useful and desirable. You might need to
> get some information out a query plan that 99% of people don't care
> about. You could come up with your own way of formatting a query plan,
> but that's a big pain. It's a lot nicer if you can just add the detail
> that you care about to the EXPLAIN output without needing to modify
> PostgreSQL itself. Even if you think of something that really ought to
> be included in the EXPLAIN output by PostgreSQL, you can roll an
> extension out much quicker than you can get a change upstreamed and
> released. So I think EXPLAIN extensibility is, as a general concept,
> useful.
> 
> So here are some patches.
> 
> 0001 allows a loadable module to register new EXPLAIN options.
> Currently, EXPLAIN (FUNGUS) will error out, but if you want to make it
> work, this patch is for you. This patch also allows you to stash some
> state related to your new option, or options, in the ExplainState.
> Core options have hard-coded structure members; e.g. EXPLAIN (BUFFERS)
> sets es->buffers. If you add EXPLAIN (FUNGUS), there won't be an
> es->fungus, but you can get about the same effect using the new
> facilities provided here.
> 
> 0002 provides hooks that you can use to make your new EXPLAIN options
> actually do something. In particular, this adds a new hook that is
> called once per PlanState node, and a new nook that is called once per
> PlannedStmt. Each is called at an appropriate point for you to tack on
> more output after what EXPLAIN would already produce.
> 
> 0003 adds a new contrib module called pg_overexplain, which adds
> EXPLAIN (DEBUG) and EXPLAIN (RANGE_TABLE). I actually think this is
> quite useful for planner hacking, and maybe a few more options would
> be, too. Right now, if you want to see stuff that EXPLAIN doesn't
> clearly show, you have to use SET debug_print_plan = true, and that
> output is so verbose that finding the parts you actually want to see
> is quite difficult. Assuming it gives you the details you need,
> EXPLAIN (RANGE_TABLE) looks way, way better to me, and if we end up
> committing these patches I anticipate using this semi-regularly.
> 
> There are plenty of debatable things in this patch set, and I mention
> some of them in the commit messages. The hook design in 0002 is a bit
> simplistic and could be made more complex; there's lots of stuff that
> could be added to or removed from 0003, much of which comes down to
> what somebody hacking on the planner would actually want to see. I'm
> happy to bikeshed all of that stuff; this is all quite preliminary and
> I'm not committed to the details. The only thing that would disappoint
> me is if somebody said "this whole idea of making EXPLAIN extensible
> is stupid and pointless and we shouldn't ever do it." I will argue
> against that vociferously. I think even what I have here is enough to
> disprove that hypothesis, but I have a bunch of ideas about how to do
> more. Some of those require additional infrastructure and are best
> proposed with that other infrastructure; some can be done with just
> this, but I ran out of time to code up examples so here is what I have
> got so far.
> 
> Hope you like it, sorry if you don't.
> 

I definitely LOVE it. I tried your patches and it works great. No real 
surprise here :) I tried to code my own library (entirely based on 
yours), and it's quite nice. Patch attached, not intended to be applied 
on the repo, but just a nice use case.

This library adds "Tip" line for each tip it can give on a specific 
node. Right now, it only handles "Rows Removed by Filter" on a 
sequential scan, but there's much more we could add to it.

Here is an example on how to use it:

postgres=# show shared_preload_libraries ;
  shared_preload_libraries
--------------------------
  pg_explaintips
(1 row)

postgres=# create table t1 (id integer); 
  

CREATE TABLE 
  

postgres=# insert into t1 select generate_series(1, 1000); 
  

INSERT 0 1000
postgres=# explain (analyze,costs off,tips) select * from t1 where id>2; 

                           QUERY PLAN
---------------------------------------------------------------
  Seq Scan on t1 (actual time=0.042..0.337 rows=998.00 loops=1)
    Filter: (id > 2)
    Rows Removed by Filter: 2
    Buffers: shared hit=5
  Planning:
    Buffers: shared hit=4
  Planning Time: 0.079 ms
  Execution Time: 0.479 ms
(8 rows)

postgres=# explain (analyze,costs off,tips) select * from t1 where id<2; 

                          QUERY PLAN
-------------------------------------------------------------
  Seq Scan on t1 (actual time=0.014..0.113 rows=1.00 loops=1)
    Filter: (id < 2)
    Rows Removed by Filter: 999
    Buffers: shared hit=5
    Tips: You should probably add an index!
  Planning Time: 0.035 ms
  Execution Time: 0.127 ms
(7 rows)

postgres=# explain (analyze,costs off,tips off) select * from t1 where 
id<2;
                          QUERY PLAN
-------------------------------------------------------------
  Seq Scan on t1 (actual time=0.009..0.067 rows=1.00 loops=1)
    Filter: (id < 2)
    Rows Removed by Filter: 999
    Buffers: shared hit=5
  Planning:
    Buffers: shared hit=5
  Planning Time: 0.070 ms
  Execution Time: 0.076 ms
(8 rows)

Just great. Hope your patchs will find their way in the 18 release. 
Thanks a lot.


-- 
Guillaume Lelarge
Consultant
https://dalibo.com
Attachment

Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 28/2/2025 20:26, Robert Haas wrote:
> So here are some patches.
Yes, this is a big pain for extension developers. As I remember, it was 
discussed multiple times in the hackers' mailing list.
Because there is no explain node hook, I use a patch in almost each of 
my extensions: I write optimisation helpers, and it is necessary to show 
which node was influenced and how. I guess pg_hint_plan will also profit 
from this extra extensibility.

Passing through the patches, I would say that changing the order of 0001 
and 0002 would make them more independent.
Also, I'm ok with the floating order of extension messages in the 
explain output. We get used to living with dependencies on extension 
load order (pg_stat_statements quite annoyingly impacts queryid, for 
example), and this issue should be solved generally, in my opinion.
I support the way where extensions are allowed to print info but not 
restructure or remove something.
Wait for the commit!

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 4, 2025 at 8:56 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> Passing through the patches, I would say that changing the order of 0001
> and 0002 would make them more independent.

Hmm, I thought this order made sense, but I could reorder them if
there's some compelling reason to do so. If there's no particularly
compelling reason, it would be less work to commit them in this order.

> Also, I'm ok with the floating order of extension messages in the
> explain output. We get used to living with dependencies on extension
> load order (pg_stat_statements quite annoyingly impacts queryid, for
> example), and this issue should be solved generally, in my opinion.

I've often thought that the solution to this class of problems could
be to have extensions not manipulate a hook variable directly, but
instead call a function to which they pass their callback function and
an integer priority. Then we could call hook functions in priority
order. One difficulty is that this requires extension authors to agree
on what the priority order should be. In some cases that might not be
too hard, but it isn't apparent how it would work here.

IMHO, it's reasonable for the author of an EXPLAIN extension to say
"well, I see Robert already created an extension with an option called
DEBUG, so I will name my option TROUBLESHOOT," or something of that
sort. But if Robert gave the DEBUG hook priority 50, should I also
give my hook priority 50, or should I make it 40 or 25 or 1 or 100 or
what? Even if I know about all of the other extensions it's not really
clear what I should do. Actually, in this case, it feels like it would
be better if the user could control the ordering somehow, but I feel
like that might be over-engineering.

> I support the way where extensions are allowed to print info but not
> restructure or remove something.
> Wait for the commit!

Cool, thanks.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 4/3/2025 15:23, Robert Haas wrote:
> On Tue, Mar 4, 2025 at 8:56 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
>> Passing through the patches, I would say that changing the order of 0001
>> and 0002 would make them more independent.
> 
> Hmm, I thought this order made sense, but I could reorder them if
> there's some compelling reason to do so. If there's no particularly
> compelling reason, it would be less work to commit them in this order.
I have no compelling reason except avoiding adding/removing explain.h 
into the head of auto_explain.c and file_fdw.c
> 
>> Also, I'm ok with the floating order of extension messages in the
>> explain output. We get used to living with dependencies on extension
>> load order (pg_stat_statements quite annoyingly impacts queryid, for
>> example), and this issue should be solved generally, in my opinion.
> 
> I've often thought that the solution to this class of problems could
> be to have extensions not manipulate a hook variable directly, but
> instead call a function to which they pass their callback function and
> an integer priority. Then we could call hook functions in priority
> order. One difficulty is that this requires extension authors to agree
> on what the priority order should be. In some cases that might not be
> too hard, but it isn't apparent how it would work here.
> 
> IMHO, it's reasonable for the author of an EXPLAIN extension to say
> "well, I see Robert already created an extension with an option called
> DEBUG, so I will name my option TROUBLESHOOT," or something of that
> sort. But if Robert gave the DEBUG hook priority 50, should I also
> give my hook priority 50, or should I make it 40 or 25 or 1 or 100 or
> what? Even if I know about all of the other extensions it's not really
> clear what I should do. Actually, in this case, it feels like it would
> be better if the user could control the ordering somehow, but I feel
> like that might be over-engineering.
I think the same way. It would be clearer for an observer to have a 
dependency on load order everywhere than different orders in different 
places with no strong guarantees.

Also, I think this feature is quite close to the discussion on the 
possibility of adding an extensible list field into Query, PlanState, 
Plan, etc. nodes to let extensions gather and transfer some additional 
data starting with the first 'analyze' hook up to the end of execution.
For example, in solving user issues, I frequently need to know 
predictions on the number of groups in Memoize, IncrementalSort and some 
other nodes. Such extensibility could allow an extension to gather such 
internal data during the planning stage and show it in the explain 
without any changes in the core!

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 4, 2025 at 10:12 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> Also, I think this feature is quite close to the discussion on the
> possibility of adding an extensible list field into Query, PlanState,
> Plan, etc. nodes to let extensions gather and transfer some additional
> data starting with the first 'analyze' hook up to the end of execution.
> For example, in solving user issues, I frequently need to know
> predictions on the number of groups in Memoize, IncrementalSort and some
> other nodes. Such extensibility could allow an extension to gather such
> internal data during the planning stage and show it in the explain
> without any changes in the core!

If you're saying there's a -hackers discussion on this, could you
provide a link? I haven't seen it.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 4/3/2025 16:14, Robert Haas wrote:
> On Tue, Mar 4, 2025 at 10:12 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
>> Also, I think this feature is quite close to the discussion on the
>> possibility of adding an extensible list field into Query, PlanState,
>> Plan, etc. nodes to let extensions gather and transfer some additional
>> data starting with the first 'analyze' hook up to the end of execution.
>> For example, in solving user issues, I frequently need to know
>> predictions on the number of groups in Memoize, IncrementalSort and some
>> other nodes. Such extensibility could allow an extension to gather such
>> internal data during the planning stage and show it in the explain
>> without any changes in the core!
> 
> If you're saying there's a -hackers discussion on this, could you
> provide a link? I haven't seen it.
> 
I wouldn't say there is a thread in the mailing list. I mentioned this 
direction of extensibility multiple times (for example, [1,2]) with no 
reaction. However, letting extensions show data in explan gives this 
idea additional impulse.

[1] 
https://www.postgresql.org/message-id/30113d59-8678-49ca-a8fb-bbceacf7efb0%40gmail.com
[2] 
https://www.postgresql.org/message-id/CA%2BTgmoYXgBVCnFhrW3X1NxpdjWtJCYRKP38PQ-AdR-RJziTBUQ%40mail.gmail.com

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 4, 2025 at 10:26 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> I wouldn't say there is a thread in the mailing list. I mentioned this
> direction of extensibility multiple times (for example, [1,2]) with no
> reaction. However, letting extensions show data in explan gives this
> idea additional impulse.

I agree that it's worth considering. I'd like to get this patch set
committed first, because I feel like it's already good enough to let
people do a bunch of cool stuff, and then we can build on top of it
later to let people do even more cool stuff. I do have some ideas that
involve piping plan-time data through to the final plan so that it can
be made visible via EXPLAIN. However, I think there are several
challenging design questions that need to be figured out, including:

(1) exactly how do we pipe that plan-time data through to the final plan?

(2) how should the plan-time hooks be designed to let people do as
many interesting things as possible with as few hooks as possible?

It sounds like you might already have some ideas about how those
questions should be answered, but I haven't thought about it enough
yet to feel confident and I don't want to make those decisions now.
Let's keep this thread focused on these patches, and we can look at
what else to do once that's done.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Jeff Davis
Date:
+1 to the general idea, I didn't look at the patches yet.

On Fri, 2025-02-28 at 15:32 -0500, Robert Haas wrote:
> 1. It is much more verbose. I theorize that people will be unhappy
> about having to type EXPLAIN (pg_overexplain.range_table) rather than
> just EXPLAIN (range_table).

That was my first reaction.

> That's true. Of course, a lot depends on whether we end up with 3 or
> 5
> or 8 EXPLAIN extensions or more like 30 or 50 or 80. In the former
> case, the people writing those extensions will probably mostly know
> about each other and can just use different names.

I don't expect there to be zillions of extensions that only add new and
exciting explain options. Instead, it seems more likely that all
TableAM and CustomScan extensions will have 1-3 new explain options,
and that some of those might collide. For example, several may have a
EXPLAIN(PUSHDOWN) option that explains what work is being pushed down
into the TableAM/CustomScan.

In that case it's not even clear to me that a collision is a problem.
Would you really only want pushdown information from extension A, and
be upset that it also emits pushdown information from extension B?
Maybe we should just allow multiple extensions to use the same option
name?

Regards,
    Jeff Davis






Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 4, 2025 at 1:53 PM Jeff Davis <pgsql@j-davis.com> wrote:
> I don't expect there to be zillions of extensions that only add new and
> exciting explain options. Instead, it seems more likely that all
> TableAM and CustomScan extensions will have 1-3 new explain options,
> and that some of those might collide. For example, several may have a
> EXPLAIN(PUSHDOWN) option that explains what work is being pushed down
> into the TableAM/CustomScan.
>
> In that case it's not even clear to me that a collision is a problem.
> Would you really only want pushdown information from extension A, and
> be upset that it also emits pushdown information from extension B?
> Maybe we should just allow multiple extensions to use the same option
> name?

One fairly big problem with that idea is that options need not be
simple Booleans.  If extension A adds PUSHDOWN { 'summary' | 'detail'
} and extension B adds PUSHDOWN { 'scan' | 'join' | 'agg' }, it's
definitely not going to work, even if we arrange to call handlers for
both extensions. If we want to have EXPLAIN keywords that have
well-defined meanings that span across different
FDWs/tableAMs/whatever, I think we should add those to core and
document what we expect extensions to do. This mechanism is really for
cases where you need a completely new option that only you will care
about.

Actually, I don't think custom scans or table AMs are the design
center for this feature. Keep in mind that, for a custom scan, we
already have ExplainCustomScan as part of CustomScanState->methods. We
don't currently have anything similar for table AMs, and you could
work around that with these hooks, by checking every node to see
whether it's a scan node and if so whether it scans a relation that
matches your table AM, but it would probably be better to have a
method for it, similar to what we have for CustomScan, if this is
something people want to do. It would be more efficient and require
less code.

Granted, those interfaces don't let you add completely new options,
but I think actually for custom scans and table AMs you most likely
want to just display whatever details you have unconditionally, or
conditional on es->verbose. I'm not sure there's a real reason to add
a new option that gates, say, postgres_fdw's display of join pushdown
information. That seems awfully specific. You'd need a reason why you
would want to control that display separately from everything else,
and the use case for that seems thin.

Where I see this being more useful is for people who want to display
additional information for plan nodes that they did not implement. For
example, my EXPLAIN (RANGE_TABLE) option dumps every
range-table-related fact it can find in the Plan tree. That includes
both information about which plan nodes (as displayed by EXPLAIN) are
scanning which RTIs, and also a list of all the RTIs and a bunch of
properties of each one. When you're trying to show information by
topic, as in this case, hooks on individual node types like custom
scan or (hypothetically) table AM callbacks don't help you get the job
done -- and neither do the existing options.

That having been said, the vigorous response to this proposal so far
suggests to me that people probably will want to use this for things
that are a bit different than what I had in mind, and that's fine.
Maybe there's even some way to adjust this proposal so that it can
suit some of those other use cases better. But, I'm doubtful that
letting unrelated extensions try to share the same option name is that
thing.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 4/3/2025 22:23, Robert Haas wrote:
> On Tue, Mar 4, 2025 at 1:53 PM Jeff Davis <pgsql@j-davis.com> wrote:
>> I don't expect there to be zillions of extensions that only add new and
>> exciting explain options. Instead, it seems more likely that all
>> TableAM and CustomScan extensions will have 1-3 new explain options,
>> and that some of those might collide. For example, several may have a
>> EXPLAIN(PUSHDOWN) option that explains what work is being pushed down
>> into the TableAM/CustomScan.
>>
>> In that case it's not even clear to me that a collision is a problem.
>> Would you really only want pushdown information from extension A, and
>> be upset that it also emits pushdown information from extension B?
>> Maybe we should just allow multiple extensions to use the same option
>> name?
> ...
> Actually, I don't think custom scans or table AMs are the design
> center for this feature. Keep in mind that, for a custom scan, we
> already have ExplainCustomScan as part of CustomScanState->methods. We
> don't currently have anything similar for table AMs, and you could
> work around that with these hooks, by checking every node to see
> whether it's a scan node and if so whether it scans a relation that
> matches your table AM, but it would probably be better to have a
> method for it, similar to what we have for CustomScan, if this is
> something people want to do. It would be more efficient and require
> less code.
+1. In my experience, ExplainCustomScan has always been enough for the 
CustomScan node.
As for extensions collision - for now, it would be better to treat 
extensions as independent actors, suggesting that the developer, 
designing a software solution based on an extensions' constellation, 
will arrange their behaviour during development.
For instance,  if your library exports a function or variable, adding a 
prefix is essential to prevent overlapping functions when another 
library is loaded.
I recall that Yurii Rashkovskii is a proponent of using multiple 
extensions within a single installation. Perhaps he has insights on this 
topic?

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Jeff Davis
Date:
On Tue, 2025-03-04 at 16:23 -0500, Robert Haas wrote:

> But, I'm doubtful that
> letting unrelated extensions try to share the same option name is
> that
> thing.

This sub-discussion started because we were wondering whether to prefix
the options. I'm just pointing out that, even if there is a collision,
and it happened to work, it's as likely to be a feature as a bug.

I didn't look into the technical details to see what might be required
to allow that kind of collaboration, and I am not suggesting you
redesign the entire feature around that idea.

Regards,
    Jeff Davis




Re: making EXPLAIN extensible

From
Jeff Davis
Date:
It would be good to clarify whether this is for (a) experimenting with
explain options that might be useful in core some day; or (b) special
developer-only options that would never be useful in core; or (c)
production-grade explain options specific to an extension.

On Tue, 2025-03-04 at 16:23 -0500, Robert Haas wrote:
> Where I see this being more useful is for people who want to display
> additional information for plan nodes that they did not implement.
> For
> example, my EXPLAIN (RANGE_TABLE) option dumps every
> range-table-related fact it can find in the Plan tree.

That sounds like (b) or maybe (a).

But the first motivation you listed when introducing the patch was: "It
wouldn't make sense for core to have an EXPLAIN option whose whole
purpose is to cater to the needs of some extension, so that made me
think of providing some extensibility infrastructure."

which sounds like (c).

And if (c) is part of the intended use, but not CustomScan or TableAM,
then what kind of extensions would need extension-specific explain
options?

I am not trying to push the patch in any particular direction. On the
contrary, I'd just like to know the scope of the feature so that I can
stop accidentally pushing it in some direction by asking questions
about out-of-scope use cases.

Regards,
    Jeff Davis




Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 5, 2025 at 12:20 PM Jeff Davis <pgsql@j-davis.com> wrote:
> I didn't look into the technical details to see what might be required
> to allow that kind of collaboration, and I am not suggesting you
> redesign the entire feature around that idea.

OK. It sounds to me like there is a good amount of support for going
forward with something like what I have, even though some people might
also like other things. What I feel is currently lacking is some
review of the actual code. Would anyone like to do that?

Here is a v2 with some documentation and regression tests for
pg_overexplain added.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 5, 2025 at 12:52 PM Jeff Davis <pgsql@j-davis.com> wrote:
> It would be good to clarify whether this is for (a) experimenting with
> explain options that might be useful in core some day; or (b) special
> developer-only options that would never be useful in core; or (c)
> production-grade explain options specific to an extension.

I think it could be any of these.

> And if (c) is part of the intended use, but not CustomScan or TableAM,
> then what kind of extensions would need extension-specific explain
> options?
>
> I am not trying to push the patch in any particular direction. On the
> contrary, I'd just like to know the scope of the feature so that I can
> stop accidentally pushing it in some direction by asking questions
> about out-of-scope use cases.

Heh, no problem. I see it probably being most useful for extensions
that touch the planner in some way that cuts across multiple node
types. We don't have a lot of those in contrib yet, but that's partly
because we don't have much infrastructure to support them.

Alena Rybakina gives an example upthread: "while writing the AQO
extension [0] we were just adding a hook (we called it
ExplainOnePlan_hook) similar to this one to add a description for the
node used in the plan, in particular its identifier and the
cardinality that is used during query planning." Andrei Lephikov's
comment upthread suggests similar things: "I write optimisation
helpers, and it is necessary to show which node was influenced and
how".

Another example - and the one that motivated this work - was my
proposal http://postgr.es/m/CA+TgmoZY+baV-T-5ifDn6P=L=aV-VkVBrPmi0TQkcEq-5Finww@mail.gmail.com
- this is an offshoot of that work, several steps removed.

I'm not sure if I'm being clear here, but the difference between these
examples and the ones you proposed is whether what you're adding to
the planner in your extension looks more like a new type of node or
more like a new overall feature. A Custom Scan node or a new table AM
is a new kind of thing that you can scan; those sorts of examples
probably want to solve their problems in some other way, though they
could use this infrastructure if they really wanted. On the other
hand, Alena's example of adaptive query optimization or my example of
letting extensions nudge the planner are a new type of optimizer
capability that happens to live in an extension. They don't show up in
a specific place in the plan; they affect things at some higher level.
That kind of thing is where I expect this infrastructure to be high
value.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> OK. It sounds to me like there is a good amount of support for going
> forward with something like what I have, even though some people might
> also like other things. What I feel is currently lacking is some
> review of the actual code. Would anyone like to do that?

Here's some comments on 0001 and 0002; I didn't have time for
0003 today.  But in any case, I think you should move forward
with committing 0001/0002 soon so other people can play around
in this space.  0003 can be left for later.

0001:

GetExplainExtensionState and SetExplainExtensionState should probably
have guards against extension_id < 0, even if it's just an Assert.
Or make the id unsigned?

SetExplainExtensionState is repalloc'ing ExplainExtensionNameArray;
why?  Wouldn't that invalidate any other pointers to the array?
It disturbs me that you have ExplainState carrying what could
easily become a stale pointer to that array; I'd try to get rid
of that.

RegisterExtensionExplainOption has "char *option_name", but I think
that should be "const char *", and the function should have the same
disclaimer as GetExplainExtensionId about that string needing to be
constant-or-never-freed.

This bit in explain.h seems non-idiomatic:

+struct ExplainState;
+typedef struct ExplainState ExplainState;

AFAIK it's sufficient to write the typedef line.  Also I'd
strongly recommend a comment saying that the struct is
defined in explain_state.h.

explain_state.h has the same pattern:

+struct ExplainState;
+typedef struct ExplainState ExplainState;

This is a little more problematic, because I'm pretty sure some
compilers will kvetch when they see the duplicate typedefs.
You need to either (a) write the typedef in only one file,
and use "struct ExplainState" in the other file, or (b)
use some dance like

#ifndef EXPLAINSTATE_TYPEDEF_DEFINED
typedef struct ExplainState ExplainState;
#define EXPLAINSTATE_TYPEDEF_DEFINED 1
#endif

in both files.  If memory serves we do have some headers using
the (b) pattern, but I don't love it particularly.

0002:

I'm fine with the order of additions being determined by module
load order.  Users who are feeling picky about that can arrange
the module load order as they wish.  If we put in a priority
mechanism then the order would be determined by module authors,
who probably don't want the responsibility, and not by the users
whose results are actually affected.

I'm quite confused by the #include additions in auto_explain.c and
file_fdw.c, and I strongly object to the ones in explain_state.h.
Surely those are unnecessary?

Anyway, these are all very minor concerns; overall I think
it's going in the right direction.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 5, 2025 at 1:53 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Here's some comments on 0001 and 0002; I didn't have time for
> 0003 today.  But in any case, I think you should move forward
> with committing 0001/0002 soon so other people can play around
> in this space.  0003 can be left for later.

Cool. Thank you very much for the review!

> GetExplainExtensionState and SetExplainExtensionState should probably
> have guards against extension_id < 0, even if it's just an Assert.
> Or make the id unsigned?

I like the Assert() idea better, so I added that. This way, an
extension can write "static int es_extension_id = -1;" or similar and
the Assert() will catch use-before-initialization errors.

> SetExplainExtensionState is repalloc'ing ExplainExtensionNameArray;
> why?  Wouldn't that invalidate any other pointers to the array?

Wow, ouch. That's a brown-paper-bag bug. It should be allocating and
then reallocating es->extension_state. The point is to make sure the
assignment at the tail end of the function isn't indexing off the
allocated portion of the array.

> RegisterExtensionExplainOption has "char *option_name", but I think
> that should be "const char *", and the function should have the same
> disclaimer as GetExplainExtensionId about that string needing to be
> constant-or-never-freed.

Added.

> This bit in explain.h seems non-idiomatic:
> explain_state.h has the same pattern:

Fixed, I hope.

> 0002:
>
> I'm fine with the order of additions being determined by module
> load order.  Users who are feeling picky about that can arrange
> the module load order as they wish.  If we put in a priority
> mechanism then the order would be determined by module authors,
> who probably don't want the responsibility, and not by the users
> whose results are actually affected.

Check.

> I'm quite confused by the #include additions in auto_explain.c and
> file_fdw.c, and I strongly object to the ones in explain_state.h.
> Surely those are unnecessary?

They are necessary but they should have been part of 0001. Because
0001 moves the definition of ExplainState to explain_state.h, files
that need to access to the members of that structure now need to
include that header file. As for the includes in explain_state.h, it
needs definitions for DefElem, ParseState, and PlannedStmt.

> Anyway, these are all very minor concerns; overall I think
> it's going in the right direction.

I am very happy to hear that. Thanks!

v3 attached.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> On Wed, Mar 5, 2025 at 1:53 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> I'm quite confused by the #include additions in auto_explain.c and
>> file_fdw.c, and I strongly object to the ones in explain_state.h.
>> Surely those are unnecessary?

> They are necessary but they should have been part of 0001.

Ah.  I was mistakenly assuming that 0001 would compile on its own ;-)

> Because
> 0001 moves the definition of ExplainState to explain_state.h, files
> that need to access to the members of that structure now need to
> include that header file. As for the includes in explain_state.h, it
> needs definitions for DefElem, ParseState, and PlannedStmt.

Got it.  So does that mean we can remove any #include's from explain.h
after moving the struct definition?

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 5, 2025 at 4:00 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Ah.  I was mistakenly assuming that 0001 would compile on its own ;-)

Oopsie.

> Got it.  So does that mean we can remove any #include's from explain.h
> after moving the struct definition?

Good question. It looks like "lib/stringinfo.h" can come out but the
other two are still needed, so there is not much gain. But I've made
that change in the attached v4.

I also added "= NULL" to a couple of variable initialization in 0003
in this version, because Andres pointed out to me that cfbot was
seeing compiler warnings. It's a little odd that it warned, because
the variables were initialized by a switch over all enum values, but I
guess some compiler doesn't find that convincing enough.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> On Wed, Mar 5, 2025 at 4:00 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Got it.  So does that mean we can remove any #include's from explain.h
>> after moving the struct definition?

> Good question. It looks like "lib/stringinfo.h" can come out but the
> other two are still needed, so there is not much gain. But I've made
> that change in the attached v4.

OK.

v4 has addressed most of my nitpicks, but you still have typedefs
for ExplainState in both header files.  My bet is that at least
one buildfarm animal will complain about that.  I could be wrong
though, maybe all such compilers are in disuse now.

Also I noted one more nitpick: I think SetExplainExtensionState
needs to grow the array with repalloc0 not repalloc, else you
risk GetExplainExtensionState returning garbage pointers for
not-yet-set array entries.

In perhaps-less-nitpicky territory, I wonder how well the
"per backend" notion of extension and option IDs plays with
extensions that assign those during _PG_init, as I see you've
done in 0003.  What I'm concerned about is what happens when
the extension is loaded using shared_preload_libraries.  I
think what will happen is

(1) IDs will be assigned in the postmaster, and nothing bad
happens as long as TopMemoryContext already exists, which it will.

(2) Such IDs propagate to backends via fork(), and thus are
effectively system-wide.

(3) But ... in an EXEC_BACKEND build, it doesn't work like
that.  Child processes will repeat the postmaster's processing
of shared_preload_libraries.  Probably they will load the
modules in the same order and get the same ID assignments,
but I'm not quite sure that's guaranteed.

In short, I wonder if somebody might build code that depends
on the IDs being the same across a cluster, and find that
it mostly works but sometimes not on Windows.  Maybe we don't
care given that we explicitly disclaimed them being the same.
But it's probably worth a bit of thought.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 5, 2025 at 4:38 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> v4 has addressed most of my nitpicks, but you still have typedefs
> for ExplainState in both header files.  My bet is that at least
> one buildfarm animal will complain about that.  I could be wrong
> though, maybe all such compilers are in disuse now.

Ugh, I suck at this, sorry. Adjusted in v5. It's hard to avoid the
conclusion that our IWYU configuration must be fairly lenient, because
every change seems to surface more source files that are depending on
indirect includes.

> Also I noted one more nitpick: I think SetExplainExtensionState
> needs to grow the array with repalloc0 not repalloc, else you
> risk GetExplainExtensionState returning garbage pointers for
> not-yet-set array entries.

Hardly a nitpick, thanks. I've tried to fix this.

> In short, I wonder if somebody might build code that depends
> on the IDs being the same across a cluster, and find that
> it mostly works but sometimes not on Windows.  Maybe we don't
> care given that we explicitly disclaimed them being the same.
> But it's probably worth a bit of thought.

I think that if the only thing a backend does with the ID returned by
GetExplainExtensionId is call GetExplainExtensionState() and
SetExplainExtensionState(), there's no problem here. So to me, the
question here is whether there's a reason to use that ID for anything
else.

If there is, maybe we should discard the integer IDs and just use
strings directly. I could make GetExplainExtensionState() and
SetExplainExtensionState() take const char *extension_name arguments
instead of int extension_id, and just get rid of GetExplainExtensionId
entirely. I chose the current design because I thought that the IDs
did not need to be shared and I thought it would be a bit extravagant
to have an EXPLAIN extension need to do a hash table lookup for every
plan node, but maybe it wouldn't really matter in practice.

But I wonder what else you think that the ID might get used for. If
you're trying to do some kind of general coordination across all
backends using shared memory, we have ShmemInitStruct() and
GetNamedDSMSegment() for that purpose, and those do indeed use names,
and an extension should probably use those interfaces for that kind of
work rather than trying to do something with the explain extension ID
that is not intended.

The best argument that I can see for maybe using that extension ID for
some other purpose is parallel query. The ExplainState is not shared
across backends and presumably never will be, so there's no immediate
problem. But you can perhaps imagine someone wanting to do more than
this infrastructure allows. In particular, I'm imagining that people
might want to run extensions that annotate parts of the plan with
extension-provided data. If such annotations were added at plan time,
and if they used the extension ID, that still wouldn't give rise to
any cross-backend use, but if they were added at execution time, then
parallel query could be happening, and if the extension ID were then
used, trouble could occur.

But to me that sounds a little too hypothetical for me to be concerned
about it. For one thing, we haven't actually agreed on doing anything
of that sort. More importantly, using integer extension IDs for this
feature does not require that we use the integer extension IDs for
that feature, if and when it gets implemented, and it certainly
doesn't require that we use the same ones. And I feel like it might be
slightly surprising if it did, because they seem like two different
features that maybe ought to work in different ways. Those
hypothetical future features seem like part of the planner or part of
the executor, respectively, so I can't really see a reason why they
would pull their main identifier out of the EXPLAIN system.

So I guess I can't really think of a convincing reason why the current
design would cause a problem for anybody. Of course, my imagination
might be lacking.

Regarding commit timeframe, you expressed a preference for 0001 and
0002 to be committed "soon," but I'm going to be leaving town for a
week on Saturday and would prefer not to put myself in a situation
where somebody is expecting me to fix the buildfarm while I am gone.
So I'm happy to commit them today or tomorrow if you don't mind being
responsible for anything that blows up while I'm gone, or I'm happy to
have you commit them whenever you want, but barring one of those
outcomes, I'm going to deal with it on or after March 17th.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> If there is, maybe we should discard the integer IDs and just use
> strings directly. I could make GetExplainExtensionState() and
> SetExplainExtensionState() take const char *extension_name arguments
> instead of int extension_id, and just get rid of GetExplainExtensionId
> entirely. I chose the current design because I thought that the IDs
> did not need to be shared and I thought it would be a bit extravagant
> to have an EXPLAIN extension need to do a hash table lookup for every
> plan node, but maybe it wouldn't really matter in practice.

I find a good deal of attraction in getting rid of the IDs and
just using names.  Nor do I believe we need a hash table.
(1) Surely there will not be so many extensions using this within a
single EXPLAIN that a simple loop with strcmp()'s isn't good enough.
(2) The IDs aren't free either; where will an extension keep the
ID it assigned?  We're trying to move away from global variables.

But, if you're convinced otherwise, the current design is OK.

Other than that point, I think 0001 and 0002 are ready.

> Regarding commit timeframe, you expressed a preference for 0001 and
> 0002 to be committed "soon," but I'm going to be leaving town for a
> week on Saturday and would prefer not to put myself in a situation
> where somebody is expecting me to fix the buildfarm while I am gone.
> So I'm happy to commit them today or tomorrow if you don't mind being
> responsible for anything that blows up while I'm gone, or I'm happy to
> have you commit them whenever you want, but barring one of those
> outcomes, I'm going to deal with it on or after March 17th.

Sure, I will backstop you if you want to push and run.  There's
at least one nearby thread where some interest has been expressed in
using this (the postgres_fdw include-the-remote-EXPLAIN business).
So I think there is value in having it in the tree sooner.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 6, 2025 at 4:16 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> I find a good deal of attraction in getting rid of the IDs and
> just using names.  Nor do I believe we need a hash table.
> (1) Surely there will not be so many extensions using this within a
> single EXPLAIN that a simple loop with strcmp()'s isn't good enough.
> (2) The IDs aren't free either; where will an extension keep the
> ID it assigned?  We're trying to move away from global variables.
>
> But, if you're convinced otherwise, the current design is OK.

Interesting. I hadn't even considered just iterating every time to
find the ID, but I agree with you that might be totally fine. As you
say, we're not expecting there to be many extensions here. I can try
coding that up and see how it looks (or you can, if you like).

I don't buy your second point, though. Our globals are probably going
to turn into thread-locals at some point, and we may want to do things
like bundle a bunch of related ones together in a struct, but I don't
see any real hope of getting rid of them altogether, and if we do, the
one integer you need to power this kind of extension will hardly be
the biggest problem.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Peter Eisentraut
Date:
On 06.03.25 21:23, Robert Haas wrote:
> On Wed, Mar 5, 2025 at 4:38 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> v4 has addressed most of my nitpicks, but you still have typedefs
>> for ExplainState in both header files.  My bet is that at least
>> one buildfarm animal will complain about that.  I could be wrong
>> though, maybe all such compilers are in disuse now.
> 
> Ugh, I suck at this, sorry. Adjusted in v5. It's hard to avoid the
> conclusion that our IWYU configuration must be fairly lenient, because
> every change seems to surface more source files that are depending on
> indirect includes.

Just to clarify this:  Nobody has gone through and used IWYU to clean up 
indirect includes, as you appear to imagine here.  My recent IWYU work 
was, besides putting some infrastructure in place, to clean up includes 
that are completely unneeded.  Indirect includes cleanup is a different 
project that is not currently happening, AFAIK.

Also, benign typedef redefinitions are a C11 feature.  In practice, all 
compilers currently in play support it, and the only problem you'll get 
is from the buildfarm members that are explicitly set up to warn about 
accidental C11 use.  We could probably have a discussion about that, but 
for this patch set, it's probably better to just deal with the status quo.




Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 6, 2025 at 4:24 PM Robert Haas <robertmhaas@gmail.com> wrote:
> On Thu, Mar 6, 2025 at 4:16 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> > I find a good deal of attraction in getting rid of the IDs and
> > just using names.  Nor do I believe we need a hash table.
> > (1) Surely there will not be so many extensions using this within a
> > single EXPLAIN that a simple loop with strcmp()'s isn't good enough.
> > (2) The IDs aren't free either; where will an extension keep the
> > ID it assigned?  We're trying to move away from global variables.
> >
> > But, if you're convinced otherwise, the current design is OK.
>
> Interesting. I hadn't even considered just iterating every time to
> find the ID, but I agree with you that might be totally fine. As you
> say, we're not expecting there to be many extensions here. I can try
> coding that up and see how it looks (or you can, if you like).

Here's v6, doing it that way. I found that the simplest thing to do
was just push the call to GetExplainExtensionId() inside
Get/SetExplainExtensionState(). With this approach, the backend-scope
IDs still exist, but they are private to explain_state.c. An alternate
design could be to make each individual ExplainState have its own list
of extension names alongside its own list of opaque pointers, so that
the IDs become ExplainState-scoped rather than backend-scoped. At the
moment, that seems to me to be just deciding to make the code more
complicated for no obvious benefit, but maybe I'm missing something.
At any rate, my overall conclusion here is that this is giving up a
probably-insignificant amount of performance for an
also-not-terribly-significant abstraction improvement, so I find it a
little hard to get excited about it one way or the other, but it's
fine.

Tom, what do you think?

Thanks,

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Fri, Mar 7, 2025 at 9:38 AM Peter Eisentraut <peter@eisentraut.org> wrote:
> Just to clarify this:  Nobody has gone through and used IWYU to clean up
> indirect includes, as you appear to imagine here.  My recent IWYU work
> was, besides putting some infrastructure in place, to clean up includes
> that are completely unneeded.  Indirect includes cleanup is a different
> project that is not currently happening, AFAIK.

OK, thanks. I wonder whether that's a good use of effort or just not
worth worrying about.

> Also, benign typedef redefinitions are a C11 feature.  In practice, all
> compilers currently in play support it, and the only problem you'll get
> is from the buildfarm members that are explicitly set up to warn about
> accidental C11 use.  We could probably have a discussion about that, but
> for this patch set, it's probably better to just deal with the status quo.

Agreed. +1 for having a discussion at some point, though, because the
effect of the current rules seems to be that you have to write "struct
BananaSplit *" in a bunch of places instead of just 'BananaSplit *" to
avoid redefining the typedef. That's worth doing it if it solves a
real problem, but if compilers where it is a real problem are extinct
in the wild, then I think I would prefer not to have to add the
"struct" keyword in a bunch of places just for compliance with
historical compiler behavior.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> On Fri, Mar 7, 2025 at 9:38 AM Peter Eisentraut <peter@eisentraut.org> wrote:
>> Also, benign typedef redefinitions are a C11 feature.  In practice, all
>> compilers currently in play support it, and the only problem you'll get
>> is from the buildfarm members that are explicitly set up to warn about
>> accidental C11 use.  We could probably have a discussion about that, but
>> for this patch set, it's probably better to just deal with the status quo.

> Agreed. +1 for having a discussion at some point, though, because the
> effect of the current rules seems to be that you have to write "struct
> BananaSplit *" in a bunch of places instead of just 'BananaSplit *" to
> avoid redefining the typedef.

I'd be +1 if there's a way to allow that particular thing without
thereby opening the floodgates to every other C11 feature.  I expect
not all of C11 is universal yet, so I think the buildfarm animals
that are using -std=gnu99 are mostly doing us a service.  But yeah,
this particular thing is a pain in the rear.

            regards, tom lane



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> Here's v6, doing it that way. I found that the simplest thing to do
> was just push the call to GetExplainExtensionId() inside
> Get/SetExplainExtensionState().

Fair enough.

> Tom, what do you think?

The header comments for explain_state.c could use a spellcheck.
I noticed s/offests/offsets/ and s/direcrtly/directly/.
Other than that, I think 0001 and 0002 are good to go.
I've still not looked at 0003.

            regards, tom lane



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
Hi,


>>> EXPLAIN output. It wouldn't make sense for core to have an EXPLAIN
>>> option whose whole purpose is to cater to the needs of some extension,
>>> so that made me think of providing some extensibility infrastructure.

>> Making EXPLAIN extensible sounds like a good idea.. FWIW, There is a
>> discussion [0]
>> for showing FDW remote plans ( postgres_fdw specifically), and I think
>> we  will need to
>> add some new options to EXPLAIN to make that possible.

> Have not looked at your patches, but I will do so now.

Over the past few days I have had a chance to experiment with these
patches and good news is that it has allowed me to extend EXPLAIN
for postrges_fdw to show remote plans. I will share the update and
patch for this in [0], but thought it will be good to share here as well.

postgres=# explain (remote_plans) select * from t_r1, t1_r1;
                                 QUERY PLAN
----------------------------------------------------------------------------
 Nested Loop  (cost=200.00..83272.80 rows=6553600 width=16)
   Plan Node ID: 0
   ->  Foreign Scan on t_r1  (cost=100.00..673.20 rows=2560 width=8)
         Plan Node ID: 1
   ->  Materialize  (cost=100.00..686.00 rows=2560 width=8)
         Plan Node ID: 2
         ->  Foreign Scan on t1_r1  (cost=100.00..673.20 rows=2560 width=8)
               Plan Node ID: 3
 Remote Plans:
       Seq Scan on t  (cost=0.00..32.60 rows=2260 width=8)
       Statement Name: Plan Node ID = 1

       Seq Scan on t1  (cost=0.00..32.60 rows=2260 width=8)
       Statement Name: Plan Node ID = 3
(14 rows)

I do have some findings/suggestions:

1/ As you can see form the output above, I used explain_per_node_hook
to append a "Plan Node ID" to the explain output. I really don't like having it
there, and prefer that it gets added to the top line of the node.

i.e.
   ->  Foreign Scan on t_r1  (cost=100.00..673.20 rows=2560 width=8) (node_id=1)
   ->  Materialize  (cost=100.00..686.00 rows=2560 width=8) (node_id=2)
         ->  Foreign Scan on t1_r1  (cost=100.00..673.20 rows=2560
width=8) (node_id=3)

Can we add a hook at that point [1] which will allow an extension to modify
the first line of a node? I think this is not just useful for my case, but also
for other use-cases in which some high level node details could be placed.
what do you think?

2/ I registered an options handler, and I wanted this options handler to
validate that my new extension option is not used with the analyze
option.

So, the behavior is if the core explain option was first in the list, it worked,
but if it was first in the list it does not.

postgres=# explain (analyze, remote_plans) select from t_r1, t1_r1;
ERROR:  EXPLAIN options REMOTE_PLANS and ANALYZE cannot be used together

postgres=# explain (remote_plans, analyze) select from t_r1, t1_r1;
                                                       QUERY PLAN

-------------------------------------------------------------------------------------------------------------------------
 Nested Loop  (cost=200.00..147337.37 rows=11648569 width=0) (actual
time=3.222..3.236 rows=0.00 loops=1)
 ...
....

This is because the ApplyExtensionExplainOption is called inside the main
loop that parses the options, so when ApplyExtensionExplainOption, the core
option may or may not have been set yet. I also think this will break if there
are multiple extension options that need to be validated together.

One way I thought to fix this is to allow the user to register another handler
for validation, which can then be called after the parse option loop, and after
all the in-core options have been validated against each other. Right after
this line.

+    /* if the summary was not set explicitly, set default value */
+    es->summary = (summary_set) ? es->summary : es->analyze;

What do you think?


[0]
https://www.postgresql.org/message-id/flat/CAP%2BB4TD%3Diy-C2EnsrJgjpwSc7_4pd3Xh-gFzA0bwsw3q8u860g%40mail.gmail.com
[1] https://github.com/postgres/postgres/blob/master/src/backend/commands/explain.c#L2013

Thanks

--
Sami Imseih
Amazon Web Services (AWS)



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Sami Imseih <samimseih@gmail.com> writes:
> 1/ As you can see form the output above, I used explain_per_node_hook
> to append a "Plan Node ID" to the explain output. I really don't like having it
> there, and prefer that it gets added to the top line of the node.

> i.e.
>    ->  Foreign Scan on t_r1  (cost=100.00..673.20 rows=2560 width=8) (node_id=1)
>    ->  Materialize  (cost=100.00..686.00 rows=2560 width=8) (node_id=2)
>          ->  Foreign Scan on t1_r1  (cost=100.00..673.20 rows=2560
> width=8) (node_id=3)

> Can we add a hook at that point [1] which will allow an extension to modify
> the first line of a node? I think this is not just useful for my case, but also
> for other use-cases in which some high level node details could be placed.
> what do you think?

I think this is a seriously bad idea.  The first line is already
overloaded; we don't need several different extensions adding more
stuff to it.  Plus, this doesn't consider what to do in non-text
output formats.  ISTM you should be good with adding a new
"Plan Node ID" property to each node.  No, you don't get to put it
first.  Too bad.  (If we did try to cater to that, what shall we do
with multiple extensions that all think they should be first?)

The validation point is an interesting one.  I agree that we don't
want the behavior to depend on the order in which options are
written.

            regards, tom lane



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> I think this is a seriously bad idea.  The first line is already
> overloaded; we don't need several different extensions adding more
> stuff to it.

Fair enough.

> Plus, this doesn't consider what to do in non-text
> output formats.

the hook will be a no-op for non-text formats, which is not
desirable behavior. I get that also.

I have no strong feelings for this, but wanted to see what
others think.

thanks!

--
Sami Imseih



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> The validation point is an interesting one.  I agree that we don't
> want the behavior to depend on the order in which options are
> written.

Here is what I applied on top of v6-0001 to correct this issue. Attaching it
as a text file only as Robert may have a different opinion on how to fix
this.

I felt the best way is to create another handler for registering a validation
function. This means we have to loop through the options list twice,
but I don't think that is a problem.

postgres=# explain (remote_plans, analyze) select * from t_r1;
ERROR:  EXPLAIN options REMOTE_PLANS and ANALYZE cannot be used together
postgres=# explain (analyze, remote_plans) select * from t_r1;
ERROR:  EXPLAIN options REMOTE_PLANS and ANALYZE cannot be used together

Regards,

Sami

Attachment

Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 3/12/25 20:58, Sami Imseih wrote:
>> I think this is a seriously bad idea.  The first line is already
>> overloaded; we don't need several different extensions adding more
>> stuff to it.
> 
> Fair enough.
> 
>> Plus, this doesn't consider what to do in non-text
>> output formats.
> 
> the hook will be a no-op for non-text formats, which is not
> desirable behavior. I get that also.
> 
> I have no strong feelings for this, but wanted to see what
> others think.
I'm against it. For me, the best model there is to allow extensions to 
add something and nothing more. If it wants to change the core explain 
code - use ExplainOneQuery_hook instead.
The reason here is to reduce possible competition among extensions.

I already have troubles with conflict on queryid modifications and 
potential conflict in the planner_hook - if someone invents another 
extension that will provide a plan tree. So, it would be better to 
reduce conflicts whenever possible.

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 13, 2025 at 5:52 PM Sami Imseih <samimseih@gmail.com> wrote:
> > The validation point is an interesting one.  I agree that we don't
> > want the behavior to depend on the order in which options are
> > written.
>
> Here is what I applied on top of v6-0001 to correct this issue. Attaching it
> as a text file only as Robert may have a different opinion on how to fix
> this.
>
> I felt the best way is to create another handler for registering a validation
> function. This means we have to loop through the options list twice,
> but I don't think that is a problem.

Hmm. I thought about this at some stage in the development of these
patches, but nothing made it into the final version. I agree we should
probably do something about it, but I don't really like the idea of
calling the second hook "validate" as you've done here, because most
validation can and should be done in the regular apply hook. You only
need the second hook if you want to check the values of options
against the values of other options.

I wonder if we should just add a "plain" hook to the bottom of
ParseExplainOptionList, like post_parse_explain_options_list_hook or
something, and then EXPLAIN options that don't need this kind of
validation can just do nothing and those that do can do the usual
dance to add themselves to the hook. We could also keep it as you have
it here, with an extra handler that is called per option, but what if
some loadable module adds two new options LEFT and RIGHT and wants to
check that you don't specify LEFT and RIGHT together? Either they
register the same validate handler for both, or they register the real
validate handler for one and a no-op handler for the other. Neither of
those options seems very appealing.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> You only
> need the second hook if you want to check the values of options
> against the values of other options.
+1

I did not think of adding a new hook, because there must be a really good
reason to add a new hook. I think it's justified for this case. It's better than
my approach since the extension author can just put all their checks in one
place rather than having to register a bunch of handlers.

> some loadable module adds two new options LEFT and RIGHT and wants to
> check that you don't specify LEFT and RIGHT together? Either they
> register the same validate handler for both, or they register the real
> validate handler for one and a no-op handler for the other. Neither of
> those options seems very appealing.

When I thought about this, I figured that one of the options will register
a validate handler and the other option will set the handler to NULL. But,
I do see why this is not appealing.

--
Sami



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 3/7/25 16:05, Robert Haas wrote:
I have attempted to use hooks, proposed in 0002, in my extensions.
At first, it worked great. My patch reduced a lot, and the only things 
that I need in the planner to improve its predictions are the 
selectivity hook and the create_plan hook - the last one needed to pass 
data from the best_path chosen to the plan and needed to compare 
prediction and reality at the end of execution.

Some questions:
1. I think, hooks ExplainOneQuery_hook_type, explain_per_plan_hook_type, 
explain_per_node_hook_type deserve to be moved to explain_format.h
At least, inside the hook, we usually use functions like ExplainProperty.
2. In my patch I inserted the hook before the line 1894:
/* in text format, the first line ends here */
Why have you chosen a different way? I don't have specific reasons to 
insist, except the extension data right under the node looks better to 
me personally.

Some changes of the explain format have already been made in the current 
master. Applying this proposal now would reduce additional work for 
extension (and fork) maintainers in the next version.

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Mon, Mar 17, 2025 at 11:54 PM Sami Imseih <samimseih@gmail.com> wrote:
> +1
>
> I did not think of adding a new hook, because there must be a really good
> reason to add a new hook. I think it's justified for this case. It's better than
> my approach since the extension author can just put all their checks in one
> place rather than having to register a bunch of handlers.

Do you want to propose a patch?

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 18, 2025 at 8:02 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> Some questions:
> 1. I think, hooks ExplainOneQuery_hook_type, explain_per_plan_hook_type,
> explain_per_node_hook_type deserve to be moved to explain_format.h
> At least, inside the hook, we usually use functions like ExplainProperty.

-1, because the hooks will be called from explain.c, not explain_state.c.

> 2. In my patch I inserted the hook before the line 1894:
> /* in text format, the first line ends here */
> Why have you chosen a different way? I don't have specific reasons to
> insist, except the extension data right under the node looks better to
> me personally.

Tom discusses why we shouldn't try to add to the first line in
http://postgr.es/m/2234935.1741809008@sss.pgh.pa.us and I'm fully in
agreement.

I've committed 0001 and 0002 for now. The additional hook for
cross-option validation can be added in a separate commit. v6-0003,
now v7-0001, needs more substantive review before commit. I hope it
gets some, and soon.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 3/18/25 14:33, Robert Haas wrote:
> On Tue, Mar 18, 2025 at 8:02 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
>> Some questions:
>> 1. I think, hooks ExplainOneQuery_hook_type, explain_per_plan_hook_type,
>> explain_per_node_hook_type deserve to be moved to explain_format.h
>> At least, inside the hook, we usually use functions like ExplainProperty.
> 
> -1, because the hooks will be called from explain.c, not explain_state.c.
Ok.
> 
>> 2. In my patch I inserted the hook before the line 1894:
>> /* in text format, the first line ends here */
>> Why have you chosen a different way? I don't have specific reasons to
>> insist, except the extension data right under the node looks better to
>> me personally.
> 
> Tom discusses why we shouldn't try to add to the first line in
> http://postgr.es/m/2234935.1741809008@sss.pgh.pa.us and I'm fully in
> agreement.
I agree with him, too. But, as you can see, I proposed not changing the 
first string or adding something there but just putting extension data 
under that line. Extra information about workers' state (not so 
important most of the time, I should say) sometimes makes it difficult 
to read.

> 
> I've committed 0001 and 0002 for now. The additional hook for
> cross-option validation can be added in a separate commit. v6-0003,
> now v7-0001, needs more substantive review before commit. I hope it
> gets some, and soon.
Ok, I am ready to review it thoroughly, if needed.

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 18, 2025 at 9:58 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> I agree with him, too. But, as you can see, I proposed not changing the
> first string or adding something there but just putting extension data
> under that line. Extra information about workers' state (not so
> important most of the time, I should say) sometimes makes it difficult
> to read.

OK, I wasn't sure if you meant just before or just after emitting the
end-of-first-line newline.

If you mean just after, that would amount to deciding that information
coming from extensions goes before information from core rather than
after. I thought about that and it's defensible, but in the end I
thought it made more sense for core info to come first. We could
bikeshed this endlessly, but there's no single choice that's going to
make everybody 100% happy, and adding a whole bunch of extra hooks to
cater to various preferences about exactly how the output should look
does not seem worth it to me.

> > I've committed 0001 and 0002 for now. The additional hook for
> > cross-option validation can be added in a separate commit. v6-0003,
> > now v7-0001, needs more substantive review before commit. I hope it
> > gets some, and soon.
> Ok, I am ready to review it thoroughly, if needed.

Yeah, I don't know if Tom is going to jump in here, but if we want to
get something into this release we don't have much time. I personally
think this is good enough that it would be better to have it than
nothing and we can always change stuff later. I'm not going to be too
sympathetic if somebody complains about a backward compatibility break
for pg_overexplain; it's labelled as a developer tool that shows
information about internals. That said, I don't want to ship something
and then have everybody say "what is this trash?".

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> Do you want to propose a patch?

yes, will attach a patch shortly.

--
Sami



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> If you mean just after, that would amount to deciding that information
> coming from extensions goes before information from core rather than
> after. I thought about that and it's defensible, but in the end I
> thought it made more sense for core info to come first. We could
> bikeshed this endlessly, but there's no single choice that's going to
> make everybody 100% happy, and adding a whole bunch of extra hooks to
> cater to various preferences about exactly how the output should look
> does not seem worth it to me.

FWIW, I am fairly strongly against that.  Every extension author is
going to feel that their information is so important it should come
first.  Other people might have a different opinion about that, and
in any case they can't all be first.

There's certainly room for bikeshedding here, but I think it'd be good
to get some actual experience before redesigning what you've done.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 18, 2025 at 2:40 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> FWIW, I am fairly strongly against that.  Every extension author is
> going to feel that their information is so important it should come
> first.  Other people might have a different opinion about that, and
> in any case they can't all be first.

Yep.

> There's certainly room for bikeshedding here, but I think it'd be good
> to get some actual experience before redesigning what you've done.

I think so, too, although obviously I'm biased.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> > Do you want to propose a patch?
>
> yes, will attach a patch shortly.

Attached is a patch to add a hook to allow extensions
to add additional option validations. The hook takes
in the ExplainState as an argument and returns void.
It is expected the extension will raise an error if the
validation fails.

--
Sami Imseih
Amazon Web Services (AWS)

Attachment

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Tue, Mar 18, 2025 at 11:21 PM Sami Imseih <samimseih@gmail.com> wrote:
> > > Do you want to propose a patch?
> >
> > yes, will attach a patch shortly.
>
> Attached is a patch to add a hook to allow extensions
> to add additional option validations. The hook takes
> in the ExplainState as an argument and returns void.
> It is expected the extension will raise an error if the
> validation fails.

Since the new hook will be called from explain_state.c, please declare
it in explain_state.h.

I think the header-file comment should be just one line rather than a
block comment as we do for other hooks in this area. If you think a
more detailed explanation is needed, I'd put that in the .c file.

Is there any value in passing "options" or "pstate" to the hook?
Couldn't the hook want to use "pstate" for error reporting purposes?

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> On Tue, Mar 18, 2025 at 11:21 PM Sami Imseih <samimseih@gmail.com> wrote:
> > > > Do you want to propose a patch?
> > >
> > > yes, will attach a patch shortly.
> >
> > Attached is a patch to add a hook to allow extensions
> > to add additional option validations. The hook takes
> > in the ExplainState as an argument and returns void.
> > It is expected the extension will raise an error if the
> > validation fails.
>
> Since the new hook will be called from explain_state.c, please declare
> it in explain_state.h.

done

> I think the header-file comment should be just one line rather than a
> block comment as we do for other hooks in this area. If you think a
> more detailed explanation is needed, I'd put that in the .c file.

on second thought, no reason for the detailed explanation...

> Is there any value in passing "options" or "pstate" to the hook?
> Couldn't the hook want to use "pstate" for error reporting purposes?

... as I made the hook signature match that of
ParseExplainOptionList, so both pstate and the options list
are now available to the hook.

--

Sami Imseih
Amazon Web Services (AWS)

Attachment

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 19, 2025 at 11:38 AM Sami Imseih <samimseih@gmail.com> wrote:
> ... as I made the hook signature match that of
> ParseExplainOptionList, so both pstate and the options list
> are now available to the hook.

This version LGTM, except it's not pgindent-clean.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> On Wed, Mar 19, 2025 at 11:38 AM Sami Imseih <samimseih@gmail.com> wrote:
> > ... as I made the hook signature match that of
> > ParseExplainOptionList, so both pstate and the options list
> > are now available to the hook.
>
> This version LGTM, except it's not pgindent-clean.

ugh, sorry about that. ran it for both modified files and it found one
indentation correction from v2.

> index 1d4be3c18ac..8fe1ca5c73e 100644
38c38
< +        (*explain_validate_options_hook) (es, options, pstate);
---
> +        (*explain_validate_options_hook)(es, options, pstate);
59c59
< 2.47.1

v3 attached.


--
Sami Imseih
Amazon Web Services (AWS)

Attachment

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 19, 2025 at 1:41 PM Sami Imseih <samimseih@gmail.com> wrote:
> ugh, sorry about that. ran it for both modified files and it found one
> indentation correction from v2.

Yeah, that's the one I noticed. I think this should be uncontroversial
but I'll give it a day or two in case anyone wants to complain.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 19/3/2025 18:41, Sami Imseih wrote:
>> On Wed, Mar 19, 2025 at 11:38 AM Sami Imseih <samimseih@gmail.com> wrote:
>>> ... as I made the hook signature match that of
>>> ParseExplainOptionList, so both pstate and the options list
>>> are now available to the hook.
>>
>> This version LGTM, except it's not pgindent-clean.
> 
> ugh, sorry about that. ran it for both modified files and it found one
> indentation correction from v2.
> 
>> index 1d4be3c18ac..8fe1ca5c73e 100644
> 38c38
> < +        (*explain_validate_options_hook) (es, options, pstate);
> ---
>> +        (*explain_validate_options_hook)(es, options, pstate);
> 59c59
> < 2.47.1
> 
> v3 attached.
I want to criticize this patch a bit. ;) Why do you think this hook is 
not redundant? May we avoid it at all? It seems to perform a too-narrow 
task.
For example, if you want to separate parameters that may be used on a 
plain EXPLAIN and on EXPLAIN ANALYZE to get behaviour like the following:

EXPLAIN (BUFFERS ON, TIMING ON) SELECT * FROM pg_class;
ERROR: EXPLAIN option TIMING requires ANALYZE

It would be better to add the parameter "type: EXPLAIN_ONLY | 
ANALYZE_ONLY | BOTH" to the RegisterExtensionExplainOption() routine. 
This value will be saved inside the ExplainExtensionOption structure and 
processed by the core inside the ParseExplainOptionList.

 > ERROR:  EXPLAIN options REMOTE_PLANS and ANALYZE cannot be used
 > together
I think the additional parameter covers the case you provided, isn't it?

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> Why do you think this hook is not redundant?
what is it redundant with?

> It would be better to add the parameter "type: EXPLAIN_ONLY |
> ANALYZE_ONLY | BOTH" to the RegisterExtensionExplainOption() routine.
> This value will be saved inside the ExplainExtensionOption structure and
> processed by the core inside the ParseExplainOptionList.

hmm, IIUC, what you are describing is flag that will be limited to
only check if an option can be used with EXPLAIN_ONLY, ANALYZE_ONLY
or both. But what about if I have a case to check against between other
extension options? let's say ExtensionAOptionA and ExtensionAoptionB.
How would that work with the way you are suggesting?

--
Sami Imseih
Amazon Web Services (AWS)



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Sami Imseih <samimseih@gmail.com> writes:
>> It would be better to add the parameter "type: EXPLAIN_ONLY |
>> ANALYZE_ONLY | BOTH" to the RegisterExtensionExplainOption() routine.
>> This value will be saved inside the ExplainExtensionOption structure and
>> processed by the core inside the ParseExplainOptionList.

> hmm, IIUC, what you are describing is flag that will be limited to
> only check if an option can be used with EXPLAIN_ONLY, ANALYZE_ONLY
> or both. But what about if I have a case to check against between other
> extension options? let's say ExtensionAOptionA and ExtensionAoptionB.
> How would that work with the way you are suggesting?

More generally, I think that's quite a constricted view of what sorts
of checks an extension might need to make.  As an example, suppose
an extension's option conflicts with "GENERIC_PLAN on" for some reason
(perhaps it wants to modify the plan).

I think the current hook proposal is fine.  It leaves all the
validation work to be done by the extension(s), sure, but the amount
of code required to handle that one case is tiny, and the door is
left open to handle cases we didn't foresee.

            regards, tom lane



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 19/3/2025 21:51, Sami Imseih wrote:
>> Why do you think this hook is not redundant?
> what is it redundant with?
> 
>> It would be better to add the parameter "type: EXPLAIN_ONLY |
>> ANALYZE_ONLY | BOTH" to the RegisterExtensionExplainOption() routine.
>> This value will be saved inside the ExplainExtensionOption structure and
>> processed by the core inside the ParseExplainOptionList.
> 
> hmm, IIUC, what you are describing is flag that will be limited to
> only check if an option can be used with EXPLAIN_ONLY, ANALYZE_ONLY
> or both. But what about if I have a case to check against between other
> extension options? let's say ExtensionAOptionA and ExtensionAoptionB.
> How would that work with the way you are suggesting?
That makes sense. It would be more effective to include the meaningful 
example in Robert's extension for v7-0001.

I'm sorry, I was confused; previously, the difficulties faced by 
extension developers were always attributed to him (refer to the 
discussion on the selectivity hook). However, now you're introducing a 
hook for a trivial operation that could simply be resolved at the end of 
execution within a per-node hook with tiny inconsistency in output. I'm 
pleased to see a change in the narrative.

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
>
> On 19/3/2025 21:51, Sami Imseih wrote:
> >> Why do you think this hook is not redundant?
> > what is it redundant with?
> >
> >> It would be better to add the parameter "type: EXPLAIN_ONLY |
> >> ANALYZE_ONLY | BOTH" to the RegisterExtensionExplainOption() routine.
> >> This value will be saved inside the ExplainExtensionOption structure and
> >> processed by the core inside the ParseExplainOptionList.
> >
> > hmm, IIUC, what you are describing is flag that will be limited to
> > only check if an option can be used with EXPLAIN_ONLY, ANALYZE_ONLY
> > or both. But what about if I have a case to check against between other
> > extension options? let's say ExtensionAOptionA and ExtensionAoptionB.
> > How would that work with the way you are suggesting?
> That makes sense. It would be more effective to include the meaningful
> example in Robert's extension for v7-0001.

While it will be good to provide an example in v7-0001, I can't see
any of the pg_overexplain options needing such validation.

> I'm sorry, I was confused; previously, the difficulties faced by
> extension developers were always attributed to him (refer to the
> discussion on the selectivity hook). However, now you're introducing a
> hook for a trivial operation that could simply be resolved at the end of
> execution within a per-node hook with tiny inconsistency in output. I'm
> pleased to see a change in the narrative.

I think you are referring to the idea to put a hook before
/* in text format, the first line ends here */

After considering the comments, I do see that was not a good proposal. You are
correct, and the per-node hook is enough.


--
Sami Imseih
Amazon Web Services (AWS)



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 20, 2025 at 3:04 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> I'm sorry, I was confused; previously, the difficulties faced by
> extension developers were always attributed to him (refer to the
> discussion on the selectivity hook). However, now you're introducing a
> hook for a trivial operation that could simply be resolved at the end of
> execution within a per-node hook with tiny inconsistency in output. I'm
> pleased to see a change in the narrative.

It sounds like we're sufficiently in agreement so I've committed the
patch. I agree with you that this could be done within the per-node
hook, but that would be awkward, and this is better. I take note of
your frustration with the difficulty of getting hooks added and to
some extent I share it; on the flip side, there are plenty of
poorly-considered hook proposals, too.

I've rebased my pg_overexplain patch and attach that here.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> It sounds like we're sufficiently in agreement so I've committed the
> patch.

Thanks!

> I've rebased my pg_overexplain patch and attach that here.

I spent some time playing around with this extension, and I can
see the value.

1/ provides a good example for future extensions on how to
use the new hooks. it definitely would be nice to add an example
for the hook added to check options against each other as mentioned
by Andrei here [0] but I could not find a good reason to do so with
pg_overexplain.

2/ I agree that this is better than debug_print_plan to avoid all the
verbosity of that output emitting at once. But maybe it will be useful
to provide options to show other parts like the target list, quals, etc.
I wonder if something like this will be better?

explain (query_tree_show range_table) select ..
explain (query_tree_show qualification) select ..
explain (query_tree_show target_list) select ..

The user can choose which information they want to see by passing
a value to an option called query_tree_show ( or a better option name)

Initially, we can only support range_table, but future enhancements
could include other parts.

Maybe a good idea, maybe not, what do you think?


Besides that, I found a few minor things:


1/ trailing whitespace

+     <literal>Subplans Needing Rewind</literal>. Integer IDs of subplans that


2/ typos
 * We don't try to print everything here. Information that would be displyed

 * to limit the display length by # of columsn or # of characters, but for



[0] https://www.postgresql.org/message-id/8e8b6f17-e1e6-4b16-84d4-37ded802c787%40gmail.com


--

Sami Imseih
Amazon Web Services (AWS)



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 20, 2025 at 8:37 PM Sami Imseih <samimseih@gmail.com> wrote:
> 1/ provides a good example for future extensions on how to
> use the new hooks. it definitely would be nice to add an example
> for the hook added to check options against each other as mentioned
> by Andrei here [0] but I could not find a good reason to do so with
> pg_overexplain.

Yeah, there may be a good idea lurking somewhere here but I have not
yet come up with it.

> 2/ I agree that this is better than debug_print_plan to avoid all the
> verbosity of that output emitting at once. But maybe it will be useful
> to provide options to show other parts like the target list, quals, etc.
> I wonder if something like this will be better?
>
> explain (query_tree_show range_table) select ..
> explain (query_tree_show qualification) select ..
> explain (query_tree_show target_list) select ..

But why would those options be exclusive of each other? Surely you
could want more than one of those things, which suggests that separate
options are better.

Also, the reason I didn't do the quals or the target list is because
regular EXPLAIN already shows that stuff. There could possibly be some
value in trying to show things about those node trees that regular
EXPLAIN doesn't, but that quickly gets into (a) speculation about what
people actually want to see and/or (b) extremely voluminous output
which is the whole reason debug_print_plan is hard to use in the first
place.

> 1/ trailing whitespace
> 2/ typos

Acknowledged, thanks.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Sami Imseih
Date:
> > explain (query_tree_show range_table) select ..
> > explain (query_tree_show qualification) select ..
> > explain (query_tree_show target_list) select ..
>
> But why would those options be exclusive of each other? Surely you
> could want more than one of those things, which suggests that separate
> options are better.

Yeah, that's a valid point. I guess I was thinking that if we ever
decide to have
an option for every part of the query tree, we will end up with many options
(a half dozen options or so), but not being able to support
"give me these 2 parts only" is not ideal either.

> EXPLAIN doesn't, but that quickly gets into (a) speculation about what
> people actually want to see and/or

Correct. This is why these hooks are useful and people can just
write whatever is useful to them, or if there is a convincing reason,
it can be added into pg_overexplain overtime.

overall this LGTM, and I don't really have other comments.

--
Sami Imseih
Amazon Web Services (AWS)



Re: making EXPLAIN extensible

From
Tom Lane
Date:
Sami Imseih <samimseih@gmail.com> writes:
> overall this LGTM, and I don't really have other comments.

I took a look through v8, and have just a couple of trivial nits:

+static void overexplain_debug_handler(ExplainState *, DefElem *,
+                                      ParseState *);
+static void overexplain_range_table_handler(ExplainState *, DefElem *,
+                                            ParseState *);

I cordially hate this parameter-name-less style of function
declaration, and consider it Stroustrup's single worst idea in C++.
It rests on the assumption that parameter names convey zero
information, which is wrongheaded for any function more complex than,
say, addition.  Also, even if you like this style, clang-tidy probably
won't (cf for example 035ce1feb).

pgoverexplain.sgml seems to have been formatted to fit in about an
85-column window.  Please don't do that --- you might as well have
made it 99 columns wide or any other random number, it still looks
like heck in 80 columns.

Other than that, there's room to debate exactly what to show.
But as long as we're agreed that we won't hold this module to
high cross-version compatibility standards, that doesn't seem
like a problem.  I'm okay with this as a starting point.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Fri, Mar 21, 2025 at 2:37 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> I took a look through v8, and have just a couple of trivial nits:

Thank you!

> +static void overexplain_debug_handler(ExplainState *, DefElem *,
> +                                      ParseState *);
> +static void overexplain_range_table_handler(ExplainState *, DefElem *,
> +                                            ParseState *);
>
> I cordially hate this parameter-name-less style of function
> declaration, and consider it Stroustrup's single worst idea in C++.
> It rests on the assumption that parameter names convey zero
> information, which is wrongheaded for any function more complex than,
> say, addition.  Also, even if you like this style, clang-tidy probably
> won't (cf for example 035ce1feb).

I don't hate the style but I didn't use it intentionally. Fixed now, I hope.

> pgoverexplain.sgml seems to have been formatted to fit in about an
> 85-column window.  Please don't do that --- you might as well have
> made it 99 columns wide or any other random number, it still looks
> like heck in 80 columns.

Oops. Fixed. Also, I tried to disclaim stability and comprehensibility
a little better and fixed a dumb mistake in the page title.

> Other than that, there's room to debate exactly what to show.
> But as long as we're agreed that we won't hold this module to
> high cross-version compatibility standards, that doesn't seem
> like a problem.  I'm okay with this as a starting point.

Great news, thanks again!

Here's v9, which also adds 'SET debug_parallel_query = off' to the
pg_overexplain tests, per CI, because the test results are not (and
cannot realistically be made) stable under under that option.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 3/21/25 20:54, Robert Haas wrote:
> On Fri, Mar 21, 2025 at 2:37 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Here's v9, which also adds 'SET debug_parallel_query = off' to the
> pg_overexplain tests, per CI, because the test results are not (and
> cannot realistically be made) stable under under that option.
I skimmed through the code and tested how it works.
It looks good and has no apparent architectural dependencies.
But I haven't scrutinised it line-by-line and do not intend to do so.
I wanna say I hate the design of this module. Having a strong necessity 
for extra explain tools (in the daily routine, all I have is the only 
single explain analyse verbose output to find out planner/executor bug, 
reproduce it and make a patch), I don't see a single case when I would 
use this module. It adds a burden to fix its output on a node change 
(you don't care, but it adds work to Postgres fork maintainers, too, for 
nothing). Also, it breaks my understanding of the principles of the 
Postgres code design - to start the discussion on how to show more, we 
need only the bare minimum of code and output lines.
In my opinion, it should show as few parameters as possible to 
demonstrate principles and test the code on a single planner node. It 
only deserves src/test/modules because it is not helpful for a broad 
audience.

-- 
regards, Andrei Lepikhov



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Sat, Mar 22, 2025 at 4:46 AM Andrei Lepikhov <lepihov@gmail.com> wrote:
> I skimmed through the code and tested how it works.
> It looks good and has no apparent architectural dependencies.
> But I haven't scrutinised it line-by-line and do not intend to do so.
> I wanna say I hate the design of this module. Having a strong necessity
> for extra explain tools (in the daily routine, all I have is the only
> single explain analyse verbose output to find out planner/executor bug,
> reproduce it and make a patch), I don't see a single case when I would
> use this module. It adds a burden to fix its output on a node change
> (you don't care, but it adds work to Postgres fork maintainers, too, for
> nothing). Also, it breaks my understanding of the principles of the
> Postgres code design - to start the discussion on how to show more, we
> need only the bare minimum of code and output lines.
> In my opinion, it should show as few parameters as possible to
> demonstrate principles and test the code on a single planner node. It
> only deserves src/test/modules because it is not helpful for a broad
> audience.

Gee, thanks for the ringing endorsement. :-)

I think *I* will use it pretty regularly; I already have. In my
experience, using debug_query_plan for this sort of thing sucks quite
a lot. Finding the information you need in the output takes a long
time because the output is quite long. This is much more
understandable, at least for me. I agree with you that a trivial test
module could demonstrate that the hook works, but a trivial example
would not demonstrate that the hook can be used to do something
actually useful. It sounds like what I've written also fails the
"actually useful" test for you, but it doesn't for me. I'm not going
to insist on shipping this if I'm the ONLY one who would ever get any
use out of it, but I doubt that's the case.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Sat, Mar 22, 2025 at 12:10 PM Robert Haas <robertmhaas@gmail.com> wrote:
> I'm not going
> to insist on shipping this if I'm the ONLY one who would ever get any
> use out of it, but I doubt that's the case.

Hearing no other votes against this, I have committed it, but now I
wonder if that is going to break the buildfarm, because I just noticed
that the changes I made in v9 seem not to have resolved the problem
with debug_parallel_query, for reasons I do not yet understand.
Investigating...

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 26, 2025 at 2:09 PM Robert Haas <robertmhaas@gmail.com> wrote:
> Hearing no other votes against this, I have committed it, but now I
> wonder if that is going to break the buildfarm, because I just noticed
> that the changes I made in v9 seem not to have resolved the problem
> with debug_parallel_query, for reasons I do not yet understand.
> Investigating...

I pushed a couple of fixes for this, but I still see one failure, on drongo:

diff --strip-trailing-cr -U3
C:/prog/bf/root/HEAD/pgsql/contrib/pg_overexplain/expected/pg_overexplain.out
C:/prog/bf/root/HEAD/pgsql.build/testrun/pg_overexplain/regress/results/pg_overexplain.out
--- C:/prog/bf/root/HEAD/pgsql/contrib/pg_overexplain/expected/pg_overexplain.out
2025-03-26 20:33:47.834896600 +0000
+++ C:/prog/bf/root/HEAD/pgsql.build/testrun/pg_overexplain/regress/results/pg_overexplain.out
2025-03-26 21:57:55.332431300 +0000
@@ -419,7 +419,7 @@
                Plan Node ID: 5
                extParam: 0
                allParam: 0
-         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
rows=0.12 loops=8)
+         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
rows=0.13 loops=8)
                Index Cond: (id = v1.id)
                Index Searches: 8
                Disabled Nodes: 0

As some of you will recall, I hate floating point arithmetic with a
fiery passion, but I'm still surprised to learn that 1/8 rounded to
two decimal places does not produce the same answer on all modern
systems. It's not even like this is some weird architecture -- it's
listed as x86_64. I'll look into a fix for this in the morning.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Michael Paquier
Date:
On Wed, Mar 26, 2025 at 02:09:32PM -0400, Robert Haas wrote:
> On Sat, Mar 22, 2025 at 12:10 PM Robert Haas <robertmhaas@gmail.com> wrote:
> > I'm not going
> > to insist on shipping this if I'm the ONLY one who would ever get any
> > use out of it, but I doubt that's the case.
>
> Hearing no other votes against this, I have committed it, but now I
> wonder if that is going to break the buildfarm, because I just noticed
> that the changes I made in v9 seem not to have resolved the problem
> with debug_parallel_query, for reasons I do not yet understand.
> Investigating...

There's at least one buildfarm member complaining with the tests of
pg_overexplain, not for the reason you are mentioning here:
https://buildfarm.postgresql.org/cgi-bin/show_log.pl?nm=drongo&dt=2025-03-26%2020%3A33%3A20
--
Michael

Attachment

Re: making EXPLAIN extensible

From
Tom Lane
Date:
Robert Haas <robertmhaas@gmail.com> writes:
> -         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
> rows=0.12 loops=8)
> +         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
> rows=0.13 loops=8)

Even if this had not failed in the buildfarm, it would certainly
have caused problems for somebody.  We have a very strong rule
that you *do not* make regression test results dependent on exact
computed cost or rowcount values, because those are inherently
platform-dependent.  I do not understand how you thought that
pg_overexplain might be exempt from that problem.

            regards, tom lane



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Wed, Mar 26, 2025 at 10:40 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Robert Haas <robertmhaas@gmail.com> writes:
> > -         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
> > rows=0.12 loops=8)
> > +         ->  Index Scan using daucus_id_idx on daucus v2_2 (actual
> > rows=0.13 loops=8)
>
> Even if this had not failed in the buildfarm, it would certainly
> have caused problems for somebody. We have a very strong rule
> that you *do not* make regression test results dependent on exact
> computed cost or rowcount values, because those are inherently
> platform-dependent.  I do not understand how you thought that
> pg_overexplain might be exempt from that problem.

Is this rule written down someplace?

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Andrei Lepikhov
Date:
On 3/26/25 19:09, Robert Haas wrote:
> On Sat, Mar 22, 2025 at 12:10 PM Robert Haas <robertmhaas@gmail.com> wrote:
>> I'm not going
>> to insist on shipping this if I'm the ONLY one who would ever get any
>> use out of it, but I doubt that's the case.
> 
> Hearing no other votes against this, I have committed it, but now I
> wonder if that is going to break the buildfarm, because I just noticed
> that the changes I made in v9 seem not to have resolved the problem
> with debug_parallel_query, for reasons I do not yet understand.
> Investigating...
I’m afraid to sound like a bore, but I think pg_overexplain should 
include a call into the hook call chain (see attachment). Who knows, 
maybe this extension will be used someday in production?

-- 
regards, Andrei Lepikhov
Attachment

Re: making EXPLAIN extensible

From
Man Zeng
Date:
I tried it out and felt good about this feature.
But I found that the following information is not very complete.

postgres=# select * from pg_get_loaded_modules();
 module_name  | version |     file_name     
--------------+---------+-------------------
              |         | pg_overexplain.so
 auto_explain | 18devel | auto_explain.so
 plpgsql      | 18devel | plpgsql.so
(3 rows)

Some minor changes may be needed.

// PG_MODULE_MAGIC;
PG_MODULE_MAGIC_EXT(
                    .name = "pg_overexplain",
                    .version = PG_VERSION
);

Re: making EXPLAIN extensible

From
Pavel Luzanov
Date:
One more suggestion to improve the documentation.
It lacks installations actions, something like in auto_explain:
To use pg_overexplain, simply load it into the server.
You can load it into an individual session:

LOAD 'pg_overexplain';
(You must be superuser to do that.)
Another way is to preload it into some or all sessions
by including pg_overexplain in session_preload_libraries
or shared_preload_libraries in postgresql.conf. 
-- 
Pavel Luzanov
Postgres Professional: https://postgrespro.com

Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Thu, Mar 27, 2025 at 5:34 PM Andrei Lepikhov <lepihov@gmail.com> wrote:
> I’m afraid to sound like a bore, but I think pg_overexplain should
> include a call into the hook call chain (see attachment). Who knows,
> maybe this extension will be used someday in production?

Oh, bother, that was a silly mistake on my part. Thanks, committed.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Fri, Mar 28, 2025 at 12:13 AM Man Zeng <zengman@halodbtech.com> wrote:
> // PG_MODULE_MAGIC;
> PG_MODULE_MAGIC_EXT(
>                                         .name = "pg_overexplain",
>                                         .version = PG_VERSION
> );

Good catch, committed.

--
Robert Haas
EDB: http://www.enterprisedb.com



Re: making EXPLAIN extensible

From
Robert Haas
Date:
On Fri, Mar 28, 2025 at 5:39 AM Pavel Luzanov <p.luzanov@postgrespro.ru> wrote:
> One more suggestion to improve the documentation.
> It lacks installations actions, something like in auto_explain:
>
> To use pg_overexplain, simply load it into the server.
> You can load it into an individual session:
>
> LOAD 'pg_overexplain';
>
> (You must be superuser to do that.)
> Another way is to preload it into some or all sessions
> by including pg_overexplain in session_preload_libraries
> or shared_preload_libraries in postgresql.conf.

Yeah, I think this could make sense, but I'm not sure how much detail
to include. pg_overexplain is a little unusual in being one of few
contrib modules where all you have to do is load it -- there's no
CREATE EXTENSION required, and shared_preload_libraries is not
required either. So, it probably makes sense to explain something, but
if we eventually have 10 more modules that work like this, we wouldn't
want a lengthy explanation in each one.

Another thing that we might want to do is adjust the documentation for
EXPLAIN. At the very least, the "See also" section should probably
reference pg_overexplain, but we might also want some text there
explaining the general concept that EXPLAIN is now extensible via
loadable modules.

--
Robert Haas
EDB: http://www.enterprisedb.com