Re: Bug: Missing check_stack_depth() in GRAPH_TABLE rewriter - Mailing list pgsql-hackers

From Ashutosh Bapat
Subject Re: Bug: Missing check_stack_depth() in GRAPH_TABLE rewriter
Date
Msg-id CAExHW5sEQOK3F+pkZcFy=FX-dOhdittBS6A7Hf88HT841bunLg@mail.gmail.com
Whole thread
In response to Bug: Missing check_stack_depth() in GRAPH_TABLE rewriter  (SATYANARAYANA NARLAPURAM <satyanarlapuram@gmail.com>)
List pgsql-hackers
On Sat, Apr 11, 2026 at 9:28 AM SATYANARAYANA NARLAPURAM
<satyanarlapuram@gmail.com> wrote:
>
> Hi Hackers,
>
> Two recursive functions, generate_setop_from_pathqueries() and
> generate_queries_for_path_pattern_recurse() in GRAPH_TABLE rewriter are missing check_stack_depth() calls. Consider a
propertygraph with  400 edge tables with a 2-hop pattern produce 160,000 path queries and therefore 160,000 recursion
framesin 
> generate_setop_from_pathqueries(). This can easily exceed the OS stack limit for some systems and crashes the
backend.
>
> I am proposing a stop gap fix of checking stack depth by calling check_stack_depth for now but in future we may want
toreduce the recursion depth (for example use a balanced tree to reduce depth from O(N) to O(log N). 
>
> Attached a patch for adding the check_stack_depth() check.
>
> Repro:
>
> CREATE TABLE sv (id int PRIMARY KEY);
>   INSERT INTO sv VALUES (1);
>
>   DO $$
>   BEGIN
>     FOR i IN 1..400 LOOP
>       EXECUTE format(
>         'CREATE TABLE se_%s (id int PRIMARY KEY, src int, dst int)', i);
>     END LOOP;
>   END$$;
>
>   DO $$
>   DECLARE
>     sql text;
>     edges text := '';
>   BEGIN
>     FOR i IN 1..400 LOOP
>       IF i > 1 THEN edges := edges || ', '; END IF;
>       edges := edges || format(
>         'se_%s KEY (id) SOURCE KEY (src) REFERENCES sv (id) '
>         'DESTINATION KEY (dst) REFERENCES sv (id)', i);
>     END LOOP;
>     EXECUTE 'CREATE PROPERTY GRAPH g VERTEX TABLES (sv KEY (id)) '
>          || 'EDGE TABLES (' || edges || ')';
>   END$$;
>
>   SELECT * FROM GRAPH_TABLE(g
>     MATCH (a)-[e1]->(b)-[e2]->(c)
>     COLUMNS(a.id AS a_id))
>   LIMIT 1;
>

Thanks for the report. I could reproduce the segfault on my laptop.
The attached patch fixes it and gives ERROR:  stack depth limit
exceeded.

generate_queries_for_path_pattern_recurse() - has to work in a linear
fashion since the elements need to be processed in an order. Each
permutation of elements produces one query. These queries can be
arranged in a balanced tree as you  suggest OR when constructing the
setop tree we could generate it in divide-and-conquer manner. However,
the tree will be flattened in the planner anyway (See
flatten_simple_union_all() and pull_up_simple_union_all()). Thus the
final planning will require a deeper stack anyway. The code complexity
doesn't seem to be worth it.

I also looked at a few commits that add check_stack_depth() to see if
we add tests for these scenarios. But I didn't find any. So no tests
added with this commit.



--
Best Wishes,
Ashutosh Bapat

Attachment

pgsql-hackers by date:

Previous
From: Antonin Houska
Date:
Subject: Re: repack: fix uninitialized DecodingWorkerShared.initialized
Next
From: Ashutosh Bapat
Date:
Subject: Re: Typo fixes in Graph table files