pgsql: Fix distinctness check for queries with grouping sets - Mailing list pgsql-committers

From Richard Guo
Subject pgsql: Fix distinctness check for queries with grouping sets
Date
Msg-id E1vSsnw-003aIp-0Z@gemulon.postgresql.org
Whole thread Raw
List pgsql-committers
Fix distinctness check for queries with grouping sets

query_is_distinct_for() is intended to determine whether a query never
returns duplicates of the specified columns.  For queries using
grouping sets, if there are no grouping expressions, the query may
contain one or more empty grouping sets.  The goal is to detect
whether there is exactly one empty grouping set, in which case the
query would return a single row and thus be distinct.

The previous logic in query_is_distinct_for() was incomplete because
the check was insufficiently thorough and could return false when it
could have returned true.  It failed to consider cases where the
DISTINCT clause is used on the GROUP BY, in which case duplicate empty
grouping sets are removed, leaving only one.  It also did not
correctly handle all possible structures of GroupingSet nodes that
represent a single empty grouping set.

To fix, add a check for the groupDistinct flag, and expand the query's
groupingSets tree into a flat list, then verify that the expanded list
contains only one element.

No backpatch as this could result in plan changes.

Author: Richard Guo <guofenglinux@gmail.com>
Reviewed-by: David Rowley <dgrowleyml@gmail.com>
Discussion: https://postgr.es/m/CAMbWs480Z04NtP8-O55uROq2Zego309+h3hhaZhz6ztmgWLEBw@mail.gmail.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/f00484c170f56199c3eeacc82bd72f8c1e3baf6b

Modified Files
--------------
src/backend/optimizer/plan/analyzejoins.c | 19 ++++---
src/test/regress/expected/join.out        | 83 +++++++++++++++++++++++++++++++
src/test/regress/sql/join.sql             | 32 ++++++++++++
3 files changed, 127 insertions(+), 7 deletions(-)


pgsql-committers by date:

Previous
From: Richard Guo
Date:
Subject: pgsql: Fix const-simplification for index expressions and predicate
Next
From: Dean Rasheed
Date:
Subject: pgsql: doc: Fix statement about ON CONFLICT and deferrable constraints.