Home > mailing lists

pgsql: Remove GROUP BY columns that are functionally dependent on other - Mailing list pgsql-committers

From	Tom Lane
Subject	pgsql: Remove GROUP BY columns that are functionally dependent on other
Date	February 11, 2016 22:35:18
Msg-id	E1aTzpL-0005ln-LQ@gemulon.postgresql.org Whole thread Raw
List	pgsql-committers

Tree view

Remove GROUP BY columns that are functionally dependent on other columns.

If a GROUP BY clause includes all columns of a non-deferred primary key,
as well as other columns of the same relation, those other columns are
redundant and can be dropped from the grouping; the pkey is enough to
ensure that each row of the table corresponds to a separate group.
Getting rid of the excess columns will reduce the cost of the sorting or
hashing needed to implement GROUP BY, and can indeed remove the need for
a sort step altogether.

This seems worth testing for since many query authors are not aware of
the GROUP-BY-primary-key exception to the rule about queries not being
allowed to reference non-grouped-by columns in their targetlists or
HAVING clauses.  Thus, redundant GROUP BY items are not uncommon.  Also,
we can make the test pretty cheap in most queries where it won't help
by not looking up a rel's primary key until we've found that at least
two of its columns are in GROUP BY.

David Rowley, reviewed by Julien Rouhaud

Branch
------
master

Details
-------
http://git.postgresql.org/pg/commitdiff/d4c3a156cb46dcd1f9f97a8011bd94c544079bb5

Modified Files
--------------
src/backend/catalog/pg_constraint.c      |  94 ++++++++++++++++++
src/backend/optimizer/plan/planner.c     | 159 +++++++++++++++++++++++++++++++
src/include/catalog/pg_constraint_fn.h   |   3 +
src/test/regress/expected/aggregates.out |  64 +++++++++++++
src/test/regress/expected/join.out       |  10 +-
src/test/regress/sql/aggregates.sql      |  31 ++++++
src/test/regress/sql/join.sql            |   4 +-
7 files changed, 360 insertions(+), 5 deletions(-)

pgsql-committers by date:

From: Tom Lane
Date: 11 February 2016, 20:51:53
Subject: pgsql: Move pg_constraint.h function declarations to new file pg_constr

From: Tom Lane
Date: 11 February 2016, 22:52:19
Subject: pgsql: Refactor check_functional_grouping() to use get_primary_key_attn

pgsql: Remove GROUP BY columns that are functionally dependent on other - Mailing list pgsql-committers

Previous

Next