Home > mailing lists

Re: POC: GROUP BY optimization - Mailing list pgsql-hackers

From	Andrei Lepikhov
Subject	Re: POC: GROUP BY optimization
Date	April 22 06:51:02
Msg-id	7a338db0-df0d-4e2e-ba29-abf4893f41e8@postgrespro.ru Whole thread Raw
In response to	Re: POC: GROUP BY optimization (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-hackers

Tree view

On 4/12/24 06:44, Tom Lane wrote:
> If this patch were producing better results I'd be more excited
> about putting more work into it.  But on the basis of what I'm
> seeing right now, I think maybe we ought to give up on it.
Let me show current cases where users have a profit with this tiny 
improvement (see queries and execution results in query.sql):
1. 'Not optimised query text' — when we didn't consider group-by 
ordering during database development.
2. 'Accidental pathkeys' - we didn't see any explicit orderings, but 
accidentally, the planner used merge join that caused some orderings and 
we can utilise it.
3. 'Uncertain scan path' — We have options regarding which index to use, 
and because of that, we can't predict the optimal group-by ordering 
before the start of query planning.
4. 'HashAgg V/S GroupAgg' — sometimes, the GroupAgg strategy outperforms 
HashAgg just because we don't need any ordering procedure at all.

And the last thing here — this code introduces the basics needed to add 
more sophisticated strategies, like ordering according to uniqueness or 
preferring to set constant-width columns first in grouping.

-- 
regards,
Andrei Lepikhov
Postgres Professional

Attachment

query.sql

pgsql-hackers by date:

From: Peter Geoghegan
Date: 22 April, 05:52:16
Subject: Re: Assert failure in _bt_preprocess_array_keys

From: Michael Paquier
Date: 22 April, 08:04:04
Subject: Re: [PATCH] Replace magic constant 3 with NUM_MERGE_MATCH_KINDS

Re: POC: GROUP BY optimization - Mailing list pgsql-hackers

Attachment

Previous

Next