Home > mailing lists

Re: Parallel Aggregate - Mailing list pgsql-hackers

From	Haribabu Kommi
Subject	Re: Parallel Aggregate
Date	December 21, 2015 23:35:41
Msg-id	CAJrrPGcheLgAW0WaGXcQvXN=Hc9N4LWgMXMnc7r7c7akHbKD8g@mail.gmail.com Whole thread Raw
In response to	Re: Parallel Aggregate (Paul Ramsey <pramsey@cleverelephant.ca>)
List	pgsql-hackers

Tree view

On Tue, Dec 22, 2015 at 2:16 AM, Paul Ramsey <pramsey@cleverelephant.ca> wrote:
> Shouldn’t parallel aggregate come into play regardless of scan selectivity?
> I know in PostGIS land there’s a lot of stuff like:
>
> SELECT ST_Union(geom) FROM t GROUP BY areacode;
>
> Basically, in the BI case, there’s often no filter at all. Hoping that’s
> considered a prime case for parallel agg :)

Yes, the latest patch attached in the thread addresses this issue.
But it still lacks of proper cost calculation and comparison with
original aggregate cost.

The parallel aggregate selects only when the number of groups
should be at least less than 1/4 of rows that are getting selected.
Otherwise, doing aggregation two times for more number of
records leads to performance drop compared to original aggregate.

Regards,
Hari Babu
Fujitsu Australia

pgsql-hackers by date:

From: David Rowley
Date: 21 December 2015, 23:28:27
Subject: Re: Patch to improve a few appendStringInfo* calls

From: David Rowley
Date: 21 December 2015, 23:38:20
Subject: Re: Parallel Aggregate

Re: Parallel Aggregate - Mailing list pgsql-hackers

Previous

Next