Re: Parallel Aggregate - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: Parallel Aggregate
Date	December 23, 2015 18:12:53
Msg-id	CA+TgmoakXoWdc_BoAs9-Mp6N0k2zfzrF6cpd_3PDTacde8yaJQ@mail.gmail.com
In response to	Re: Parallel Aggregate (David Rowley <david.rowley@2ndquadrant.com>)
Responses	Re: Parallel Aggregate
List	pgsql-hackers

On Mon, Dec 21, 2015 at 6:38 PM, David Rowley
<david.rowley@2ndquadrant.com> wrote:
> On 22 December 2015 at 04:16, Paul Ramsey <pramsey@cleverelephant.ca> wrote:
>>
>> Shouldn’t parallel aggregate come into play regardless of scan
>> selectivity?
>
> I'd say that the costing should take into account the estimated number of
> groups.
>
> The more tuples that make it into each group, the more attractive parallel
> grouping should seem. In the extreme case if there's 1 tuple per group, then
> it's not going to be of much use to use parallel agg, this would be similar
> to a scan with 100% selectivity. So perhaps the costings for it can be
> modeled around the parallel scan costing, but using the estimated groups
> instead of the estimated tuples.

Generally, the way that parallel costing is supposed to work (with the
parallel join patch, anyway) is that you've got the same nodes costed
the same way you would otherwise, but the row counts are lower because
you're only processing 1/Nth of the rows.  That's probably not exactly
the whole story here, but it's something to think about.
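
As a rough illustration of the two ideas above (nodes costed the usual
way but fed 1/Nth of the rows, and the group count limiting how much a
partial aggregate can shrink its input), here is a minimal sketch. It is
not the planner's actual code: the function names, the linear cost
model, and the 0.01 per-row constant are all assumptions made up for
this example.

```python
# Hypothetical toy cost model; none of these names exist in PostgreSQL.

def per_worker_rows(total_rows: float, participants: int) -> float:
    """Rows each participant is assumed to process: 1/Nth of the input."""
    if participants < 1:
        raise ValueError("participants must be at least 1")
    return total_rows / participants


def agg_cost(input_rows: float, cpu_cost_per_row: float = 0.01) -> float:
    """Assumed aggregate cost: linear in the rows the node sees.

    The same formula is used for serial and parallel plans; only the
    row count passed in differs.
    """
    return input_rows * cpu_cost_per_row


def partial_agg_output_rows(rows_in: float, est_groups: float) -> float:
    """Rows a partial aggregate emits: at most one per group it sees.

    With one tuple per group this equals rows_in, so nothing is saved
    before the results are combined.
    """
    return min(rows_in, est_groups)


total_rows = 1_000_000
participants = 4

serial_cost = agg_cost(total_rows)
rows_each = per_worker_rows(total_rows, participants)
parallel_cost_each = agg_cost(rows_each)

# Few groups: each participant hands back very little to combine.
few_groups_out = partial_agg_output_rows(rows_each, est_groups=100)
# One tuple per group: every input row is passed along unchanged.
unique_groups_out = partial_agg_output_rows(rows_each, est_groups=total_rows)

print(serial_cost, parallel_cost_each, few_groups_out, unique_groups_out)
```

In this toy model the per-participant aggregate cost falls in proportion
to the row count, while the number of rows handed back for combining
depends only on the estimated groups, which is where the two views in
this thread meet.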

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
