Home > mailing lists

Re: Parallel Aggregates for string_agg and array_agg - Mailing list pgsql-hackers

From	Mark Dilger
Subject	Re: Parallel Aggregates for string_agg and array_agg
Date	May 2, 2018 00:35:46
Msg-id	ADEC5025-2BD4-42C8-B63F-004D89EEE8D4@gmail.com Whole thread Raw
In response to	Re: Parallel Aggregates for string_agg and array_agg (Andres Freund <andres@anarazel.de>)
Responses	Re: Parallel Aggregates for string_agg and array_agg
List	pgsql-hackers

Tree view

> On May 1, 2018, at 2:11 PM, Andres Freund <andres@anarazel.de> wrote:
> 
> Hi,
> 
> On 2018-05-01 14:09:39 -0700, Mark Dilger wrote:
>> I don't care which order the data is in, as long as x[i] and y[i] are
>> matched correctly.  It sounds like this patch would force me to write
>> that as, for example:
>> 
>> select array_agg(a order by a, b) AS x, array_agg(b order by a, b) AS y
>>  from generate_a_b_func(foo);
>> 
>> which I did not need to do before.
> 
> Why would it require that? Rows are still processed row-by-row even if
> there's parallelism, no?

I was responding in part to Tom's upthread statement:

  Your own example of assuming that separate aggregates are computed
  in the same order reinforces my point, I think.  In principle, anybody
  who's doing that should write

      array_agg(e order by x),
      array_agg(f order by x),
      string_agg(g order by x)

  because otherwise they shouldn't assume that;

It seems Tom is saying that you can't assume separate aggregates will be
computed in the same order.  Hence my response.  What am I missing here?

mark

pgsql-hackers by date:

From: David Rowley
Date: 02 May 2018, 00:34:07
Subject: Re: Remove mention in docs that foreign keys on partitioned tablesare not supported

From: Andres Freund
Date: 02 May 2018, 00:38:32
Subject: Re: Parallel Aggregates for string_agg and array_agg

Re: Parallel Aggregates for string_agg and array_agg - Mailing list pgsql-hackers

Previous

Next