Home > mailing lists

Contradictory behavior of array_agg(distinct) aggregate. - Mailing list pgsql-hackers

From	Konstantin Knizhnik
Subject	Contradictory behavior of array_agg(distinct) aggregate.
Date	December 4, 2024 09:44:07
Msg-id	7d3268df-abc2-4eb3-8e2b-59df084b579a@garret.ru Whole thread Raw
Responses	Re: Contradictory behavior of array_agg(distinct) aggregate. Re: Contradictory behavior of array_agg(distinct) aggregate.
List	pgsql-hackers

Tree view

Hi hackers!


Is it only me who consider that current behavior of array_agg(distinct) 
contradicts to interpretation of nulls in other cases ("null" is 
something like "unknown" which means that we can not say weather two 
nulls are the same or not). This is why it is allowed to insert multiple 
nulls in the unique column.

postgres=# create table t(x integer unique);
CREATE TABLE
postgres=# insert into t values (null),(null);
INSERT 0 2
postgres=# select count(distinct x) from t;
  count
-------
      0
(1 row)

postgres=# select array_agg(distinct x) from t;
  array_agg
-----------
  {NULL}
(1 row)

postgres=# select array_agg(x) from t;
   array_agg
-------------
  {NULL,NULL}
(1 row)


So what is the number of distinct "x" values in this case? I think that 
according to SQL model - 0 (as count(distinct) returns).
Why in this case array_agg(distinct x) returns non-empty array?

Yes, unlike most other aggregates, `array_agg` is not ignoring null values.
But is it correct to treat two null values as the same (non-distinct)?
IMHO correct result in this case should be either {} or NULL, either 
{NULL,NULL}.

pgsql-hackers by date:

From: "Zhijie Hou (Fujitsu)"
Date: 04 December 2024, 09:42:55
Subject: RE: Memory leak in WAL sender with pgoutput (v10~)

From: Tom Lane
Date: 04 December 2024, 10:03:17
Subject: Re: Contradictory behavior of array_agg(distinct) aggregate.

Contradictory behavior of array_agg(distinct) aggregate. - Mailing list pgsql-hackers

Previous

Next