Thread: "Join" on delimeter aggregate query

"Join" on delimeter aggregate query

From

Michael A Nachbaur

Date:

07 June 2003, 20:26:01

Hello everyone,

I've set up PostgreSQL as the authentication / configuration database for my 
mail server (Postfix + Courier-IMAP), and though it works beautifully, I need 
some help on my aliases query.

You see, define aliases in a database table as rows in a column in the form of 
"Source" and "Target".  The problem is that one source address can be 
delivered to multiple targets (e.g. internal mailing list, or a temporary 
forward to another address), but postfix only processes the first record 
returned from an SQL query.

Postfix can deliver to multiple targets, if you separate the targets with 
comas, like so:

Source             Target
joe@bob.org   address1@bob.org,address2@bob.org,....

What I would like to do, is something like the following (I know I'd need to 
group the query, but you get the idea):

Select DISTINCT(Source), JOIN(Target, ',') FROM Aliases

Is there any way this can be done with Postfix?

-- 
Michael A Nachbaur <mike@nachbaur.com>

Re: "Join" on delimeter aggregate query

From

Joe Conway

Date:

08 June 2003, 18:14:40

Michael A Nachbaur wrote:
> Source             Target
> joe@bob.org   address1@bob.org,address2@bob.org,....
> 

It isn't clear from your description what you want (to me, anyway), but 
I'd guess something like this:

regression=# select * from src2tgt;   source    |      target
-------------+------------------ joe@bob.org | address1@bob.org joe@bob.org | address2@bob.org tom@bob.org |
address3@bob.orgtom@bob.org | address4@bob.org
 
(4 rows)

create or replace function mylist(text,text) returns text as '
declare  result text;
begin  if $1 = '''' then    result := $2;  else    result := $1 || '','' || $2;  end if;  return result;
end;
' language 'plpgsql';

create aggregate tgtlist
(  BASETYPE = text,  SFUNC = mylist,  STYPE = text,  INITCOND = ''
);

regression=# select source, tgtlist(target) from src2tgt group by source;   source    |              tgtlist
-------------+----------------------------------- joe@bob.org | address1@bob.org,address2@bob.org tom@bob.org |
address3@bob.org,address4@bob.org
(2 rows)

HTH,

Joe

Re: "Join" on delimeter aggregate query

From

Eivind Kvedalen

Date:

08 June 2003, 20:45:44

Hi

You can create an aggregate function to solve this. A friend of mine asked
the same question a while ago, and I created a possible example solution
for him, which I paste here:

CREATE FUNCTION concat(varchar,varchar) RETURNS varchar    AS 'SELECT CASE          $1 WHEN \'\' THEN $2          ELSE
$1|| \',\'|| $2        END AS RESULT;'    LANGUAGE SQL; 

/* DROP AGGREGATE concat(varchar); */

CREATE AGGREGATE concat ( BASETYPE = varchar, SFUNC = concat, STYPE = varchar, INITCOND = ''
);

/* Example code */

DROP TABLE test;
CREATE TABLE test (     a varchar,     b varchar
);

INSERT INTO test VALUES ('A', '1');
INSERT INTO test VALUES ('A', '3');
INSERT INTO test VALUES ('A', '2');
INSERT INTO test VALUES ('B', 'a');
INSERT INTO test VALUES ('C', 'b');
INSERT INTO test VALUES ('C', 'c');

SELECT a, concat(b) FROM (SELECT a,b FROM test ORDER BY a,b) T GROUP BY a;

/*
a | concat
---+---------A | 1,2,3B | aC | b,c

*/

The ORDER BY is included to sort the rows before they are aggregated. I'm
not sure that this guarantees that they actually will be sorted, but maybe
some of the postgresql hackers can confirm/deny this? I guess this isn't
important to you, though.


On Fri, 6 Jun 2003, Michael A Nachbaur wrote:

> Hello everyone,
>
> I've set up PostgreSQL as the authentication / configuration database for my
> mail server (Postfix + Courier-IMAP), and though it works beautifully, I need
> some help on my aliases query.
>
> You see, define aliases in a database table as rows in a column in the form of
> "Source" and "Target".  The problem is that one source address can be
> delivered to multiple targets (e.g. internal mailing list, or a temporary
> forward to another address), but postfix only processes the first record
> returned from an SQL query.
>
> Postfix can deliver to multiple targets, if you separate the targets with
> comas, like so:
>
> Source             Target
> joe@bob.org   address1@bob.org,address2@bob.org,....
>
> What I would like to do, is something like the following (I know I'd need to
> group the query, but you get the idea):
>
> Select DISTINCT(Source), JOIN(Target, ',') FROM Aliases
>
> Is there any way this can be done with Postfix?
>
>

Eivind

--

| Mail: eivindkv@ifi.uio.no               | Lazy on IRC
| HP: www.stud.ifi.uio.no/~eivindkv       | "Jeg skal vrenge deg med håret
| Tlf: 22187123/93249534                  |  inn."
|                                         |               -- Yang Tse Lyse

Re: "Join" on delimeter aggregate query

From

Bruno Wolff III

Date:

09 June 2003, 04:22:33

On Fri, Jun 06, 2003 at 10:03:29 -0700, Michael A Nachbaur <mike@nachbaur.com> wrote:
> 
> Postfix can deliver to multiple targets, if you separate the targets with 
> comas, like so:
> 
> Source             Target
> joe@bob.org   address1@bob.org,address2@bob.org,....
> 
> What I would like to do, is something like the following (I know I'd need to 
> group the query, but you get the idea):
> 
> Select DISTINCT(Source), JOIN(Target, ',') FROM Aliases
> 
> Is there any way this can be done with Postfix?

You could write an aggregate function to concatenate email addresses with
comma separators.

Re: "Join" on delimeter aggregate query

From

Tom Lane

Date:

09 June 2003, 22:38:39

Eivind Kvedalen <eivindkv@ifi.uio.no> writes:
> SELECT a, concat(b) FROM (SELECT a,b FROM test ORDER BY a,b) T GROUP BY a;

> The ORDER BY is included to sort the rows before they are aggregated. I'm
> not sure that this guarantees that they actually will be sorted, but maybe
> some of the postgresql hackers can confirm/deny this?

This technique will work reliably as of 7.4, but it's not reliable
in existing releases.  The GROUP BY will do its own sort on A, and
unless qsort() is stable on your machine (which it's not, in most
implementations) the secondary ordering by B will be destroyed.

The fix in 7.4 simply makes the planner smart enough to notice that
the sub-select's output is already adequately sorted for grouping
by A.
        regards, tom lane

Re: "Join" on delimeter aggregate query

From

Tom Lane

Date:

10 June 2003, 00:46:21

Eivind Kvedalen <eivindkv@ifi.uio.no> writes:
> Ok. What I actually had in mind was whether the optimizer would remove the
> ORDER BY clause completely or not,

No.  If you put an ORDER BY in a subselect, I think the system should
honor it.
        regards, tom lane

Re: "Join" on delimeter aggregate query

From

Eivind Kvedalen

Date:

10 June 2003, 01:43:02

On Sun, 8 Jun 2003, Tom Lane wrote:

> Eivind Kvedalen <eivindkv@ifi.uio.no> writes:
> > SELECT a, concat(b) FROM (SELECT a,b FROM test ORDER BY a,b) T GROUP BY a;
>
> > The ORDER BY is included to sort the rows before they are aggregated. I'm
> > not sure that this guarantees that they actually will be sorted, but maybe
> > some of the postgresql hackers can confirm/deny this?
>
> This technique will work reliably as of 7.4, but it's not reliable
> in existing releases.  The GROUP BY will do its own sort on A, and
> unless qsort() is stable on your machine (which it's not, in most
> implementations) the secondary ordering by B will be destroyed.
>
> The fix in 7.4 simply makes the planner smart enough to notice that
> the sub-select's output is already adequately sorted for grouping
> by A.

Ok. What I actually had in mind was whether the optimizer would remove the
ORDER BY clause completely or not, as it isn't used in the top-level
SELECT query, and SQL doesn't in general guarantee ordered rows back
unless there's an ORDER BY in the top-level SELECT (I haven't read the SQL
standard, so I might very well be wrong here).

The GROUP BY sorts on A to do the grouping correctly, right? (That is,
removing the duplicates from A)

(Now, thinking more about this, removing the ORDER BY in the optimizer as
I suggested above would effectively remove the ORDER BY in created views,
right?)


Eivind

--

| Mail: eivindkv@ifi.uio.no               | Lazy on IRC
| HP: www.stud.ifi.uio.no/~eivindkv       | "Jeg skal vrenge deg med håret
| Tlf: 22187123/93249534                  |  inn."
|                                         |               -- Yang Tse Lyse

Re: "Join" on delimeter aggregate query

From

Michael A Nachbaur

Date:

10 June 2003, 02:27:18

Thanks very much, this helps immensely.  I've worked with functions before, 
but never aggregates.  I guess there's some more bedtime reading for me to 
look into now.

Re: sorting, this is not important to me, but I will keep the issues brought 
up by Tom Lane in mind when I use this.

On Saturday 07 June 2003 02:06 pm, Eivind Kvedalen wrote:
> Hi
>
> You can create an aggregate function to solve this. A friend of mine asked
> the same question a while ago, and I created a possible example solution
> for him, which I paste here:
>
> CREATE FUNCTION concat(varchar,varchar) RETURNS varchar
>      AS 'SELECT CASE
>            $1 WHEN \'\' THEN $2
>            ELSE $1 || \',\'|| $2
>          END AS RESULT;'
>      LANGUAGE SQL;
>
> /* DROP AGGREGATE concat(varchar); */
>
> CREATE AGGREGATE concat (
>   BASETYPE = varchar,
>   SFUNC = concat,
>   STYPE = varchar,
>   INITCOND = ''
> );
>
> /* Example code */
>
> DROP TABLE test;
> CREATE TABLE test (
>       a varchar,
>       b varchar
> );
>
> INSERT INTO test VALUES ('A', '1');
> INSERT INTO test VALUES ('A', '3');
> INSERT INTO test VALUES ('A', '2');
> INSERT INTO test VALUES ('B', 'a');
> INSERT INTO test VALUES ('C', 'b');
> INSERT INTO test VALUES ('C', 'c');
>
> SELECT a, concat(b) FROM (SELECT a,b FROM test ORDER BY a,b) T GROUP BY a;
>
> /*
>
>  a | concat
> ---+---------
>  A | 1,2,3
>  B | a
>  C | b,c
>
> */
>
> The ORDER BY is included to sort the rows before they are aggregated. I'm
> not sure that this guarantees that they actually will be sorted, but maybe
> some of the postgresql hackers can confirm/deny this? I guess this isn't
> important to you, though.
>
> On Fri, 6 Jun 2003, Michael A Nachbaur wrote:
> > Hello everyone,
> >
> > I've set up PostgreSQL as the authentication / configuration database for
> > my mail server (Postfix + Courier-IMAP), and though it works beautifully,
> > I need some help on my aliases query.
> >
> > You see, define aliases in a database table as rows in a column in the
> > form of "Source" and "Target".  The problem is that one source address
> > can be delivered to multiple targets (e.g. internal mailing list, or a
> > temporary forward to another address), but postfix only processes the
> > first record returned from an SQL query.
> >
> > Postfix can deliver to multiple targets, if you separate the targets with
> > comas, like so:
> >
> > Source             Target
> > joe@bob.org   address1@bob.org,address2@bob.org,....
> >
> > What I would like to do, is something like the following (I know I'd need
> > to group the query, but you get the idea):
> >
> > Select DISTINCT(Source), JOIN(Target, ',') FROM Aliases
> >
> > Is there any way this can be done with Postfix?
>
> Eivind

-- 
Michael A Nachbaur <mike@nachbaur.com>

Re: "Join" on delimeter aggregate query

From

Bruno Wolff III

Date:

10 June 2003, 10:56:20

On Mon, Jun 09, 2003 at 10:35:10 +0200, Eivind Kvedalen <eivindkv@ifi.uio.no> wrote:
> 
> Ok. What I actually had in mind was whether the optimizer would remove the
> ORDER BY clause completely or not, as it isn't used in the top-level
> SELECT query, and SQL doesn't in general guarantee ordered rows back
> unless there's an ORDER BY in the top-level SELECT (I haven't read the SQL
> standard, so I might very well be wrong here).

Allowing ordering in subselects is a feature (mostly for use with LIMIT)
and won't be removed because it isn't a top level select.