Thread: RE: [GENERAL] How to do this in Postgres

RE: [GENERAL] How to do this in Postgres

From
Fabian.Frederick@prov-liege.be
Date:
> Hi there,
>
> I tried all I could think of with the following problem, perhaps
> someone has another idea.
>
> I have a table where for each id there may (and often are) multiple
> rows with some kind of priority.
>   create table data ( id1 int4, id2 int4, <<lots of data>>,
> prio int4 );
> The minimal priority is not guaranteed to be 1. There are 200k
> different ids with up to 10 entries, summing up to 400k rows.
>
> Not I want to do something like this:
>
>     select * from data where <<prio is minimal per id pair>>.
>
> First attempt (deleting non minimal)
> ------------------------------------
>
>     select a.id1, a.id2, a.prio
>     into bugos
>     from a data, b data
>     where a.prio > b.prio and a.id1 = b.id1 and a.id2 = b.id2;
>
>     delete from data
>     where id1 = bogus.id1 and id2 = bogus.id2 and prio = bogus.prio;
>
> The join does not seem to complete. I am not sure whether I should
> have waited longer, but after 4h without significant disk access I
> do not think that this thing will ever return. Indexing didn't help.
>
> Second attempt (stored procedures)
> ----------------------------------
>
>     create function GetData( int4, int4 )
>     returns data
>     as 'select *
>         from data
>         where id1 = $1 and id2 = $2
>         order by prio
>         limit 1'
>     language 'sql';
>
>     select GetData(id1,id2) from <<table with unique ids>>;
>
> limit in functions is not yet implemented in postgres (6.5.2)
>
> Third attempt (use perl on dumped table)
> ----------------------------------------
>
> I don't want to :-)

IMHO, Perl is the solution,
some foreach architecture on @table should work...
Of course it could be ordinal (usable on fixed row system).
The best would be to recover row name as well(if someone has an
idea to do this ???):

    foreach $item in fetched_row{
        #Recover row name (Process runned once (rec#1)
        ???
        #Recover row content
    }


Regards, Fabian

RE: [GENERAL] How to do this in Postgres

From
Stuart Rison
Date:
Hello,

On Tue, 23 Nov 1999 Fabian.Frederick@prov-liege.be wrote:

> > Hi there,
> >
> > I tried all I could think of with the following problem, perhaps
> > someone has another idea.
> >
> > I have a table where for each id there may (and often are) multiple
> > rows with some kind of priority.
> >   create table data ( id1 int4, id2 int4, <<lots of data>>,
> > prio int4 );
> > The minimal priority is not guaranteed to be 1. There are 200k
> > different ids with up to 10 entries, summing up to 400k rows.
> >
> > Not I want to do something like this:
> >
> >     select * from data where <<prio is minimal per id pair>>.
> >

I'm going to stop here and try and give you a query that does exactly
that (I think).

How about:

select * from data
where (prio,id1,id2) = (
SELECT min(prio) from data d2 where d2.id1=1 and d2.id2=2 group by
d2.id1,d2.id2
);

Seems to work on my -very limited- test table.  Trickiest bit was to avoid
the "ERROR:  parser: Subselect has too many or too few fields." message
you get if you specify 'where id1=1 and id2=2' in the main SELECT and just
and 'AND prio=(SELECT min(prio)...' [do ignore this if it makes no
sense!].

Should be easy to turn that into a SELECT GetData( int4, int4 ) like
function if you want.

HTH,

Stuart.

Stuart C. G. Rison
Department of Biochemistry and Molecular Biology
6th floor, Darwin Building, University College London (UCL)
Gower Street, London, WC1E 6BT, United Kingdom
Tel. 0207 504 2303, Fax. 0207 380 7193
e-mail: rison@biochem.ucl.ac.uk