Thread: ALL() question
Hello, I have a problem with the ALL() subquery expression. I have three tables: - specimens - test_bits - specimen_test_bits The specimen_test_bits table contains two foreign keys, one to specimens(id), another to test_bits(id). Here is an output of specimen_test_bits: muridae=> select * from specimen_test_bits;specimen_id | test_bit_id -------------+------------- 46096 | 1 46096 | 2 46096 | 3 46096 | 4 52894 | 1 52894 | 3 12546 | 2 What I would like is a query that returns all the specimen_id of this table which have _all_ the given test_bit_id. So in this case, with test_bit_id 1,2,3,4 it should return only specimen_id 46096. With the following I got a syntax error: select specimen_id from specimen_test_bits where test_bit_id = all(1,2,3,4); The following works but no rows are returned : select specimen_id from specimen_test_bits where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); Any idea how I could do this ? I guess the problem is my ALL() expression ... In advance thanks, Julien
Julien Cigar wrote: > > What I would like is a query that returns all the specimen_id of > this table which have _all_ the given test_bit_id. [snip] > With the following I got a syntax error: > select specimen_id > from specimen_test_bits > where test_bit_id = all(1,2,3,4); It's expecting an array here. You'd have to write = all('{1,2,3,4}') But that would have the same problem as... > The following works but no rows are returned : > select specimen_id > from specimen_test_bits > where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); It's testing each row individually and of course one row can't match ALL four values. What you want to do is count the distinct values. Something like: SELECT specimen_id FROM foo GROUP BY specimen_id HAVING count(distinct test_bit_id) = 4 ; -- Richard Huxton Archonet Ltd
On Wed, 2007-11-14 at 11:56 +0000, Richard Huxton wrote: > Julien Cigar wrote: > > > > What I would like is a query that returns all the specimen_id of > > this table which have _all_ the given test_bit_id. > [snip] > > With the following I got a syntax error: > > select specimen_id > > from specimen_test_bits > > where test_bit_id = all(1,2,3,4); > > It's expecting an array here. You'd have to write > = all('{1,2,3,4}') > But that would have the same problem as... > > > The following works but no rows are returned : > > select specimen_id > > from specimen_test_bits > > where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); > > It's testing each row individually and of course one row can't match ALL > four values. > > What you want to do is count the distinct values. Something like: > > SELECT > specimen_id > FROM foo > GROUP BY > specimen_id > HAVING > count(distinct test_bit_id) = 4 > ; > I don't think it would work, for example if I have: specimen_id | test_bit_id ------------+------------ 100 1 100 3 101 1 101 2 the test_bit_ids are parameters, so with the given test_bit_id 1,3 it would return specimen_id 101 too, which I don't want ... What I would like is the specimen_id which match _exactly_ the given test_bit_ids, so it should return only 100 in this example .. from the documentation ALL() can take a subquery too, not only an ARRAY (http://www.postgresql.org/docs/8.2/static/functions-subquery.html)
I finally found a solution: SELECT specimen_id FROM specimen_test_bits GROUP BY specimen_id HAVING array_accum(test_bit_id) = '{2,3,4}'; .. but I don't think it's very "clean" .. what do you think ? Thanks On Wed, 2007-11-14 at 15:50 +0100, Julien Cigar wrote: > On Wed, 2007-11-14 at 11:56 +0000, Richard Huxton wrote: > > Julien Cigar wrote: > > > > > > What I would like is a query that returns all the specimen_id of > > > this table which have _all_ the given test_bit_id. > > [snip] > > > With the following I got a syntax error: > > > select specimen_id > > > from specimen_test_bits > > > where test_bit_id = all(1,2,3,4); > > > > It's expecting an array here. You'd have to write > > = all('{1,2,3,4}') > > But that would have the same problem as... > > > > > The following works but no rows are returned : > > > select specimen_id > > > from specimen_test_bits > > > where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); > > > > It's testing each row individually and of course one row can't match ALL > > four values. > > > > What you want to do is count the distinct values. Something like: > > > > SELECT > > specimen_id > > FROM foo > > GROUP BY > > specimen_id > > HAVING > > count(distinct test_bit_id) = 4 > > ; > > > > I don't think it would work, for example if I have: > specimen_id | test_bit_id > ------------+------------ > 100 1 > 100 3 > 101 1 > 101 2 > > the test_bit_ids are parameters, so with the given test_bit_id 1,3 it > would return specimen_id 101 too, which I don't want ... > What I would like is the specimen_id which match _exactly_ the given > test_bit_ids, so it should return only 100 in this example .. > > from the documentation ALL() can take a subquery too, not only an ARRAY > (http://www.postgresql.org/docs/8.2/static/functions-subquery.html) > >
The doc says "The right-hand side is a parenthesized subquery, which must return exactly one column..."
That's what you have if using "... where test_bit_id = all(select id from test_bits where id in (1,2,3,4));"
The doc continues "...The left-hand expression is evaluated and compared to each row of the subquery result using the given operator, which must yield a Boolean result ..."
So your where expression is equivalent to:
where test_bit_id = (select id from test_bits where id = 1) AND
test_bit_id = (select id from test_bits where id = 2) AND
test_bit_id = (select id from test_bits where id = 3) AND
test_bit_id = (select id from test_bits where id = 4);
test_bit_id = (select id from test_bits where id = 2) AND
test_bit_id = (select id from test_bits where id = 3) AND
test_bit_id = (select id from test_bits where id = 4);
The doc continues "... The result of ALL is "true" if all rows yield true ..."
Since test_bit_id can never be 1, 2, 3 and 4 at the same time the result of ALL will be false. So no records get returned.
>>> Julien Cigar <jcigar@ulb.ac.be> 2007-11-14 15:50 >>>
On Wed, 2007-11-14 at 11:56 +0000, Richard Huxton wrote:
> Julien Cigar wrote:
> >
> > What I would like is a query that returns all the specimen_id of
> > this table which have _all_ the given test_bit_id.
> [snip]
> > With the following I got a syntax error:
> > select specimen_id
> > from specimen_test_bits
> > where test_bit_id = all(1,2,3,4);
>
> It's expecting an array here. You'd have to write
> = all('{1,2,3,4}')
> But that would have the same problem as...
>
> > The following works but no rows are returned :
> > select specimen_id
> > from specimen_test_bits
> > where test_bit_id = all(select id from test_bits where id in (1,2,3,4));
>
> It's testing each row individually and of course one row can't match ALL
> four values.
>
> What you want to do is count the distinct values. Something like:
>
> SELECT
> specimen_id
> FROM foo
> GROUP BY
> specimen_id
> HAVING
> count(distinct test_bit_id) = 4
> ;
>
I don't think it would work, for example if I have:
specimen_id | test_bit_id
------------+------------
100 1
100 3
101 1
101 2
the test_bit_ids are parameters, so with the given test_bit_id 1,3 it
would return specimen_id 101 too, which I don't want ...
What I would like is the specimen_id which match _exactly_ the given
test_bit_ids, so it should return only 100 in this example ..
from the documentation ALL() can take a subquery too, not only an ARRAY
(http://www.postgresql.org/docs/8.2/static/functions-subquery.html)
---------------------------(end of broadcast)---------------------------
TIP 7: You can help support the PostgreSQL project by donating at
http://www.postgresql.org/about/donate
Julien Cigar wrote: > On Wed, 2007-11-14 at 11:56 +0000, Richard Huxton wrote: >> Julien Cigar wrote: >>> What I would like is a query that returns all the specimen_id of >>> this table which have _all_ the given test_bit_id. >> [snip] >>> With the following I got a syntax error: >>> select specimen_id >>> from specimen_test_bits >>> where test_bit_id = all(1,2,3,4); >> It's expecting an array here. You'd have to write >> = all('{1,2,3,4}') >> But that would have the same problem as... >> >>> The following works but no rows are returned : >>> select specimen_id >>> from specimen_test_bits >>> where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); >> It's testing each row individually and of course one row can't match ALL >> four values. >> >> What you want to do is count the distinct values. Something like: >> >> SELECT >> specimen_id >> FROM foo >> GROUP BY >> specimen_id >> HAVING >> count(distinct test_bit_id) = 4 >> ; >> > > I don't think it would work, for example if I have: > specimen_id | test_bit_id > ------------+------------ > 100 1 > 100 3 > 101 1 > 101 2 > > the test_bit_ids are parameters, so with the given test_bit_id 1,3 it > would return specimen_id 101 too, which I don't want ... Not if you test for what you want too: ... FROM foo WHERE test_bit_id = ANY ('{1,3}') ...or... WHERE test_bit_id IN (1,3) > What I would like is the specimen_id which match _exactly_ the given > test_bit_ids, so it should return only 100 in this example .. > > from the documentation ALL() can take a subquery too, not only an ARRAY > (http://www.postgresql.org/docs/8.2/static/functions-subquery.html) Yes, but that doesn't help with your query - one row can't match ALL your values. -- Richard Huxton Archonet Ltd
Julien Cigar wrote: > I finally found a solution: > > SELECT specimen_id > FROM specimen_test_bits > GROUP BY specimen_id > HAVING array_accum(test_bit_id) = '{2,3,4}'; > > .. but I don't think it's very "clean" .. The key question is whether you can rely on getting (2,3,4) or whether you might get (4,3,2) or some other ordering. -- Richard Huxton Archonet Ltd
On Wed, Nov 14, 2007 at 02:39:02PM +0100, Julien Cigar wrote: > With the following I got a syntax error: > select specimen_id > from specimen_test_bits > where test_bit_id = all(1,2,3,4); where test_bit_id in (1,2,3,4) group by specimen_id having count(distinct test_bit_id) = 4; depesz -- quicksil1er: "postgres is excellent, but like any DB it requires a highly paid DBA. here's my CV!" :) http://www.depesz.com/ - blog dla ciebie (i moje CV)
Thanks :) it works as expected Julien On Wed, 2007-11-14 at 14:31 +0100, hubert depesz lubaczewski wrote: > On Wed, Nov 14, 2007 at 02:39:02PM +0100, Julien Cigar wrote: > > With the following I got a syntax error: > > select specimen_id > > from specimen_test_bits > > where test_bit_id = all(1,2,3,4); > > where test_bit_id in (1,2,3,4) > group by specimen_id > having count(distinct test_bit_id) = 4; > > depesz >
Julien Cigar escreveu: > Hello, > > I have a problem with the ALL() subquery expression. > I have three tables: > - specimens > - test_bits > - specimen_test_bits > > The specimen_test_bits table contains two foreign keys, one to > specimens(id), another to test_bits(id). > > Here is an output of specimen_test_bits: > > muridae=> select * from specimen_test_bits; > specimen_id | test_bit_id > -------------+------------- > 46096 | 1 > 46096 | 2 > 46096 | 3 > 46096 | 4 > 52894 | 1 > 52894 | 3 > 12546 | 2 > > What I would like is a query that returns all the specimen_id of > this table which have _all_ the given test_bit_id. So in this > case, with test_bit_id 1,2,3,4 it should return only > specimen_id 46096. > > With the following I got a syntax error: > select specimen_id > from specimen_test_bits > where test_bit_id = all(1,2,3,4); > > The following works but no rows are returned : > select specimen_id > from specimen_test_bits > where test_bit_id = all(select id from test_bits where id in (1,2,3,4)); > > Any idea how I could do this ? I guess the problem is my ALL() expression ... > Unclear, but works... SELECT DISTINCT stb.specimen_id FROM specimen_test_bits stb WHERE NOT EXISTS (SELECT * FROM (VALUES (1), (2), (3) , (4)) AS foo(id) WHERE NOT EXISTS (SELECT stb1.test_bit_id FROM specimen_test_bits stb1 WHERE foo.id = stb1.test_bit_id AND stb.specimen_id = stb1.specimen_id)); Osvaldo