Home > mailing lists

Re: count(*), EXISTS, indexes - Mailing list pgsql-sql

From	Itai Zukerman
Subject	Re: count(*), EXISTS, indexes
Date	April 11, 2003 20:22:33
Msg-id	87vfxkrap2.fsf@matt.w80.math-hat.com Whole thread Raw
In response to	Re: count(*), EXISTS, indexes (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: count(*), EXISTS, indexes (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-sql

Tree view

>> Define:
>>   CREATE TABLE A (x int PRIMARY KEY, real v);
>>   CREATE TABLE B (x int);
>
>> I'd like to calculate:
>>   SELECT sum(v) FROM A WHERE EXISTS (SELECT 1 FROM B WHERE A.x=B.x);
>> ...but then it won't use the primary key index on A.x.
>
> In CVS tip (7.4-to-be) I think
>
> SELECT sum(v) FROM A WHERE A.x IN (SELECT B.x FROM B);

Oops.  I guess I should've changed the example to "WHERE A.x>=B.x"
since that's what I'm really doing.  Oh, well, my goof.

> would probably work well.  In current releases I think all you can do is
> add an index to B.x and settle for the EXISTS() approach.

I don't think indexing B will help speed-wise if it still does the seq
scan through A.

>> PS.  B is relatively small, a few thousand rows, while A has well over
>> 500,000 rows.  The DISTINCT A.x should be about 10,000-50,000.
>
> BTW, how can a PRIMARY KEY column have fewer DISTINCT values than there
> are rows?

Oh, I just meant, "the DISTINCT A.x that are returned by:
 SELECT DISTINCT A.x FROM A,B WHERE A.x=B.x

".  Sorry for my lack of clarity...

-- 
Itai Zukerman  <http://www.math-hat.com/~zukerman/>

pgsql-sql by date:

From: Tom Lane
Date: 11 April 2003, 20:10:53
Subject: Re: count(*), EXISTS, indexes

From: Itai Zukerman
Date: 11 April 2003, 20:32:05
Subject: Re: count(*), EXISTS, indexes

Re: count(*), EXISTS, indexes - Mailing list pgsql-sql

Previous

Next