Home > mailing lists

Unable to get acceptable performance from EXCEPT - Mailing list pgsql-hackers

From	Alfred Perlstein
Subject	Unable to get acceptable performance from EXCEPT
Date	May 10, 2000 18:04:11
Msg-id	20000510153511.N28180@fw.wintelcom.net Whole thread Raw
Responses	Re: Unable to get acceptable performance from EXCEPT
List	pgsql-hackers

Tree view

=# select count(*) from ref_old;count 
-------10595
(1 row)

=# select count(*) from ref_new;count 
-------22997
(1 row)

=# select ref_id from ref_old except select ref_id from ref_new;

Takes over 10 minutes, probably closer to half an hour.

I've also tried using 'NOT IN ( select ref_id from ref_new )'

ref_id is an int4, this is on Postgresql 7.0.

This confuses me because the way I'd plan to execute this query would
be something like this: (pseudo code)

result retval;
sort(ref_old);
sort(ref_new);
i = k = 0;
while (i < count(ref_old)) {while(ref_old[i] > ref_new[k])    k++;while(ref_old[i] == ref_new[k])
i++;while(ref_old[i]< ref_new[k])    store(&retval, ref_old[i++]);
 
}
return (retval);

I can't imagine this algorithm would take over 10 minutes on my
hardware.  Can anyone shed some light on what's going on here?

Is there a way to formulate my SQL to get Postgresql to follow
this algorithm?

thanks,
-- 
-Alfred Perlstein - [bright@wintelcom.net|alfred@freebsd.org]
"I have the heart of a child; I keep it in a jar on my desk."

pgsql-hackers by date:

From: Bruce Momjian
Date: 10 May 2000, 17:37:11
Subject: Re: setproctitle() no longer used?

From: The Hermit Hacker
Date: 10 May 2000, 18:18:15
Subject: Re: setproctitle() no longer used?

Unable to get acceptable performance from EXCEPT - Mailing list pgsql-hackers

Previous

Next