Home > mailing lists

Re: where not exists - Mailing list pgsql-sql

From	Llew Sion Goodstadt
Subject	Re: where not exists
Date	March 20, 2002 08:41:12
Msg-id	004801c1cc1f$663c39c0$1c1d01a3@FGU028 Whole thread Raw
In response to	Re: where not exists ("Dag Arne Matre" <dag-arne@matreweb.com.antispam>)
List	pgsql-sql

Tree view

I ended up by using an external programme.
NOT EXISTS is just a set difference.
Doing set compares is really quick if both sets are sorted.
I use CRC64s for the data and just compare the resulting sorted sets of
(large CRC 64-bit) numbers.
Because everything hashes to a number, the memory requirements are not
that bad either (8 bytes per item ~256000 tuples per Mb).
The programme is in C++ but is as fast in something like Perl.
I.e. comparing millions of rows of data takes 10s of seconds rather than
10s of minutes.


Leo

> 
> 1) get items which are orphaned in a.
> CREATE TEMP TABLE orphans as
>     SELECT a.join1, a.join2
>         FROM a LEFT OUTER JOIN b ON a.join1 = b.join1 AND 
> a.join2 = b.join2
>         WHERE b.join1 IS NULL AND b.join2 IS NULL
> 
> D A
> 
> 
> "Llew" <leo.goodstadt@anat.ox.ac.uk> wrote in message
> news:a65qm1$2k6g$1@jupiter.hub.org...
> > Dear everyone,
> > What is the best way of removing rows which are not in 
> another table?

pgsql-sql by date:

From: Stephan Szabo
Date: 19 March 2002, 23:08:56
Subject: Re: What is Syntax for multiple FULL OUTER JOINS?

From: Kelly Burkhart
Date: 20 March 2002, 08:56:58
Subject: optimizer tuning/forcing correct index use

Re: where not exists - Mailing list pgsql-sql

Previous

Next