Home > mailing lists

Re: selecting duplicate records - Mailing list pgsql-sql

From	Christopher Browne
Subject	Re: selecting duplicate records
Date	September 23, 2003 00:31:24
Msg-id	m3r8282o4s.fsf@wolfe.cbbrowne.com Whole thread Raw
In response to	Re: selecting duplicate records (Christoph Haller <ch@rodos.fzk.de>)
Responses	Re: selecting duplicate records
List	pgsql-sql

Tree view

The world rejoiced as ch@rodos.fzk.de (Christoph Haller) wrote:
>> 1. How to select duplicate records only from a single table using a
> select
>> query.
>>
> e.g.
> select sid,count(sid) from location group by sid having count(sid)>1;
>
> Do you get the idea?
> Your request is pretty unspecific, so if this is not what you're asking
> for,
> try again.

The aggregate is likely to perform horrifically badly.  Here might
be an option:

Step 1.  Find all of the duplicates...

select a.* into temp table sid from some_table a, some_table b where a.oid < b.oid and   a.field1 = b.field1 and
a.field2= b.field2 and   a.field3 = b.field3 and    ...   a.fieldn = b.fieldn;

Step 2.  Look for the matching entries in the source table...

select a.* from some_table a, sid b where   a.field1 = b.field1 and   a.field2 = b.field2 and   a.field3 = b.field3 and
  ...   a.fieldn = b.fieldn;

[There's a weakness here; if there are multiple dupes, they may get
picked multiple times in the second query :-(.]
-- 
If this was helpful, <http://svcs.affero.net/rm.php?r=cbbrowne> rate me
http://cbbrowne.com/info/rdbms.html
As Will Rogers would have said, "There is no such thing as a free
variable."  -- Alan Perlis

pgsql-sql by date:

From: Rudi Starcevic
Date: 22 September 2003, 23:50:52
Subject: unsubscribe

From: Joe Conway
Date: 23 September 2003, 01:01:49
Subject: Re: Oracle 'connect by prior' now eaiser in 7.3?

Re: selecting duplicate records - Mailing list pgsql-sql

Previous

Next