Home > mailing lists

Re: Slow duplicate deletes - Mailing list pgsql-novice

From	DrYSG
Subject	Re: Slow duplicate deletes
Date	March 5, 2012 16:44:07
Msg-id	1330980227869-5538858.post@n5.nabble.com Whole thread Raw
In response to	Re: Slow duplicate deletes (Merlin Moncure <mmoncure@gmail.com>)
Responses	Re: Slow duplicate deletes Re: Slow duplicate deletes
List	pgsql-novice

Tree view

One point I might not have made clear. The reason I want to remove duplicates
is that the column "data_object.unique_id" became non-unique (someone added
duplicate rows). So I added the bigSeriel (idx) to uniquely identify the
rows, and I was using the SELECT MIN(idx) and GroupBy to pick just one of
the rows that became duplicated.

I am going to try out some of your excellent suggestions. I will report back
on how they are working.

One idea that was given to me was the following (what do you think Merlin?)

CREATE TABLE portal.new_metatdata AS
select distinct on (data_object.unique_id) * FROM portal.metadata;

Or something of this ilk should be faster because it only need to do a
sort on data_object.unique_id and then an insert. After you have
verified the results you can do:

BEGIN;
ALTER TABLE portal.metatdata rename TO portal.new_metatdata_old;
ALTER TABLE portal.new_metatdata rename TO portal.metatdata_old;
COMMIT;


--
View this message in context: http://postgresql.1045698.n5.nabble.com/Slow-duplicate-deletes-tp5537818p5538858.html
Sent from the PostgreSQL - novice mailing list archive at Nabble.com.

pgsql-novice by date:

From: Merlin Moncure
Date: 05 March 2012, 15:52:57
Subject: Re: Slow duplicate deletes

From: "VARTAK, SATISH CTR DFAS"
Date: 05 March 2012, 17:33:43
Subject: postgreSQL odbc driver for Sun Solaris

Re: Slow duplicate deletes - Mailing list pgsql-novice

Previous

Next