Re: COPY from .csv File and Remove Duplicates - Mailing list pgsql-general

From: Rich Shepard
Subject: Re: COPY from .csv File and Remove Duplicates
Date:
Msg-id: alpine.LNX.2.00.1108111656050.14240@salmo.appl-ecosys.com
In response to: Re: COPY from .csv File and Remove Duplicates (David Johnston <polobo@yahoo.com>)
List: pgsql-general
On Thu, 11 Aug 2011, David Johnston wrote:

> If you have duplicates with matching real keys, inserting into a staging
> table and then moving new records to the final table is your best option
> (in general it is better to do a two-step with a staging table, since you
> can readily use PostgreSQL to perform any intermediate translations). As
> for the import itself,

David,

   I presume what you call a staging table is what I refer to as a copy of
the main table, but with no key attribute.
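
   For concreteness, something like this is what I picture, with made-up
table and column names:

  -- Made-up names: "readings" is the main table (with a serial key);
  -- "readings_staging" holds the same data columns but no key column.
  CREATE TABLE readings_staging (
      site        text,
      sample_date date,
      value       numeric
  );

  -- Load the .csv file into the staging table.
  COPY readings_staging FROM '/path/to/data.csv' CSV HEADER;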

   Writing the statement to delete from the staging table those rows that
already exist in the main table is where I'm open to suggestions.
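
   Something of this shape is what I imagine, using the same invented names
(and with the caveat that columns allowing NULLs would need IS NOT DISTINCT
FROM rather than =):

  -- Remove from staging every row that already exists in the main table.
  -- With no natural key, all of the data columns are compared.
  DELETE FROM readings_staging s
  WHERE EXISTS (
      SELECT 1
      FROM   readings m
      WHERE  m.site        = s.site
      AND    m.sample_date = s.sample_date
      AND    m.value       = s.value
  );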

> In this case I would just import the data to a staging table without any
> kind of artificial key, just the true key,

   There is no true key, only an artificial key so I can ensure that rows are
unique. That's in the main table with the 50K rows. No key column in the
.csv file.
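
   Once the staging table holds only new rows, I expect to move them into
the main table and let the serial column supply the artificial key, roughly:

  -- Move the remaining (new) rows into the main table; the serial key
  -- column is omitted so it is assigned automatically.
  INSERT INTO readings (site, sample_date, value)
  SELECT site, sample_date, value
  FROM   readings_staging;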

Thanks,

Rich


