Building a database from a flat file - Mailing list pgsql-sql

From	Casey T. Deccio
Subject	Building a database from a flat file
Date	March 3, 2005 03:17:22
Msg-id	1109808921.12430.75.camel@boomerang.ran.sandia.gov
Responses	Re: Building a database from a flat file Re: Building a database from a flat file
List	pgsql-sql

A database I am currently using is built and updated periodically from a
flat CSV file (the situation is rather unfortunate, but that's all I
have right now).  The schema I use is more complex than the flat file,
so I follow a process to populate the tables with the data from the
file.  First I slurp the whole file into one temporary table, whose
columns correspond to the columns in the file.  Then I DELETE all the
existing rows from the tables in the schema and perform a series of
queries on that table to INSERT and UPDATE rows in the tables that are
in the schema.  Then I DELETE the data from the temporary table.  I do
it this way, rather than trying to synchronize it, because of the
inconsistencies and redundancies in the flat file.

There is more than one problem with this, but the largest is that I
would like to perform this whole database rebuild within one
transaction, so other processes that need to access the database can do
so without noticing the disturbance.  However, performing this set of
steps (everything except populating the temporary table) within a single
transaction takes a long time--over an hour in some cases.
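
For readers who want something concrete, the rebuild described above can be
sketched roughly as follows.  This is only an illustration, not the code
actually in use: the table names (staging, owner, host), the two-column file
layout, and the sample data are all hypothetical, and Python's built-in
sqlite3 module stands in for PostgreSQL so the sketch can run on its own.

```python
import csv
import io
import sqlite3

# Hypothetical flat file with a redundant row, standing in for the real CSV.
FLAT_CSV = "host,owner\nalpha,casey\nalpha,casey\nbeta,ian\n"

# Hypothetical schema: one staging table mirroring the file's columns,
# plus two normalized tables that are rebuilt from it.
SCHEMA = """
CREATE TABLE staging (host TEXT, owner TEXT);
CREATE TABLE owner (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
CREATE TABLE host (name TEXT PRIMARY KEY,
                   owner_id INTEGER REFERENCES owner(id));
"""


def load_staging(conn, csv_text):
    """Slurp the whole file into the staging table (outside the transaction)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [(r["host"], r["owner"]) for r in reader]
    conn.executemany("INSERT INTO staging (host, owner) VALUES (?, ?)", rows)


def rebuild(conn):
    """Delete and repopulate the schema tables inside one transaction."""
    conn.execute("BEGIN")
    try:
        # Remove the existing rows (child table first).
        conn.execute("DELETE FROM host")
        conn.execute("DELETE FROM owner")
        # Repopulate from staging; DISTINCT absorbs the file's redundancy.
        conn.execute("INSERT INTO owner (name) SELECT DISTINCT owner FROM staging")
        conn.execute(
            "INSERT INTO host (name, owner_id) "
            "SELECT DISTINCT s.host, o.id FROM staging s "
            "JOIN owner o ON o.name = s.owner"
        )
        # Empty the staging table for the next run.
        conn.execute("DELETE FROM staging")
        conn.execute("COMMIT")
    except Exception:
        # Any failure leaves the previous data untouched.
        conn.execute("ROLLBACK")
        raise


# isolation_level=None disables implicit transactions, so BEGIN/COMMIT are explicit.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.executescript(SCHEMA)
load_staging(conn, FLAT_CSV)
rebuild(conn)
hosts = conn.execute(
    "SELECT h.name, o.name FROM host h "
    "JOIN owner o ON o.id = h.owner_id ORDER BY h.name"
).fetchall()
```

Because the DELETEs and INSERTs sit between one BEGIN and one COMMIT, a
failure partway through rolls everything back and the previous contents stay
in place.  How concurrent readers see the tables while that transaction is
open depends on the database engine, so the sketch says nothing about the
timing problem itself.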

What are some suggestions for improving performance when replacing one
set of data in a schema with another?

Casey