Re: Importing Large Amounts of Data - Mailing list pgsql-hackers

From Bruce Momjian
Subject Re: Importing Large Amounts of Data
Date
Msg-id 200204160144.g3G1iQV21731@candle.pha.pa.us
Whole thread Raw
In response to Re: Importing Large Amounts of Data  (Curt Sampson <cjs@cynic.net>)
Responses Re: Importing Large Amounts of Data  (Neil Conway <nconway@klamath.dyndns.org>)
Re: Importing Large Amounts of Data  (Curt Sampson <cjs@cynic.net>)
List pgsql-hackers
Curt Sampson wrote:
> On Mon, 15 Apr 2002, Tom Lane wrote:
> 
> > > I'm not looking for "runs a bit faster;" five percent either way
> > > makes little difference to me. I'm looking for a five-fold performance
> > > increase.
> >
> > You are not going to get it from this; where in the world did you get
> > the notion that data integrity costs that much?
> 
> Um...the fact that MySQL imports the same data five times as fast? :-)
> 
> Note that this is *only* related to bulk-importing huge amounts of
> data. Postgres seems a little bit slower than MySQL at building
> the indicies afterwards, but this would be expected since (probably
> due to higher tuple overhead) the size of the data once in postgres
> is about 75% larger than in MySQL: 742 MB vs 420 MB. I've not done
> any serious testing of query speed, but the bit of toying I've done
> with it shows no major difference.

Can you check your load and see if there is a PRIMARY key on the table
at the time it is being loaded.  In the old days, we created indexes
only after the data was loaded, but when we added PRIMARY key, pg_dump
was creating the table with PRIMARY key then loading it, meaning the
table was being loaded while it had an existing index.  I know we fixed
this recently but I am not sure if it was in 7.2 or not. 

--  Bruce Momjian                        |  http://candle.pha.pa.us pgman@candle.pha.pa.us               |  (610)
853-3000+  If your life is a hard drive,     |  830 Blythe Avenue +  Christ can be your backup.        |  Drexel Hill,
Pennsylvania19026
 


pgsql-hackers by date:

Previous
From: Curt Sampson
Date:
Subject: Re: Importing Large Amounts of Data
Next
From: Peter Eisentraut
Date:
Subject: Re: regexp character class locale awareness patch