Re: hundreds of millions row dBs - Mailing list pgsql-general

From: Tom Lane
Subject: Re: hundreds of millions row dBs
Date:
Msg-id: 13956.1104816742@sss.pgh.pa.us
In response to: Re: hundreds of millions row dBs ("Guy Rouillier" <guyr@masergy.com>)
List: pgsql-general

"Guy Rouillier" <guyr@masergy.com> writes:
> Greer, Doug wrote:
>> I am interested in using Postgresql for a dB of hundreds of
>> millions of rows in several tables.  The COPY command seems to be way
>> too slow.  Is there any bulk import program similar to Oracle's SQL
>> loader for Postgresql? Sincerely,

> We're getting about 64 million rows inserted in about 1.5 hrs into a
> table with a multiple-column primary key - that's the only index.
> That seems pretty good to me - SQL Loader takes about 4 hrs to do the
> same job.

If you're talking about loading into an initially empty database, it's
worth a try to load into bare tables and then create indexes and add
foreign key constraints.  Index build and FK checking are both
significantly faster as "bulk" operations than "incremental".  Don't
forget to pump up sort_mem as much as you can stand in the backend doing
such chores, too.
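
In rough outline, that recipe looks like this (table and column names
here are made up for illustration; the point is just load first, index
and constrain afterwards):

    -- sort_mem is in kB; crank it up for the index builds
    SET sort_mem = 262144;

    -- load the bare table first
    COPY call_detail FROM '/tmp/call_detail.dat';

    -- then build the index and run the FK check as bulk operations
    ALTER TABLE call_detail ADD PRIMARY KEY (account_id, call_time);
    ALTER TABLE call_detail
        ADD CONSTRAINT call_detail_account_fk
        FOREIGN KEY (account_id) REFERENCES account (account_id);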

I have heard of people who would actually drop and recreate indexes
and/or FKs when adding a lot of data to an existing table.
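
Applied to an existing table, that would look roughly like the
following (hypothetical names again; wrapping it in one transaction
keeps other sessions from ever seeing the table unindexed):

    BEGIN;
    -- drop the FK and index before the bulk load
    ALTER TABLE call_detail DROP CONSTRAINT call_detail_account_fk;
    DROP INDEX call_detail_call_time_idx;

    COPY call_detail FROM '/tmp/more_rows.dat';

    -- recreate them as bulk operations
    CREATE INDEX call_detail_call_time_idx ON call_detail (call_time);
    ALTER TABLE call_detail
        ADD CONSTRAINT call_detail_account_fk
        FOREIGN KEY (account_id) REFERENCES account (account_id);
    COMMIT;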

            regards, tom lane
