Home > mailing lists

Re: concurrent COPY performance - Mailing list pgsql-hackers

From	Andrew Dunstan
Subject	Re: concurrent COPY performance
Date	June 16, 2009 18:49:47
Msg-id	4A381375.2070906@dunslane.net Whole thread Raw
In response to	Re: concurrent COPY performance (Merlin Moncure <mmoncure@gmail.com>)
Responses	Re: concurrent COPY performance
List	pgsql-hackers

Tree view


Merlin Moncure wrote:
> On Tue, Jun 16, 2009 at 12:47 PM, Stefan
> Kaltenbrunner<stefan@kaltenbrunner.cc> wrote:
>   
>> Hi!
>>
>> I have been doing some bulk loading testing recently - mostly with a focus
>> on answering why we are "only" getting a (max of) cores/2(up to around 8
>> cores even less with more) speedup using parallel restore.
>> What I found is that on some fast IO-subsystem we are CPU bottlenecked on
>> concurrent copy which is able to utilize WAL bypass (and scale up to around
>> cores/2) and performance without wal bypass is very bad.
>> In the WAL logged case we are only able to get a 50% speedup using the
>> second process already and we are never able to scale better than 3x (up to
>> 8 cores) and performance degrades even after that point.
>>     
>
> how are you bypassing wal?  do I read this properly that on your 8
> core system you are getting 4x speedup with wal bypass and 3x speedup
> without?
>   

If a table is created or truncated in the same transaction that does the 
load, and archiving is not on, the COPY is not WALed. That is why 
parallel restore wraps the COPY in a transaction and precedes it with a 
TRUNCATE if it created the table.

cheers

andrew

pgsql-hackers by date:

From: Merlin Moncure
Date: 16 June 2009, 18:33:16
Subject: Re: concurrent COPY performance

From: "Kevin Grittner"
Date: 16 June 2009, 19:14:47
Subject: Re: concurrent COPY performance

Re: concurrent COPY performance - Mailing list pgsql-hackers

Previous

Next