Home > mailing lists

Allowing parallel pg_restore from pipe - Mailing list pgsql-hackers

From	Timothy Garnett
Subject	Allowing parallel pg_restore from pipe
Date	April 24, 2013 15:55:37
Msg-id	CAPcyiQ2MNGr21Jvp1ZJLwdtTUeoWgvGHWby1RTnbLkhnALcTDA@mail.gmail.com Whole thread Raw
Responses	Re: Allowing parallel pg_restore from pipe
List	pgsql-hackers

Tree view

Hi All,

Currently the -j option to pg_restore, which allows for parallelization in the restore, can only be used if the input file is a regular file and not, for ex., a pipe. However this is a pretty common occurrence for us (usually in the form of pg_dump | pg_restore to copy an individual database or some tables thereof from one machine to another). While there's no good way to parallelize the data load steps when reading from a pipe, the index and constraint building can still be parallelized and as they are generally CPU bound on our machines we've found quite a bit of speedup from doing so.

Attached is two diffs off of the REL9_2_4 tag that I've been using. The first is a simple change that serially loads the data section before handing off the remainder of the restore to the existing parallelized restore code (the .ALT. diff). The second which gets more parallelization but is a bit more of a change uses the existing dependency analysis code to allow index building etc. to occur in parallel with data loading. The data loading tasks are still performed serially in the main thread, but non-data loading tasks are scheduled in parallel as their dependencies are satisfied (with the caveat that the main thread can only dispatch new tasks between data loads).

Anyways, the question is if people think this is generally useful. If so I can clean up the preferred choice a bit and rebase it off of master, etc.

Tim

Attachment

pgsql-hackers by date:

From: Andres Freund
Date: 24 April 2013, 14:18:21
Subject: Re: high io BUT huge amount of free memory

From: Heikki Linnakangas
Date: 24 April 2013, 16:16:35
Subject: Re: missing time.h include in psql/command.c since the addition of \watch

Allowing parallel pg_restore from pipe - Mailing list pgsql-hackers

Attachment

Previous

Next