Re: parallel pg_restore - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: parallel pg_restore
Date
Msg-id 48D956A3.5030204@dunslane.net
Whole thread Raw
In response to Re: parallel pg_restore  (Simon Riggs <simon@2ndQuadrant.com>)
Responses Re: parallel pg_restore  (Simon Riggs <simon@2ndQuadrant.com>)
Re: parallel pg_restore  (Joshua Drake <jd@commandprompt.com>)
Re: parallel pg_restore  (Dimitri Fontaine <dfontaine@hi-media.com>)
List pgsql-hackers

Simon Riggs wrote:
> On Tue, 2008-09-23 at 12:43 -0700, Joshua Drake wrote:
>   
>> On Tue, 23 Sep 2008 08:44:19 +0100
>> Simon Riggs <simon@2ndQuadrant.com> wrote:
>>
>>     
>>> On Mon, 2008-09-22 at 15:05 -0400, Andrew Dunstan wrote:
>>>
>>>       
>>>> j and m happen to be two of those that are available.
>>>>
>>>> I honestly don't have a terribly strong opinion about what it
>>>> should be called. I can live with jobs or multi-threads.
>>>>         
>>> Perhaps we can use -j for jobs and -m for memory, so we can set memory
>>> available across all threads with a single total value.
>>>
>>> I can live with jobs or multi-threads also, whichever we decide.
>>> Neither one is confusing to explain.
>>>
>>>       
>> Memory? Where did that come from. Andrew is that in your spec?
>>     
>
> No, but it's in mine. As I said upthread, no point in making it more
> parallel than memory allows. Different operations need more/less memory
> than others, so we must think about that also. We can quickly work out
> how big a table is, so we can work out how much memory it will need to
> perform sorts for index builds and thus how many parallel builds can
> sensibly take place.
>
>   

If that ever happens it will certainly not be in this go round.

In fact, we have some anecdotal evidence that the point of dimishing 
returns is not reached until a fairly high degree of parallelism is used 
(Joshua's and my client has been using 24 threads, I believe).

In any case, my agenda goes something like this:
   * get it working with a basic selection algorithm on Unix (nearly     done - keep your eyes open for a patch soon)
*start testing   * get it working on Windows   * improve the selection algorithm   * harden code
 

If we get all that done by November we'll have done well. And we know 
that in some cases just this much can lead to reductions in restore time 
of the order of 80%.

cheers

andrew




pgsql-hackers by date:

Previous
From: Simon Riggs
Date:
Subject: Re: EXEC_BACKEND
Next
From: Simon Riggs
Date:
Subject: Re: parallel pg_restore