Re: Should pg_dump dump larger tables first? - Mailing list pgsql-hackers

From Jeff Janes
Subject Re: Should pg_dump dump larger tables first?
Date
Msg-id CAMkU=1xEMEeSqvLGUMcBjSX5Ag5KNqi0OBoC37+rf6+v33UuJA@mail.gmail.com
Whole thread Raw
In response to Re: Should pg_dump dump larger tables first?  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
On Tue, Jan 29, 2013 at 3:34 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "David Rowley" <dgrowleyml@gmail.com> writes:
>> If pg_dump was to still follow the dependencies of objects, would there be
>> any reason why it shouldn't backup larger tables first?
>
> Pretty much every single discussion/complaint about pg_dump's ordering
> choices has been about making its behavior more deterministic not less
> so.  So I can't imagine such a change would go over well with most folks.
>
> Also, it's far from obvious to me that "largest first" is the best rule
> anyhow; it's likely to be more complicated than that.

From my experience in the non-database world of processing many files
of greatly different sizes in parallel, sorting them so the largest
are scheduled first and smaller ones get "pack" around them is very
successful and very easy.

I agree that best rule surely is more complicated, but probably so
much so that it will never get implemented.

>
> But anyway, the right place to add this sort of consideration is in
> pg_restore --parallel, not pg_dump.

Yeah.

Cheers,

Jeff



pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Strange Windows problem, lock_timeout test request
Next
From: Christopher Browne
Date:
Subject: Re: Should pg_dump dump larger tables first?