Re: archive modules - Mailing list pgsql-hackers

From Bossart, Nathan
Subject Re: archive modules
Date
Msg-id F9F7CCA3-489D-479E-941F-84C5C1D1D3CB@amazon.com
Whole thread Raw
In response to Re: archive modules  (Fujii Masao <masao.fujii@oss.nttdata.com>)
Responses Re: archive modules
List pgsql-hackers
On 11/1/21, 9:44 PM, "Fujii Masao" <masao.fujii@oss.nttdata.com> wrote:
> What is the main motivation of this patch? I was thinking that
> it's for parallelizing WAL archiving. But as far as I read
> the patch very briefly, WAL file name is still passed to
> the archive callback function one by one.

The main motivation is provide a way to archive without shelling out.
This reduces the amount of overhead, which can improve archival rate
significantly.  It should also make it easier to archive more safely.
For example, many of the common shell commands used for archiving
won't fsync the data, but it isn't too hard to do so via C.  The
current proposal doesn't introduce any extra infrastructure for
batching or parallelism, but it is probably still possible.  I would
like to eventually add batching, but for now I'm only focused on
introducing basic archive module support.

> Are you planning to extend this mechanism to other WAL
> archiving-related commands like restore_command? I can imagine
> that those who use archive library (rather than shell) would
> like to use the same mechanism for WAL restore.

I would like to do this eventually, but my current proposal is limited
to archive_command.

> I think that it's worth adding this module into core
> rather than handling it as test module. It provides very basic
> WAL archiving feature, but (I guess) it's enough for some users.

Do you think it should go into contrib?

Nathan


pgsql-hackers by date:

Previous
From: Gilles Darold
Date:
Subject: [PATCH] fix references to like_regex
Next
From: Andrew Dunstan
Date:
Subject: Re: enabling FOO=bar arguments to vcregress.pl