Thread: Avoiding timeline generation

Avoiding timeline generation

From

Daniel Farina

Date:

25 March 2011, 01:00:58

Hello List,

I have a couple of use cases that are important to me, but my reading
of xlog.c suggests I'm stuck between a rock and a hard place. Or, I am
missing some commonly used pattern -- forgive me in that case. I am
reading 9.0.3 when making these determinations.

Here is the mechanism: I want to author a recovery.conf to perform
some amount of restore_command or streaming replication based
recovery, but I do *not* want to generate a new timeline. Rather, I
want to stay in hot standby mode to allow read-only connections.
Right now, at xlog.c:6346, we have code like this that is run after
zero to many WAL segments have been applied:

/* * Consider whether we need to assign a new timeline ID. * * If we are doing an archive recovery, we always assign a
newID. This * handles a couple of issues. If we stopped short of the end of WAL * during recovery, then we are
clearlygenerating a new timeline and must * assign it a unique new ID. Even if we ran to the end, modifying the *
currentlast segment is problematic because it may result in trying to * overwrite an already-archived copy of that
segment,and we encourage * DBAs to make their archive_commands reject that. We can dodge the * problem by making the
newactive segment have a new timeline ID. * * In a normal crash recovery, we can just extend the timeline we were in.
*/if(InArchiveRecovery){ ThisTimeLineID = findNewestTimeLine(recoveryTargetTLI) + 1; ereport(LOG,
(errmsg("selectednew timeline ID: %u", ThisTimeLineID))); writeTimeLineHistory(ThisTimeLineID, recoveryTargetTLI,
curFileTLI, endLogId, endLogSeg);}

InArchiveRecovery gets set to "true" as soon as
readRecoveryCommandFile completes basically normally, and it looks as
though that will ensure we will get a new timeline. If one tries a
bizarre hack, like ensuring the restore_command does not terminate,
one never finishes recovery -- as one may expect -- and one cannot
connect to the server -- which one may not expect is necessarily the
case presuming hot standby, if the server was terminated cleanly.

The things I want to do with the ability to suppress a new timeline:

* Offline WAL application -- I want to be able to bring up a second
server, perform some amount of point in time recovery, and then stop
and archive. It would be nice to support read-only queries in this
case to test the recovered database. The goal of this is to reduce
recovery time in a disaster scenario without tying up resources on a
live server.

* The ability to quiesce a system by bringing it into read-only state
that generates no new WAL while still being able to ship old WAL.
This is useful in switchover scenarios, and is a cousin of cascading
replication support -- whereby a hot standby may also act as a
WAL-relay. The need for cascading is mitigated by having one's own
WAL archiving machinery, but even in absence of that it is highly
desirable to continue to allow read-only access to the database while
guaranteeing no new WAL is generated.

There's also the possibility that someone(s) here may know the
timeline and WAL application code with enough confidence that I can go
in there and do some surgery to un-do a new timeline as a workaround
reliably, knowing that my data directory is fully intact for more
accurate restorations. Or, perhaps bludgeoning the system with
non-writable UNIX permissions may be acceptable.

Please relate your thoughts...

Thanks,

--
fdr

Re: Avoiding timeline generation

From

Heikki Linnakangas

Date:

25 March 2011, 07:39:05

On 25.03.2011 03:00, Daniel Farina wrote:
> Here is the mechanism:  I want to author a recovery.conf to perform
> some amount of restore_command or streaming replication based
> recovery, but I do *not* want to generate a new timeline.  Rather, I
> want to stay in hot standby mode to allow read-only connections.

That's exactly what the standby mode is for. Add "standby_mode=on" to 
recovery.conf, and the server will do exactly that.

Perhaps the documentation is not clear on this. Any suggestions on how 
to improve that?

> InArchiveRecovery gets set to "true" as soon as
> readRecoveryCommandFile completes basically normally, and it looks as
> though that will ensure we will get a new timeline.  If one tries a
> bizarre hack, like ensuring the restore_command does not terminate,
> one never finishes recovery

That's what pg_standby does. That was the only option before 
standby_mode was introduced, in version 9.0, although we didn't have hot 
standby until 9.0 either.

> -- as one may expect -- and one cannot
> connect to the server -- which one may not expect is necessarily the
> case presuming hot standby, if the server was terminated cleanly.

That's not true. As long as you enable hot standby, the server will 
accept connections while restore command is running.

> The things I want to do with the ability to suppress a new timeline:
>
> * Offline WAL application -- I want to be able to bring up a second
> server, perform some amount of point in time recovery, and then stop
> and archive.  It would be nice to support read-only queries in this
> case to test the recovered database.  The goal of this is to reduce
> recovery time in a disaster scenario without tying up resources on a
> live server.

Yep, that can be done with standby_mode=on.

> * The ability to quiesce a system by bringing it into read-only state
> that generates no new WAL while still being able to ship old WAL.

In theory it should be possible to stop a server, put it into hot 
standby mode by creating a recovery.conf file, and restart, but it won't 
try ship the old WAL after that. When you stop a server it will try to 
archive all existing WAL before exiting, though.

--   Heikki Linnakangas  EnterpriseDB   http://www.enterprisedb.com

Re: Avoiding timeline generation

From

Daniel Farina

Date:

25 March 2011, 07:57:42

On Fri, Mar 25, 2011 at 12:38 AM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
> On 25.03.2011 03:00, Daniel Farina wrote:
>>
>> Here is the mechanism:  I want to author a recovery.conf to perform
>> some amount of restore_command or streaming replication based
>> recovery, but I do *not* want to generate a new timeline.  Rather, I
>> want to stay in hot standby mode to allow read-only connections.
>
> That's exactly what the standby mode is for. Add "standby_mode=on" to
> recovery.conf, and the server will do exactly that.
>
> Perhaps the documentation is not clear on this. Any suggestions on how to
> improve that?

I was actually pretty well aware of this option, if that is the case,
I fat-fingered something or had a thinko (mental bit flip?) and then
what I thought I knew about standby_mode=on was invalidated (perhaps
incorrectly).  I will confirm tomorrow.

--
fdr

Re: Avoiding timeline generation

From

Robert Haas

Date:

25 March 2011, 11:54:55

On Mar 24, 2011, at 9:00 PM, Daniel Farina <daniel@heroku.com> wrote:
> * Offline WAL application -- I want to be able to bring up a second
> server, perform some amount of point in time recovery, and then stop
> and archive.  It would be nice to support read-only queries in this
> case to test the recovered database.  The goal of this is to reduce
> recovery time in a disaster scenario without tying up resources on a
> live server.

The new 9.1 feature paise_at_recovery_target seems like it might be what you need here.

...Robert