Thread: Hot Standby: Startup at shutdown checkpoint

Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

06 April 2010, 06:25:28

Initial patch. I will be testing over next day. No commit before at
least midday on Wed 7 Apr.

The existing call to PrescanPreparedTransactions() looks correct to me
but the comment is wrong. I will change that also, if we agree.

--
 Simon Riggs           www.2ndQuadrant.com

Attachment

startup_at_shutdown_checkpoint.patch

Re: Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

08 April 2010, 07:17:38

On Tue, 2010-04-06 at 10:22 +0100, Simon Riggs wrote:

> Initial patch. I will be testing over next day. No commit before at
> least midday on Wed 7 Apr.

Various previous discussions sidelined a very important point: what
exactly does it mean to "start recovery from a shutdown checkpoint"?

If standby_mode is enabled and there is no source of WAL, then we get a
stream of messages saying

LOG:  record with zero length at 0/C000088
...

but most importantly we never get to the main recovery loop, so Hot
Standby never gets to start at all. We can't keep retrying the request
for WAL and at the same time enter the retry loop, executing lots of
things that expect non-NULL pointers using a NULL xlog pointer.

What we are asking for here is a completely new state: the database is
not "in recovery" - by definition there is nothing at all to recover.

The following patch adds "Snapshot Mode", a very simple variation on the
existing code - emphasis on the "simple":

LOG:  entering snapshot mode
LOG:  record with zero length at 0/C000088
LOG:  consistent recovery state reached at 0/C000088
LOG:  database system is ready to accept read only connections

this mode does *not* continually check to see if new WAL files have been
added. Startup just sits and waits, backends allowed. If a trigger file
is specified, then we can leave recovery. Otherwise Startup process just
sits doing nothing.

There's possibly an argument for inventing some more special modes where
we do allow read only connections but don't start the bgwriter. I don't
personally wish to do this at this stage of the release cycle. The
attached patch is non-invasive and safe and I want to leave it at that.

I will be committing later today, unless major objections, but I ask you
to read the patch before you sharpen your pen. It's simple.

--
 Simon Riggs           www.2ndQuadrant.com

Attachment

snapshot_mode.patch

Re: Hot Standby: Startup at shutdown checkpoint

From

Heikki Linnakangas

Date:

08 April 2010, 07:33:51

Simon Riggs wrote:
> On Tue, 2010-04-06 at 10:22 +0100, Simon Riggs wrote:
> 
>> Initial patch. I will be testing over next day. No commit before at
>> least midday on Wed 7 Apr.
> 
> Various previous discussions sidelined a very important point: what
> exactly does it mean to "start recovery from a shutdown checkpoint"?

Hot standby should be possible as soon we know that the database is
consistent. That is, as soon as we've replayed WAL past the
minRecoveryPoint/backupStartPoint point indicated in pg_control.

> If standby_mode is enabled and there is no source of WAL, then we get a
> stream of messages saying
> 
> LOG:  record with zero length at 0/C000088
> ...
> 
> but most importantly we never get to the main recovery loop, so Hot
> Standby never gets to start at all. We can't keep retrying the request
> for WAL and at the same time enter the retry loop, executing lots of
> things that expect non-NULL pointers using a NULL xlog pointer.

You mean it can't find even the checkpoint record to start replaying? I
think the behavior in that scenario is fine as it is. The database isn't
consistent (or at least we can't know if it is, because we don't know
the redo pointer) until you read and replay the first checkpoint record.

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

08 April 2010, 08:32:37

On Thu, 2010-04-08 at 13:33 +0300, Heikki Linnakangas wrote:

> > If standby_mode is enabled and there is no source of WAL, then we get a
> > stream of messages saying
> > 
> > LOG:  record with zero length at 0/C000088
> > ...
> > 
> > but most importantly we never get to the main recovery loop, so Hot
> > Standby never gets to start at all. We can't keep retrying the request
> > for WAL and at the same time enter the retry loop, executing lots of
> > things that expect non-NULL pointers using a NULL xlog pointer.
> 
> You mean it can't find even the checkpoint record to start replaying? 

Clearly I don't mean that. Otherwise it wouldn't be "start from a
shutdown checkpoint". I think you are misunderstanding me.

Let me explain in more detail though please also read the patch before
replying, if you do.

The patch I submitted at top of this thread works for allowing Hot
Standby during recovery. Yes, of course that occurs when the database is
consistent. The trick is to get recovery to the point where it can be
enabled. The second patch on this thread presents a way to get the
database to that point; it touches some of the other recovery code that
you and Masao have worked on. We *must* touch that code if we are to
enable Hot Standby in the way you desire.

In StartupXlog() when we get to the point where we "Find the first
record that logically follows the checkpoint", in the current code
ReadRecord() loops forever, spitting out
LOG: record with zero length at 0/C000088
...

That prevents us from going further down StartupXLog() to the point
where we start the InRedo loop and hence start hot standby. As long as
we retry we cannot progress further: this is the main problem.

So in the patch, I have modified the retry test in ReadRecord() so it no
longer retries iff there is no WAL source defined. Now, when
ReadRecord() exits, record == NULL at that point and so we do not (and
cannot) enter the redo loop.

So I have introduced the new mode ("snapshot mode") to enter hot standby
anyway. That avoids us having to screw around with the loop logic for
redo. I don't see any need to support the case of where we have no WAL
source defined, yet we want Hot Standby but we also want to allow
somebody to drop a WAL file into pg_xlog at some future point. That has
no use case of value AFAICS and is too complex to add at this stage of
the release cycle.

-- Simon Riggs           www.2ndQuadrant.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Robert Haas

Date:

08 April 2010, 10:49:32

On Thu, Apr 8, 2010 at 6:16 AM, Simon Riggs <simon@2ndquadrant.com> wrote:
> If standby_mode is enabled and there is no source of WAL, then we get a
> stream of messages saying
>
> LOG:  record with zero length at 0/C000088
> ...
>
> but most importantly we never get to the main recovery loop, so Hot
> Standby never gets to start at all. We can't keep retrying the request
> for WAL and at the same time enter the retry loop, executing lots of
> things that expect non-NULL pointers using a NULL xlog pointer.

This is pretty much a corner case, so I don't think it's a good idea
to add a new mode to handle it.  It also seems like it would be pretty
inconsistent if we allow WAL to be dropped in pg_xlog, but only if we
are also doing archive recovery or streaming replication.  If we can't
support this case with the same code path we use otherwise, I think we
should revert to disallowing it.

Having said that, I guess I don't understand how having a source of
WAL solves the problem described above.  Do we always have to read at
least 1 byte of WAL from either SR or the archive before starting up?
If not, why do we need to do so here?

...Robert

Re: Hot Standby: Startup at shutdown checkpoint

From

Heikki Linnakangas

Date:

08 April 2010, 12:35:42

Simon Riggs wrote:
> In StartupXlog() when we get to the point where we "Find the first
> record that logically follows the checkpoint", in the current code
> ReadRecord() loops forever, spitting out
> LOG: record with zero length at 0/C000088
> ...
>
> That prevents us from going further down StartupXLog() to the point
> where we start the InRedo loop and hence start hot standby. As long as
> we retry we cannot progress further: this is the main problem.
>
> So in the patch, I have modified the retry test in ReadRecord() so it no
> longer retries iff there is no WAL source defined. Now, when
> ReadRecord() exits, record == NULL at that point and so we do not (and
> cannot) enter the redo loop.

Oh, I see.

> So I have introduced the new mode ("snapshot mode") to enter hot standby
> anyway. That avoids us having to screw around with the loop logic for
> redo. I don't see any need to support the case of where we have no WAL
> source defined, yet we want Hot Standby but we also want to allow
> somebody to drop a WAL file into pg_xlog at some future point. That has
> no use case of value AFAICS and is too complex to add at this stage of
> the release cycle.

You don't need a new mode for that. Just do the same "are we consistent
now?" check you do in the loop once before calling ReadRecord to fetch
the record that follows the checkpoint pointer. Attached is a patch to
show what I mean. We just need to let postmaster know that recovery has
started a bit earlier, right after processing the checkpoint record, not
delaying it until we've read the first record after it.

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com
*** a/src/backend/access/transam/twophase.c
--- b/src/backend/access/transam/twophase.c
***************
*** 1719,1724 **** PrescanPreparedTransactions(TransactionId **xids_p, int *nxids_p)
--- 1719,1806 ----
  }

  /*
+  * StandbyRecoverPreparedTransactions
+  *
+  * Scan the pg_twophase directory and setup all the required information to
+  * allow standby queries to treat prepared transactions as still active.
+  * This is never called at the end of recovery - we use
+  * RecoverPreparedTransactions() at that point.
+  *
+  * Currently we simply call SubTransSetParent() for any subxids of prepared
+  * transactions.
+  */
+ void
+ StandbyRecoverPreparedTransactions(bool can_overwrite)
+ {
+     DIR           *cldir;
+     struct dirent *clde;
+
+     cldir = AllocateDir(TWOPHASE_DIR);
+     while ((clde = ReadDir(cldir, TWOPHASE_DIR)) != NULL)
+     {
+         if (strlen(clde->d_name) == 8 &&
+             strspn(clde->d_name, "0123456789ABCDEF") == 8)
+         {
+             TransactionId xid;
+             char       *buf;
+             TwoPhaseFileHeader *hdr;
+             TransactionId *subxids;
+             int            i;
+
+             xid = (TransactionId) strtoul(clde->d_name, NULL, 16);
+
+             /* Already processed? */
+             if (TransactionIdDidCommit(xid) || TransactionIdDidAbort(xid))
+             {
+                 ereport(WARNING,
+                         (errmsg("removing stale two-phase state file \"%s\"",
+                                 clde->d_name)));
+                 RemoveTwoPhaseFile(xid, true);
+                 continue;
+             }
+
+             /* Read and validate file */
+             buf = ReadTwoPhaseFile(xid, true);
+             if (buf == NULL)
+             {
+                 ereport(WARNING,
+                       (errmsg("removing corrupt two-phase state file \"%s\"",
+                               clde->d_name)));
+                 RemoveTwoPhaseFile(xid, true);
+                 continue;
+             }
+
+             /* Deconstruct header */
+             hdr = (TwoPhaseFileHeader *) buf;
+             if (!TransactionIdEquals(hdr->xid, xid))
+             {
+                 ereport(WARNING,
+                       (errmsg("removing corrupt two-phase state file \"%s\"",
+                               clde->d_name)));
+                 RemoveTwoPhaseFile(xid, true);
+                 pfree(buf);
+                 continue;
+             }
+
+             /*
+              * Examine subtransaction XIDs ... they should all follow main
+              * XID, and they may force us to advance nextXid.
+              */
+             subxids = (TransactionId *)
+                 (buf + MAXALIGN(sizeof(TwoPhaseFileHeader)));
+             for (i = 0; i < hdr->nsubxacts; i++)
+             {
+                 TransactionId subxid = subxids[i];
+
+                 Assert(TransactionIdFollows(subxid, xid));
+                 SubTransSetParent(xid, subxid, can_overwrite);
+             }
+         }
+     }
+     FreeDir(cldir);
+ }
+
+ /*
   * RecoverPreparedTransactions
   *
   * Scan the pg_twophase directory and reload shared-memory state for each
*** a/src/backend/access/transam/xlog.c
--- b/src/backend/access/transam/xlog.c
***************
*** 494,499 **** static XLogRecPtr minRecoveryPoint;        /* local copy of
--- 494,501 ----
                                           * ControlFile->minRecoveryPoint */
  static bool updateMinRecoveryPoint = true;

+ static bool reachedMinRecoveryPoint = false;
+
  static bool InRedo = false;

  /*
***************
*** 547,552 **** static void ValidateXLOGDirectoryStructure(void);
--- 549,555 ----
  static void CleanupBackupHistory(void);
  static void UpdateMinRecoveryPoint(XLogRecPtr lsn, bool force);
  static XLogRecord *ReadRecord(XLogRecPtr *RecPtr, int emode, bool fetching_ckpt);
+ static void CheckRecoveryConsistency(void);
  static bool ValidXLOGHeader(XLogPageHeader hdr, int emode);
  static XLogRecord *ReadCheckpointRecord(XLogRecPtr RecPtr, int whichChkpt);
  static List *readTimeLineHistory(TimeLineID targetTLI);
***************
*** 5470,5476 **** StartupXLOG(void)
      uint32        freespace;
      TransactionId oldestActiveXID;
      bool        bgwriterLaunched = false;
-     bool        backendsAllowed = false;

      /*
       * Read control file and check XLOG status looks valid.
--- 5473,5478 ----
***************
*** 5718,5723 **** StartupXLOG(void)
--- 5720,5728 ----
      {
          int            rmid;

+         /* use volatile pointer to prevent code rearrangement */
+         volatile XLogCtlData *xlogctl = XLogCtl;
+
          /*
           * Update pg_control to show that we are recovering and to show the
           * selected checkpoint as the place we are starting from. We also mark
***************
*** 5809,5814 **** StartupXLOG(void)
--- 5814,5846 ----
              StartupMultiXact();

              ProcArrayInitRecoveryInfo(oldestActiveXID);
+
+             /*
+              * If we're beginning at a shutdown checkpoint, we know that
+              * nothing was running on the master at this point. So fake-up
+              * an empty running-xacts record and use that here and now.
+              * Recover additional standby state for prepared transactions.
+              */
+             if (wasShutdown)
+             {
+                 RunningTransactionsData running;
+
+                 /*
+                  * Construct a RunningTransactions snapshot representing a shut
+                  * down server, with only prepared transactions still alive.
+                  * We're never overflowed at this point because all subxids
+                  * are listed with their parent prepared transactions.
+                  */
+                 running.xcnt = nxids;
+                 running.subxid_overflow = false;
+                 running.nextXid = checkPoint.nextXid;
+                 running.oldestRunningXid = oldestActiveXID;
+                 running.xids = xids;
+
+                 ProcArrayApplyRecoveryInfo(&running);
+
+                 StandbyRecoverPreparedTransactions(false);
+             }
          }

          /* Initialize resource managers */
***************
*** 5818,5823 **** StartupXLOG(void)
--- 5850,5885 ----
                  RmgrTable[rmid].rm_startup();
          }

+         /* initialize shared replayEndRecPtr and recoveryLastRecPtr */
+         SpinLockAcquire(&xlogctl->info_lck);
+         xlogctl->replayEndRecPtr = ReadRecPtr;
+         xlogctl->recoveryLastRecPtr = ReadRecPtr;
+         SpinLockRelease(&xlogctl->info_lck);
+
+         /*
+          * Let postmaster know we've started redo now, so that it can
+          * launch bgwriter to perform restartpoints.  We don't bother
+          * during crash recovery as restartpoints can only be performed
+          * during archive recovery.  And we'd like to keep crash recovery
+          * simple, to avoid introducing bugs that could you from
+          * recovering after crash.
+          *
+          * After this point, we can no longer assume that we're the only
+          * process in addition to postmaster!  Also, fsync requests are
+          * subsequently to be handled by the bgwriter, not locally.
+          */
+         if (InArchiveRecovery && IsUnderPostmaster)
+         {
+             SetForwardFsyncRequests();
+             SendPostmasterSignal(PMSIGNAL_RECOVERY_STARTED);
+             bgwriterLaunched = true;
+         }
+
+         /*
+          * Allow read-only connections immediately if we're consistent already.
+          */
+         CheckRecoveryConsistency();
+
          /*
           * Find the first record that logically follows the checkpoint --- it
           * might physically precede it, though.
***************
*** 5837,5854 **** StartupXLOG(void)
          {
              bool        recoveryContinue = true;
              bool        recoveryApply = true;
-             bool        reachedMinRecoveryPoint = false;
              ErrorContextCallback errcontext;

-             /* use volatile pointer to prevent code rearrangement */
-             volatile XLogCtlData *xlogctl = XLogCtl;
-
-             /* initialize shared replayEndRecPtr and recoveryLastRecPtr */
-             SpinLockAcquire(&xlogctl->info_lck);
-             xlogctl->replayEndRecPtr = ReadRecPtr;
-             xlogctl->recoveryLastRecPtr = ReadRecPtr;
-             SpinLockRelease(&xlogctl->info_lck);
-
              InRedo = true;

              ereport(LOG,
--- 5899,5906 ----
***************
*** 5856,5880 **** StartupXLOG(void)
                              ReadRecPtr.xlogid, ReadRecPtr.xrecoff)));

              /*
-              * Let postmaster know we've started redo now, so that it can
-              * launch bgwriter to perform restartpoints.  We don't bother
-              * during crash recovery as restartpoints can only be performed
-              * during archive recovery.  And we'd like to keep crash recovery
-              * simple, to avoid introducing bugs that could you from
-              * recovering after crash.
-              *
-              * After this point, we can no longer assume that we're the only
-              * process in addition to postmaster!  Also, fsync requests are
-              * subsequently to be handled by the bgwriter, not locally.
-              */
-             if (InArchiveRecovery && IsUnderPostmaster)
-             {
-                 SetForwardFsyncRequests();
-                 SendPostmasterSignal(PMSIGNAL_RECOVERY_STARTED);
-                 bgwriterLaunched = true;
-             }
-
-             /*
               * main redo apply loop
               */
              do
--- 5908,5913 ----
***************
*** 5903,5934 **** StartupXLOG(void)
                  /* Handle interrupt signals of startup process */
                  HandleStartupProcInterrupts();

!                 /*
!                  * Have we passed our safe starting point?
!                  */
!                 if (!reachedMinRecoveryPoint &&
!                     XLByteLE(minRecoveryPoint, EndRecPtr) &&
!                     XLogRecPtrIsInvalid(ControlFile->backupStartPoint))
!                 {
!                     reachedMinRecoveryPoint = true;
!                     ereport(LOG,
!                         (errmsg("consistent recovery state reached at %X/%X",
!                                 EndRecPtr.xlogid, EndRecPtr.xrecoff)));
!                 }
!
!                 /*
!                  * Have we got a valid starting snapshot that will allow
!                  * queries to be run? If so, we can tell postmaster that the
!                  * database is consistent now, enabling connections.
!                  */
!                 if (standbyState == STANDBY_SNAPSHOT_READY &&
!                     !backendsAllowed &&
!                     reachedMinRecoveryPoint &&
!                     IsUnderPostmaster)
!                 {
!                     backendsAllowed = true;
!                     SendPostmasterSignal(PMSIGNAL_RECOVERY_CONSISTENT);
!                 }

                  /*
                   * Have we reached our recovery target?
--- 5936,5943 ----
                  /* Handle interrupt signals of startup process */
                  HandleStartupProcInterrupts();

!                 /* Allow read-only connections if we're consistent now */
!                 CheckRecoveryConsistency();

                  /*
                   * Have we reached our recovery target?
***************
*** 6278,6283 **** StartupXLOG(void)
--- 6287,6330 ----
  }

  /*
+  * Checks if recovery has reached a consistent state. When consistency is
+  * reached and we have a valid starting standby snapshot, tell postmaster
+  * that it can start accepting read-only connections.
+  */
+ static void
+ CheckRecoveryConsistency(void)
+ {
+     static bool        backendsAllowed = false;
+
+     /*
+      * Have we passed our safe starting point?
+      */
+     if (!reachedMinRecoveryPoint &&
+         XLByteLE(minRecoveryPoint, EndRecPtr) &&
+         XLogRecPtrIsInvalid(ControlFile->backupStartPoint))
+     {
+         reachedMinRecoveryPoint = true;
+         ereport(LOG,
+                 (errmsg("consistent recovery state reached at %X/%X",
+                         EndRecPtr.xlogid, EndRecPtr.xrecoff)));
+     }
+
+     /*
+      * Have we got a valid starting snapshot that will allow
+      * queries to be run? If so, we can tell postmaster that the
+      * database is consistent now, enabling connections.
+      */
+     if (standbyState == STANDBY_SNAPSHOT_READY &&
+         !backendsAllowed &&
+         reachedMinRecoveryPoint &&
+         IsUnderPostmaster)
+     {
+         backendsAllowed = true;
+         SendPostmasterSignal(PMSIGNAL_RECOVERY_CONSISTENT);
+     }
+ }
+
+ /*
   * Is the system still in recovery?
   *
   * Unlike testing InRecovery, this works in any process that's connected to
***************
*** 7521,7533 **** xlog_redo(XLogRecPtr lsn, XLogRecord *record)
          if (standbyState != STANDBY_DISABLED)
              CheckRequiredParameterValues(checkPoint);

          if (standbyState >= STANDBY_INITIALIZED)
          {
              /*
!              * Remove stale transactions, if any.
               */
!             ExpireOldKnownAssignedTransactionIds(checkPoint.nextXid);
!             StandbyReleaseOldLocks(checkPoint.nextXid);
          }

          /* ControlFile->checkPointCopy always tracks the latest ckpt XID */
--- 7568,7601 ----
          if (standbyState != STANDBY_DISABLED)
              CheckRequiredParameterValues(checkPoint);

+         /*
+          * If we're beginning at a shutdown checkpoint, we know that
+          * nothing was running on the master at this point. So fake-up
+          * an empty running-xacts record and use that here and now.
+          * Recover additional standby state for prepared transactions.
+          */
          if (standbyState >= STANDBY_INITIALIZED)
          {
+             TransactionId *xids;
+             int            nxids;
+             TransactionId oldestActiveXID = PrescanPreparedTransactions(&xids, &nxids);
+             RunningTransactionsData running;
+
              /*
!              * Construct a RunningTransactions snapshot representing a shut
!              * down server, with only prepared transactions still alive.
!              * We're never overflowed at this point because all subxids
!              * are listed with their parent prepared transactions.
               */
!             running.xcnt = nxids;
!             running.subxid_overflow = false;
!             running.nextXid = checkPoint.nextXid;
!             running.oldestRunningXid = oldestActiveXID;
!             running.xids = xids;
!
!             ProcArrayApplyRecoveryInfo(&running);
!
!             StandbyRecoverPreparedTransactions(true);
          }

          /* ControlFile->checkPointCopy always tracks the latest ckpt XID */
*** a/src/include/access/twophase.h
--- b/src/include/access/twophase.h
***************
*** 44,49 **** extern bool StandbyTransactionIdIsPrepared(TransactionId xid);
--- 44,50 ----

  extern TransactionId PrescanPreparedTransactions(TransactionId **xids_p,
                              int *nxids_p);
+ extern void StandbyRecoverPreparedTransactions(bool can_overwrite);
  extern void RecoverPreparedTransactions(void);

  extern void RecreateTwoPhaseFile(TransactionId xid, void *content, int len);

Re: Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

08 April 2010, 12:58:45

On Thu, 2010-04-08 at 18:35 +0300, Heikki Linnakangas wrote:
> 
> > So I have introduced the new mode ("snapshot mode") to enter hot
> standby
> > anyway. That avoids us having to screw around with the loop logic
> for
> > redo. I don't see any need to support the case of where we have no
> WAL
> > source defined, yet we want Hot Standby but we also want to allow
> > somebody to drop a WAL file into pg_xlog at some future point. That
> has
> > no use case of value AFAICS and is too complex to add at this stage
> of
> > the release cycle.
> 
> You don't need a new mode for that. Just do the same "are we
> consistent now?" check you do in the loop once before calling
> ReadRecord to fetch the record that follows the checkpoint pointer.
> Attached is a patch to show what I mean. We just need to let
> postmaster know that recovery has started a bit earlier, right after
> processing the checkpoint record, not delaying it until we've read the
> first record after it.

OK, that seems better. I'm happy with that instead.

Have you tested this? Is it ready to commit?

-- Simon Riggs           www.2ndQuadrant.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Heikki Linnakangas

Date:

08 April 2010, 13:02:19

Simon Riggs wrote:
> OK, that seems better. I'm happy with that instead.
> 
> Have you tested this? Is it ready to commit?

Only very briefly. I think the code is ready, but please review and test
to see I didn't miss anything.

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

08 April 2010, 13:21:13

On Thu, 2010-04-08 at 19:02 +0300, Heikki Linnakangas wrote:
> Simon Riggs wrote:
> > OK, that seems better. I'm happy with that instead.
> > 
> > Have you tested this? Is it ready to commit?
> 
> Only very briefly. I think the code is ready, but please review and test
> to see I didn't miss anything.

I'm going to need you to commit this. I'm on holiday now until 14 April,
so its not going to get a retest before then otherwise; its not smart to
commit and then go on holiday, IIRC. 

I've reviewed your changes and they look correct to me; the main chunk
of code is mine and that was tested by me.

-- Simon Riggs           www.2ndQuadrant.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Heikki Linnakangas

Date:

13 April 2010, 11:19:12

Simon Riggs wrote:
> On Thu, 2010-04-08 at 19:02 +0300, Heikki Linnakangas wrote:
>> Simon Riggs wrote:
>>> OK, that seems better. I'm happy with that instead.
>>>
>>> Have you tested this? Is it ready to commit?
>> Only very briefly. I think the code is ready, but please review and test
>> to see I didn't miss anything.
> 
> I'm going to need you to commit this. I'm on holiday now until 14 April,
> so its not going to get a retest before then otherwise; its not smart to
> commit and then go on holiday, IIRC. 

:-)

> I've reviewed your changes and they look correct to me; the main chunk
> of code is mine and that was tested by me.

Ok, committed after fixing an obsoleted comment & other small
editorialization.

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

Re: Hot Standby: Startup at shutdown checkpoint

From

Simon Riggs

Date:

14 April 2010, 07:32:58

On Tue, 2010-04-13 at 17:18 +0300, Heikki Linnakangas wrote:
> I've reviewed your changes and they look correct to me; the main chunk
> > of code is mine and that was tested by me.
> 
> Ok, committed after fixing an obsoleted comment & other small
> editorialization.

Looks good, thanks.

-- Simon Riggs           www.2ndQuadrant.com