pgsql: Fix scenario where streaming standby gets stuck at a continuatio - Mailing list pgsql-committers

From Heikki Linnakangas
Subject pgsql: Fix scenario where streaming standby gets stuck at a continuatio
Date
Msg-id E1fEjKD-00057O-PP@gemulon.postgresql.org
List pgsql-committers
Fix scenario where streaming standby gets stuck at a continuation record.

If a continuation record is split so that its first half has already been
removed from the master and is present only in the standby's pg_wal, and
the standby has a recycled WAL segment that looks like it would contain
the second half, recovery would get stuck. The code in XLogPageRead()
incorrectly started streaming at the beginning of the WAL record, even if
we had already read the first page.
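
To illustrate the difference described above, here is a minimal sketch of
the two choices of streaming start position. It is not the committed patch:
the type Lsn, the constant WAL_PAGE_SIZE and the helper functions are
hypothetical names introduced only for this example, and the page size is
assumed to be 8192 bytes.

```c
#include <assert.h>
#include <stdint.h>

typedef uint64_t Lsn;          /* hypothetical: byte position in the WAL stream */

#define WAL_PAGE_SIZE 8192u    /* assumed WAL page size for this sketch */

/* Start of the WAL page that contains lsn. */
static Lsn page_start(Lsn lsn)
{
    return lsn - (lsn % WAL_PAGE_SIZE);
}

/*
 * Behavior described as incorrect: always request streaming from the
 * beginning of the record, even when the page we actually need is a later
 * one. If the master has already removed the segment holding the record's
 * first half, this request can never be satisfied.
 */
static Lsn stream_start_old(Lsn record_start, Lsn wanted_page)
{
    (void) wanted_page;        /* ignored: this is the problem */
    return record_start;
}

/*
 * Sketch of the corrected choice: request streaming from the page that is
 * still missing, since earlier pages of the record were already read.
 */
static Lsn stream_start_fixed(Lsn record_start, Lsn wanted_page)
{
    assert(wanted_page % WAL_PAGE_SIZE == 0);
    assert(wanted_page >= page_start(record_start));
    return wanted_page;
}
```

For a record that begins 100 bytes before a page boundary, the first helper
asks for the record's start on the earlier page, while the second asks only
for the following page, which is the part the standby has not yet read.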

Backpatch to 9.4. In principle, older versions have the same problem, but
without replication slots, there was no straightforward mechanism to
prevent the master from recycling old WAL that was still needed by the
standby. Without such a mechanism, I think it's reasonable to assume that
either there's enough slack in how many old segments are kept around to
not run into this, or you have a WAL archive.

Reported by Jonathon Nelson. Analysis and patch by Kyotaro HORIGUCHI, with
some extra comments by me.

Discussion:
https://www.postgresql.org/message-id/CACJqAM3xVz0JY1XFDKPP%2BJoJAjoGx%3DGNuOAshEDWCext7BFvCQ%40mail.gmail.com

Branch
------
REL_10_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/ca572db22f62c24b060a0377d33ba312329b47c7

Modified Files
--------------
src/backend/access/transam/xlog.c       | 44 +++++++++++++++++++++++++++++++--
src/backend/access/transam/xlogreader.c | 27 +++++++++++---------
src/include/access/xlogreader.h         |  4 +++
3 files changed, 62 insertions(+), 13 deletions(-)
