Thread: pgsql: Don't archive bogus recycled or preallocated files after timelin

pgsql: Don't archive bogus recycled or preallocated files after timelin

From
Heikki Linnakangas
Date:
Don't archive bogus recycled or preallocated files after timeline switch.

After a timeline switch, we would leave behind recycled WAL segments that
are in the future, but on the old timeline. After promotion, and after they
become old enough to be recycled again, we would notice that they don't have
a .ready or .done file, create a .ready file for them, and archive them.
That's bogus, because the files contain garbage, recycled from an older
timeline (or prealloced as zeros). We shouldn't archive such files.

This could happen when we're following a timeline switch during replay, or
when we switch to new timeline at end-of-recovery.

To fix, whenever we switch to a new timeline, scan the data directory for
WAL segments on the old timeline, but with a higher segment number, and
remove them. Those don't belong to our timeline history, and are most
likely bogus recycled or preallocated files. They could also be valid files
that we streamed from the primary ahead of time, but in any case, they're
not needed to recover to the new timeline.

Branch
------
master

Details
-------
http://git.postgresql.org/pg/commitdiff/b2a5545bd63fc94a71b1e97ecdd03c605d97a438

Modified Files
--------------
src/backend/access/transam/xlog.c        |  280 ++++++++++++++++++++----------
src/backend/access/transam/xlogarchive.c |   19 ++
src/include/access/xlog_internal.h       |    1 +
3 files changed, 211 insertions(+), 89 deletions(-)


Heikki Linnakangas <heikki.linnakangas@iki.fi> writes:
> To fix, whenever we switch to a new timeline, scan the data directory for
> WAL segments on the old timeline, but with a higher segment number, and
> remove them.

Wait a minute.  Didn't you just break PITR scenarios?  The *entire point*
of the multiple-timeline mechanism is that you can start a new WAL history
from a point in the past without destroying WAL history after that point
on the old timeline.

I've not read the patch, so maybe the code is all right, but this
description most certainly is not.

            regards, tom lane


Re: pgsql: Don't archive bogus recycled or preallocated files after timelin

From
Heikki Linnakangas
Date:
On 04/14/2015 02:24 AM, Tom Lane wrote:
> Heikki Linnakangas <heikki.linnakangas@iki.fi> writes:
>> To fix, whenever we switch to a new timeline, scan the data directory for
>> WAL segments on the old timeline, but with a higher segment number, and
>> remove them.
>
> Wait a minute.  Didn't you just break PITR scenarios?  The *entire point*
> of the multiple-timeline mechanism is that you can start a new WAL history
> from a point in the past without destroying WAL history after that point
> on the old timeline.

It deletes the files belonging to other timelines from pg_xlog only. The
WAL archive is not touched.

- Heikki



Heikki Linnakangas <hlinnaka@iki.fi> writes:
> On 04/14/2015 02:24 AM, Tom Lane wrote:
>> Wait a minute.  Didn't you just break PITR scenarios?  The *entire point*
>> of the multiple-timeline mechanism is that you can start a new WAL history
>> from a point in the past without destroying WAL history after that point
>> on the old timeline.

> It deletes the files belonging to other timelines from pg_xlog only. The
> WAL archive is not touched.

How do we know those files are in the WAL archive?

            regards, tom lane


Re: pgsql: Don't archive bogus recycled or preallocated files after timelin

From
Heikki Linnakangas
Date:
On 04/14/2015 05:32 PM, Tom Lane wrote:
> Heikki Linnakangas <hlinnaka@iki.fi> writes:
>> On 04/14/2015 02:24 AM, Tom Lane wrote:
>>> Wait a minute.  Didn't you just break PITR scenarios?  The *entire point*
>>> of the multiple-timeline mechanism is that you can start a new WAL history
>>> from a point in the past without destroying WAL history after that point
>>> on the old timeline.
>
>> It deletes the files belonging to other timelines from pg_xlog only. The
>> WAL archive is not touched.
>
> How do we know those files are in the WAL archive?

Presumably that's where they came from if you're doing PITR. Or they
were copied into pg_xlog manually from somewhere else.

After PITR, if you decide that you didn't like the result and perform
another recovery, you have to restore from a base backup again, which
will wipe out pg_xlog anyway.

Also, if the recovery continued longer, those files would be deleted
from pg_xlog anyway, at a later restartpoint.

- Heikki