Re: The XLogFindNextRecord() routine find incorrect record startpoint after a long continuation record - Mailing list pgsql-bugs

From Michael Paquier
Subject Re: The XLogFindNextRecord() routine find incorrect record startpoint after a long continuation record
Date
Msg-id 20191106044120.GF1604@paquier.xyz
Whole thread Raw
In response to The XLogFindNextRecord() routine find incorrect record start pointafter a long continuation record  (Andrey Lepikhov <a.lepikhov@postgrespro.ru>)
Responses Re: The XLogFindNextRecord() routine find incorrect record startpoint after a long continuation record  (Andrey Lepikhov <a.lepikhov@postgrespro.ru>)
List pgsql-bugs
On Wed, Nov 06, 2019 at 07:40:48AM +0500, Andrey Lepikhov wrote:
> I found this in our multimaster project on PostgreSQL 11.5. It is difficult
> to reproduce this error, but I will try to do it if necessary.
>
> The rest of a continuation WAL-record can exactly match the block size. In
> this case, we need to switch targetPagePtr to the next block before
> calculating the starting point of the next WAL-record.
> See the patch in attachment for the bug fix.

What's the error you actually saw after reading the record in
xlogreader.c?  If you have past WAL archives, perhaps you are able to
reproduce the problem with a given WAL segment and pg_waldump?

This code rings a bell as of this thread which led to commit bab7823:
https://www.postgresql.org/message-id/CABOikdPsPByMiG6J01DKq6om2+BNkxHTPkOyqHM2a4oYwGKsqQ@mail.gmail.com

Reminds me as well of the following:
https://www.postgresql.org/message-id/CAOR=d=16x+jKM=00AoEZEKU9RJDwjU9E2bUnFBpX_HrGjkMHXg@mail.gmail.com
--
Michael

Attachment

pgsql-bugs by date:

Previous
From: Andrey Lepikhov
Date:
Subject: The XLogFindNextRecord() routine find incorrect record start pointafter a long continuation record
Next
From: Andrey Lepikhov
Date:
Subject: Re: The XLogFindNextRecord() routine find incorrect record startpoint after a long continuation record