Re: WAL replay issue from 9.6.8 to 9.6.10 - Mailing list pgsql-general

From Michael Paquier
Subject Re: WAL replay issue from 9.6.8 to 9.6.10
Date
Msg-id 20180829205003.GD5903@paquier.xyz
Whole thread Raw
In response to Re: WAL replay issue from 9.6.8 to 9.6.10  (Dave Peticolas <dave@krondo.com>)
Responses Re: WAL replay issue from 9.6.8 to 9.6.10
List pgsql-general
On Wed, Aug 29, 2018 at 09:15:29AM -0700, Dave Peticolas wrote:
> Oh, perhaps I do, depending on what you mean by worker. There are a couple
> of periodic processes that connect to the server to obtain metrics. Is that
> what is triggering this issue? In my case I could probably suspend them
> until the replay has reached the desired point.

That would be it.  How do you decide when those begin to run and connect
to Postgres.  Do you use pg_isready or similar in a loop for sanity
checks?

> I have noticed this behavior in the past but prior to 9.6.10 restarting the
> server would fix the issue. And the replay always seemed to reach a point
> past which the problem would not re-occur.

You are picking my interest here.  Did you actually see the same
problem?  In 9.6.10 what happens is that I have tightened the consistent
point checks and logic so as inconsistent page issues would actually
show up when they should, and that those become reproducible so as we
can track down any rogue WAL record or inconsistent behavior.
--
Michael

Attachment

pgsql-general by date:

Previous
From: Bruce Momjian
Date:
Subject: Re: pg_upgrade fails saying function unaccent(text) doesn't exist
Next
From: Dave Peticolas
Date:
Subject: Re: WAL replay issue from 9.6.8 to 9.6.10