pgsql: walsnd: Don't set waiting_for_ping_response spuriously - Mailing list pgsql-committers

From Alvaro Herrera
Subject pgsql: walsnd: Don't set waiting_for_ping_response spuriously
Date
Msg-id E1k4RvX-0003qM-Rr@gemulon.postgresql.org
List pgsql-committers
walsnd: Don't set waiting_for_ping_response spuriously

Ashutosh Bapat noticed that when a logical walsender needs to wait for
WAL and decides to send a keepalive message to walreceiver to update
the sent-LSN, a message that *does not* request a reply from
walreceiver, it wrongly sets the flag saying that it is waiting for
such a reply.  As a result, any future would-be sender of feedback
messages ends up not sending one, because they all believe that a
reply is already expected.

With built-in logical replication there's not much harm in this, because
WalReceiverMain will send a ping-back every wal_receiver_timeout/2
anyway; but with other logical replication systems (e.g. pglogical) it
can cause significant pain.

This problem was introduced in commit 41d5f8ad734, where the
request-reply flag passed to WalSndKeepalive was changed from true to
false without also removing the line that sets
waiting_for_ping_response.

Just removing that line would be a sufficient fix, but it seems better
to shift the responsibility of setting the flag to WalSndKeepalive
itself instead of requiring the caller to do it; this is clearly less
error-prone.
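
The shape of the fix can be illustrated with a minimal standalone sketch.
This is not the code from walsender.c: the printf stands in for building
and queueing the keepalive message, and wait_for_wal_sketch,
ping_if_needed_sketch, demo_sequence and reply_requests_sent are
hypothetical names invented for the illustration.  Only WalSndKeepalive
and waiting_for_ping_response are names taken from the commit message.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Stand-in for the walsender's flag of the same name. */
static bool waiting_for_ping_response = false;

/* Hypothetical counter so the demo can observe what was sent. */
static int reply_requests_sent = 0;

/*
 * Sketch of WalSndKeepalive() after the fix: the function itself records
 * that a reply is pending, and only when it actually asked for one.
 * The printf is a placeholder for sending the real message.
 */
static void
WalSndKeepalive(bool requestReply)
{
	printf("keepalive sent, reply requested: %s\n",
		   requestReply ? "yes" : "no");

	if (requestReply)
	{
		waiting_for_ping_response = true;
		reply_requests_sent++;
	}
}

/* Hypothetical caller on the wait-for-WAL path: only updates sent-LSN. */
static void
wait_for_wal_sketch(void)
{
	if (!waiting_for_ping_response)
		WalSndKeepalive(false);	/* caller no longer touches the flag */
}

/* Hypothetical later path that wants a reply unless one is pending. */
static void
ping_if_needed_sketch(void)
{
	if (waiting_for_ping_response)
		return;
	WalSndKeepalive(true);
}

/* Walk through the sequence described in the commit message. */
static void
demo_sequence(void)
{
	wait_for_wal_sketch();
	assert(!waiting_for_ping_response);	/* no reply was requested */

	ping_if_needed_sketch();
	assert(waiting_for_ping_response);	/* now a reply really is pending */
	assert(reply_requests_sent == 1);
}
```

With the pre-fix arrangement, where the caller set the flag after
WalSndKeepalive(false), the second step in demo_sequence would return
early and no reply would ever be requested.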

Author: Álvaro Herrera <alvherre@alvh.no-ip.org>
Reported-by: Ashutosh Bapat <ashutosh.bapat@2ndquadrant.com>
Backpatch: 9.5 and up
Discussion: https://postgr.es/m/20200806225558.GA22401@alvherre.pgsql

Branch
------
REL_13_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/900429d0c03668ac474770c01ba5911b15025dfb

Modified Files
--------------
src/backend/replication/walsender.c | 24 ++++++++++++------------
1 file changed, 12 insertions(+), 12 deletions(-)

