Home > mailing lists

sync_standbys_defined and pg_stat_replication - Mailing list pgsql-hackers

From	Jeremy Schneider
Subject	sync_standbys_defined and pg_stat_replication
Date	October 7 08:59:33
Msg-id	20251006225933.1dde33c7@ardentperf.com Whole thread Raw
List	pgsql-hackers

Tree view

For failover to work correctly, if someone changes the GUC
synchronous_standby_names to enable sync replication, then we need to
understand the exact moment when backends will begin to block in order
to correctly determine when we can failover without data loss.

There's an older mailing list thread that discusses one aspect of this

https://www.postgresql.org/message-id/flat/CABrsG8j3kPD%2Bkbbsx_isEpFvAgaOBNGyGpsqSjQ6L8vwVUaZAQ%40mail.gmail.com

I've also gone through the code for SyncRepWaitForLSN() and worked
backwards to where the checkpointer sets sync_standbys_defined. But I
have a question which I couldn't answer so far.

It looks like sync_standbys_defined is only updated by the checkpointer
process. Is there a short period of time where the pg_stat_replication
view would show sync_state=sync and state=streaming, but the
checkpointer has not yet updated sync_standbys_defined?

I'm wondering if this is a race condition where COMMITs are not being
blocked for replication but external tools which rely on
pg_stat_replication would think it's safe to failover with zero data
loss?

-Jeremy

pgsql-hackers by date:

From: "Hayato Kuroda (Fujitsu)"
Date: 07 October, 08:57:14
Subject: RE: Patch for migration of the pg_commit_ts directory

From: "Joel Jacobson"
Date: 07 October, 09:16:24
Subject: Re: Optimize LISTEN/NOTIFY

sync_standbys_defined and pg_stat_replication - Mailing list pgsql-hackers

Previous

Next