synchronous streaming replication - Mailing list pgsql-general

From Joseph Kennedy
Subject synchronous streaming replication
Date
Msg-id CACEKkuPwy-Y5j_u13MUhtLEyHnTHpR_mKV-Z+f3h4GpGu6tBjQ@mail.gmail.com
Whole thread Raw
Responses Re: synchronous streaming replication
List pgsql-general
Hi,

I have a question. Its stays at the intersection of software engineering and PostgreSQL.


I have configured streaming synchronous replication and whit setting "synchronous_commit=remote_apply" to make sure that the slave will always respond the same as the MASTER (this is a developers' requirement that the MASTER always responds the same as SLAVE). I set "hot_standby_feedback=on" and "max_standby_streaming_delay=-1",

max_standby_streaming_delay set to -1 to make MASTER wait indefinitely before SELECT conflicts on the SLAVE will end.

 

Here's where the problem arises, because not long after the replication has been started some serious delays occur in the form of "replay_lag" - which rather indicates the appearance of conflicts; the replication stops working properly.

 

From the server logs it appears that UPDATE (select for update) has occurred on the MASTER, and SELECT queries are in progress on SLAVE causing replication conflicts, with setting "max_standby_streaming_delay=-1" they never ends and there are huge lags.

 

From the findings with the developers it emerged that they do not want me to set max_standby_streaming_delay to a value after which the queries conflicted with replication will be canceled.

  • So I'm wondering if, in this configuration, it can work properly at all without setting, for example, "max_standby_streaming_delay=30" ?
  • On the other hand I wonder if the application should not be developed in such a way to support replication of PostgreSQL configured as a streaming synchronous replication cluster with "synchronous_commit=remote_apply" ?
  • Or perhaps "synchronous streaming replication" is a bad choice, maybe logical replication would be better ?
  • What are the best practices?
  • Perhaps you just need to force/teach applications to work with synchronous replication in such a way that when the SELECT causes conflicts with replication such queries are canceled and the application should resend/repeat query ?
  • I also think that after setting, for example, "max_standby_streaming_delay=30" queries (addressed) to the database should be very well optimized, so that too long queries are not canceled too frequently?
  • Do you know any books focused on applications adapted to work in postgresql synchronous streaming replication environment i.e. High Availability?

Joseph

pgsql-general by date:

Previous
From: "Peter J. Holzer"
Date:
Subject: Re: Explain returns different number of rows
Next
From: Adrian Klaver
Date:
Subject: Re: High CPU usage