Home > mailing lists

Replication & recovery_min_apply_delay - Mailing list pgsql-hackers

From	Konstantin Knizhnik
Subject	Replication & recovery_min_apply_delay
Date	January 30, 2019 12:04:53
Msg-id	b271715f-f945-35b0-d1f5-c9de3e56f65e@postgrespro.ru Whole thread Raw
Responses	Re: Replication & recovery_min_apply_delay
List	pgsql-hackers

Tree view

Hi hackers,

One of our customers was faced with the following problem:
he has setup  physical primary-slave replication but for some reasons 
specified very large (~12 hours)
recovery_min_apply_delay. I do not know precise reasons for such large 
gap between master and replica.
But everything works normally until replica is restarted. Then it starts 
to apply WAL, comes to the point where record timestamp is less then 12 
hours older
and ... suspends recovery. No WAL receiver is launched and so nobody is 
fetching changes from master.
It may cause master's WAL space overflow (if there is replication slot) 
and loose of data in case of master crash.

Looks like the right behavior is to be able launch WAL receiver before 
replica reaches end of WAL.
For example, we can launch it before going to sleep in recoveryApplyDelay.
We need to specify start LSN for WAL sender. I didn't find better 
solution except iterating WAL until I reach the last valid record.

I attach small patch which implements this approach.
I wonder if it can be considered as acceptable solution of the problem 
or there can be some better approach?

-- 
Konstantin Knizhnik
Postgres Professional:http://www.postgrespro.com
The Russian Postgres Company

Attachment

wal_apply_delay.patch

pgsql-hackers by date:

From: Peter Eisentraut
Date: 30 January 2019, 11:33:47
Subject: Re: Unused parameters & co in code

From: Hubert Zhang
Date: 30 January 2019, 12:34:27
Subject: Re: Control your disk usage in PG: Introduction to Disk Quota Extension

Replication & recovery_min_apply_delay - Mailing list pgsql-hackers

Attachment

Previous

Next