pg_rewind problem: cannot find WAL - Mailing list pgsql-general

From Luca Ferrari
Subject pg_rewind problem: cannot find WAL
Date
Msg-id CAKoxK+5ypkP1Ts1KNuCgqAxwp=8ejQ-wfiBrpvZ1coZ3dZfOWg@mail.gmail.com
Whole thread Raw
Responses Re: pg_rewind problem: cannot find WAL
List pgsql-general
Hi all,
running 17.4 on ubuntu 24.04 machines. I've three hosts, pg-1
(primary) and two physical replicas.
I then promote host pg-3 as a master (pg_promote()) and want to rewind
the pg-1 to follow the new master, so:

ssh pg-3 'sudo -u postgres /usr/lib/postgresql/17/bin/pg_rewind -D
/var/lib/postgresql/17/main --source-server="user=replica_fluca
host=pg-3 dbname=replica_fluca"'
pg_rewind: servers diverged at WAL location 0/B8550F8 on timeline 1
pg_rewind: error: could not open file
"/var/lib/postgresql/17/main/pg_wal/00000001000000000000000A": No such
file or directory
pg_rewind: error: could not find previous WAL record at 0/AFFF4E8

But the file 0x010000A is not there:


 % ssh pg-3 'sudo ls /var/lib/postgresql/17/main/pg_wal'
00000001000000000000000B.partial
00000002.history
00000002000000000000000B
00000002000000000000000C
00000002000000000000000D
00000002000000000000000E
archive_status
summaries

% ssh pg-1 'sudo ls /var/lib/postgresql/17/main/pg_wal'
000000010000000000000005.00000028.backup
00000001000000000000000B
00000001000000000000000C
00000001000000000000000D
00000001000000000000000E
archive_status
summaries

Do i have to ensure the old primary pg-1 does a wal switch before
promoting the other one and try to rewind?

Thanks,
Luca



pgsql-general by date:

Previous
From: Laurenz Albe
Date:
Subject: Re: Error in DROP TABLESPACE
Next
From: Bogdan Siara
Date:
Subject: huge_pages=on cause could not map anonymous shared memory: Cannot allocate memory