
Streaming Replication Failover - Mailing list pgsql-general

From	ning chan
Subject	Streaming Replication Failover
Date	January 17, 2013 05:17:34
Msg-id	CAG0k5vDu=qkKBWWa=jiSDxhXk6jww3-vPKHLQYq=aTzq9NcF8w@mail.gmail.com
List	pgsql-general


Hi,

I have a cluster of 3 nodes: StandbyA streams from Primary, and StandbyB streams from StandbyA.

I failed over the cluster as follows:

1) stopped Primary

2) promoted StandbyA

After stopping Primary:

StandbyA syslog:
Jan 16 23:08:12 se032c-94-31 postgres[12316]: [3-1] 12316FATAL: replication terminated by primary server
Jan 16 23:08:12 se032c-94-31 postgres[12312]: [5-1] 12312LOG: redo starts at 0/1EAC3E68

StandbyB syslog:
Jan 16 23:09:48 localhost postgres[3932]: [5-1] LOG: redo starts at 0/1EAC3E68

As soon as I promoted StandbyA, replication between A and B broke. The StandbyB syslog shows:
Jan 16 23:11:28 localhost postgres[3945]: [2-1] FATAL: timeline 15 of the primary does not match recovery target timeline 14
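
For anyone checking their own logs for the same symptom, here is a minimal sketch that pulls the two timeline IDs out of a message like the one above. The function name timeline_mismatch is hypothetical, and the pattern assumes the message wording is exactly as in this log line.

```shell
#!/bin/sh
# Hypothetical helper: read syslog lines on stdin and print the upstream
# timeline and the standby's recovery target timeline for every
# "does not match recovery target timeline" message.
# Assumes the message text matches the FATAL line quoted in this post.
timeline_mismatch() {
  sed -n 's/.*timeline \([0-9][0-9]*\) of the primary does not match recovery target timeline \([0-9][0-9]*\).*/primary=\1 target=\2/p'
}

# Sample input: the line from the StandbyB syslog.
line='Jan 16 23:11:28 localhost postgres[3945]: [2-1] FATAL: timeline 15 of the primary does not match recovery target timeline 14'
printf '%s\n' "$line" | timeline_mismatch
# prints: primary=15 target=14
```

Lines that do not contain the message produce no output, so the function can be fed a whole syslog file.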

My question is: A and B were in sync, so why does promoting A break the replication between them?

To resolve the problem, I had to stop the engine on B, rsync the data directory from A, and start the engine on B again:
rsync -a --progress --exclude postgresql.conf --exclude recovery.done --exclude pg_hba.conf root@10.89.94.31:/opt/postgres/9.2/data/* /opt/postgres/9.2/data

Do I need to sync the whole data directory from A? My DB is small for now (2 tables with only a few rows), but this could take a long time with a much larger DB. Is there any shortcut? Why do I need to do the rsync at all when A and B were originally in sync?

Thanks~

Ning
