Re: Slave promotion problem... - Mailing list pgsql-general

From Martín Marqués
Subject Re: Slave promotion problem...
Msg-id 55E44AC2.4000103@2ndquadrant.com
In response to Slave promotion problem...  (marin@kset.org)
Responses Re: Slave promotion problem...
List pgsql-general
On 31/08/15 at 03:29, marin@kset.org wrote:
> Last week we had some problems on the master server which caused a
> failover on the slave (the master was completely unresponsive due to
> reasons still unknown). The slave received the promote signal (pg_ctl
> promote) and logged that in the logs:
> 2015-08-28 23:05:10 UTC [6]: [50-1] user=,db= LOG:  received promote
> request
> 2015-08-28 23:05:10 UTC [467]: [2-1] user=,db= FATAL:  terminating
> walreceiver process due to administrator command
>
> 5 hours later the slave still didn't promote. Meanwhile we fixed the
> master and restarted it. The slave was restarted and it behaved just
> like the promote signal didn't arrive, connecting to the master as a
> regular slave.

Aren't there any further logs after the walreceiver termination?

Up to this point everything looks fine, but we have no idea what was
logged afterwards.

> I am unsure if this promotion failure is a bug/glitch, but the promote
> procedure is automated and tested a couple of hundred times so I am
> certain we initiated the promote correctly.

Are you using homemade scripts? Maybe you need to test them more
thoroughly, with different environment parameters.
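
One check such a script could include is confirming that the promotion
actually completed, rather than only that the promote request was sent.
Below is a minimal sketch, not taken from your setup: it assumes psql is
on the PATH and can connect to the standby through the usual PG*
environment variables, and the function names and timeout values are
made up for illustration. It polls pg_is_in_recovery() until the server
reports that it has left recovery, or gives up after a timeout.

```python
import subprocess
import time


def standby_in_recovery(psql_args=("psql", "-At", "-c", "SELECT pg_is_in_recovery();")):
    """Return True while the server still reports that it is in recovery.

    Assumption: psql is on PATH and connection settings come from the
    environment (PGHOST, PGPORT, PGUSER, ...).
    """
    result = subprocess.run(list(psql_args), capture_output=True, text=True, check=True)
    # With -A (unaligned) and -t (tuples only), a boolean prints as "t" or "f".
    return result.stdout.strip() == "t"


def wait_for_promotion(in_recovery=standby_in_recovery, timeout=60.0, interval=1.0,
                       sleep=time.sleep, clock=time.monotonic):
    """Poll until the server has left recovery; return False on timeout."""
    deadline = clock() + timeout
    while True:
        if not in_recovery():
            return True   # promotion finished
        if clock() >= deadline:
            return False  # still a standby: alert instead of assuming success
        sleep(interval)
```

A script would call wait_for_promotion() right after issuing
pg_ctl promote and raise an alert if it returns False, so that a
promotion that stalls is noticed within minutes rather than hours.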

Regards,

--
Martín Marqués                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services

