Re: postmaster recovery and automatic restart suppression - Mailing list pgsql-hackers

From Gregory Stark
Subject Re: postmaster recovery and automatic restart suppression
Date
Msg-id 87tz2rt5x7.fsf@oxford.xeocode.com
In response to Re: postmaster recovery and automatic restart suppression  (Fujii Masao <masao.fujii@gmail.com>)
Responses Re: postmaster recovery and automatic restart suppression  (Fujii Masao <masao.fujii@gmail.com>)
Re: postmaster recovery and automatic restart suppression  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
Fujii Masao <masao.fujii@gmail.com> writes:

> On the other hand, the primary postgres might *not* restart automatically.
> So, it's difficult for clusterware to choose whether to do failover when it
> detects the death of the primary postgres, I think.


I think the accepted way to handle this kind of situation is called STONITH --
"Shoot The Other Node In The Head".

When the cluster software decides to initiate failover, you need some way to
ensure that the first node *cannot* come back up. That could mean cutting its
power at the PDU, disabling its network connection at the switch, or various
other options.
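
A minimal sketch of that ordering, assuming hypothetical callbacks: the names
`fence_old_primary` and `promote_standby` are illustrative only and do not
correspond to any real clusterware or PostgreSQL API. The point is just that
the standby is promoted only after the old primary is confirmed fenced.

```python
from typing import Callable


class FencingFailed(Exception):
    """Raised when the old primary could not be confirmed fenced."""


def fail_over(fence_old_primary: Callable[[], bool],
              promote_standby: Callable[[], None]) -> None:
    """Promote the standby only after the old primary is fenced.

    fence_old_primary: hypothetical callback that cuts power at the PDU or
        disables the switch port, returning True only when that is confirmed.
    promote_standby: hypothetical callback that makes the standby the primary.
    """
    # Fence first: if we cannot prove the old node is down, promoting the
    # standby risks two primaries accepting writes at once.
    if not fence_old_primary():
        raise FencingFailed("old primary not confirmed fenced; not promoting")
    promote_standby()
```

If fencing cannot be confirmed, the sketch refuses to promote rather than
guessing, which trades availability for protection against split-brain.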
Gregory Stark
http://mit.edu/~gsstark/resume.pdf

