I have meet the similar issue when the postgres is not stopped normally.
You could run pg_controldata to check if your postgres status is shutdown/shutdown in recovery.
I change the /usr/lib/ocf/resource.d/heartbeat/pgsqlms to avoid this problem:
elsif ( $pgisready_rc == 2 ) {
# The instance is not listening.
# We check the process status using pg_ctl status and check
# if it was propertly shut down using pg_controldata.
ocf_log( 'debug', 'pgsql_monitor: instance "%s" is not listening',
$OCF_RESOURCE_INSTANCE );
# return _confirm_stopped(); # remove this line
return $OCF_NOT_RUNNING;
}
-----邮件原件-----
发件人: Users [mailto:users-bounces@clusterlabs.org] 代表 Adrien Nayrat
发送时间: 2018年4月24日 16:16
收件人: Andrew Edenburn <andrew.edenburn@gm.com>; pgsql-general@postgresql.org; users@clusterlabs.org
主题: Re: [ClusterLabs] Postgres PAF setup
On 04/23/2018 08:09 PM, Andrew Edenburn wrote:
> I am having issues with my PAF setup. I am new to Postgres and have
> setup the cluster as seen below.
>
> I am getting this error when trying to start my cluster resources.
>
>
>
> Master/Slave Set: pgsql-ha [pgsqld]
>
> pgsqld (ocf::heartbeat:pgsqlms): FAILED dcmilphlum224
> (unmanaged)
>
> pgsqld (ocf::heartbeat:pgsqlms): FAILED dcmilphlum223
> (unmanaged)
>
> pgsql-master-ip (ocf::heartbeat:IPaddr2): Started
> dcmilphlum223
>
>
>
> Failed Actions:
>
> * pgsqld_stop_0 on dcmilphlum224 'unknown error' (1): call=239,
> status=complete, exitreason='Unexpected state for instance "pgsqld"
> (returned 1)',
>
> last-rc-change='Mon Apr 23 13:11:17 2018', queued=0ms, exec=95ms
>
> * pgsqld_stop_0 on dcmilphlum223 'unknown error' (1): call=248,
> status=complete, exitreason='Unexpected state for instance "pgsqld"
> (returned 1)',
>
> last-rc-change='Mon Apr 23 13:11:17 2018', queued=0ms, exec=89ms
>
>
>
> cleanup and clear is not fixing any issues and I am not seeing
> anything in the logs. Any help would be greatly appreciated.
>
>
Hello Andrew,
Could you enable debug logs in Pacemaker?
With Centos you have to edit PCMK_debug variable in /etc/sysconfig/pacemaker :
PCMK_debug=crmd,pengine,lrmd
This should give you more information in logs. Monitor action in PAF should report why the cluster doesn't start :
https://github.com/ClusterLabs/PAF/blob/master/script/pgsqlms#L1525
Regards,
--
Adrien NAYRAT