Re: Pgsql resource agent of pacemaker - Mailing list pgsql-admin

From Shital A
Subject Re: Pgsql resource agent of pacemaker
Date
Msg-id CAMp7vw9XQo2z44OFMCD6C4nwnb6eP8hCC392kp90HVDRXBV9kg@mail.gmail.com
Whole thread Raw
In response to Re: Pgsql resource agent of pacemaker  (Shital A <brightuser2019@gmail.com>)
List pgsql-admin


On Mon, 19 Aug 2019, 23:36 Shital A, <brightuser2019@gmail.com> wrote:


On Mon, 19 Aug 2019, 18:47 Shital A, <brightuser2019@gmail.com> wrote:
Hello,

Need advise on below situation:

Postgres 9.6
Pacemaker 1.1.19
Corosync 2.4.3

We are testing HA setup on a two node cluster using pacemaker, corosync stack. The replication is streaming replication in async mode.
Whenever there is a failover to standby, pgsql created a PGSQL.lock file in /var/lib/pgsql/tmp/ folder. We noticed that this file is created at random on any one of the nodes. 

Want to know:
- Is there any logic behind which node it creates the lock file? 

- When a node in kept in HS:alone state ? 

- How can we check if failed node is at a log location behind current primary? So that we can force RA to start by deleting lock file? 

Please help to understand

Thanks. 





Any thoughts? 

Thanks in advance! 


Hello,

Was wandering if this is not the right mailing list for this issue. Please let me know.

Pacemaker on pgsql do not behave in a fixed pattern. The lock file gets created on any node. According to link https://wiki.clusterlabs.org/wiki/PgSQL_Replicated_Cluster it should get created on the failing node however we are seeing it being created on the promoted node.

 If you have such setup in production or have evaluated, please provide inputs.

Thanks.




pgsql-admin by date:

Previous
From: Jeff Janes
Date:
Subject: Re: invalid record length at XX: wanted 24, got
Next
From: Srinath Ganesh
Date:
Subject: Postgres PITR: Recovery does not seem to fetch from Archive Dir