Home > mailing lists

Re: Sync Rep Design - Mailing list pgsql-hackers

From	Aidan Van Dyk
Subject	Re: Sync Rep Design
Date	December 30, 2010 19:47:12
Msg-id	AANLkTinV4hNPfBcozG-e+63-0BAxbhnsJdhVmdOu356A@mail.gmail.com Whole thread Raw
In response to	Re: Sync Rep Design (Robert Treat <rob@xzilla.net>)
Responses	Re: Sync Rep Design
List	pgsql-hackers

Tree view

On Thu, Dec 30, 2010 at 3:07 PM, Robert Treat <rob@xzilla.net> wrote:

>> If primary crashes while commits are waiting for acknowledgement, those
>> transactions will be marked fully committed if the primary database
>> recovers, no matter how allow_standalone_primary is set.
>
> This seems backwards; if you are waiting for acknowledgement, wouldn't the
> normal assumption be that the transactions *didnt* make it to any standby,
> and should be rolled back ?

This is the standard 2-phase commit problem.  The primary server *has*
committed it, it's fsync has returned, and the only thing keeping it
from returning the commit to the client is that it's waiting on a
synchronous "ack" from a slave.

You've got 2 options:
1) initiate fsync on the slave first  - In this case, the slave is farther ahead than the primary, and if
primary fails, you're *forced* to have a failover.  The standby is
head of the primary, so the primary recovering can cause divergence.
And you'll likely have to do a base-backup style sync to get a new
primary/standby setup.
2) initiate fsync on the primary first  - In this case, the slave is always slightly behind.  If if your
primary falls over, you don't give commit messages to the clients, but
if it recovers, it might have committed data, and slaves will still be
able to catch up.

The thing is that currently, even without replication, #2 can happen.
If your db falls over before it gets the commit packet stuffed out the
network, you're in the same boat.  The data might be committed, even
though you didn't get the commit packet, and when  your DB recovers,
it's got the committed data that you never "knew" was committed.

a.

--
Aidan Van Dyk                                             Create like a god,
aidan@highrise.ca                                       command like a king,
http://www.highrise.ca/                                   work like a slave.

pgsql-hackers by date:

From: Stefan Kaltenbrunner
Date: 30 December 2010, 19:42:36
Subject: Re: Sync Rep Design

From: Robert Treat
Date: 30 December 2010, 19:51:28
Subject: Re: Sync Rep Design

Re: Sync Rep Design - Mailing list pgsql-hackers

Previous

Next