Home > mailing lists

Re: Hot standby, race condition between recovery snapshot and commit - Mailing list pgsql-hackers

From	Heikki Linnakangas
Subject	Re: Hot standby, race condition between recovery snapshot and commit
Date	November 15, 2009 08:42:57
Msg-id	4AFFF756.3020206@enterprisedb.com Whole thread Raw
In response to	Re: Hot standby, race condition between recovery snapshot and commit (Heikki Linnakangas <heikki.linnakangas@enterprisedb.com>)
Responses	Re: Hot standby, race condition between recovery snapshot and commit
List	pgsql-hackers

Tree view

Oh, forgot to mention another thing that I've been pondering:

Currently, the running-xacts record is written to the WAL after the
checkpoint record. There's a small chance that you get an xlog switch in
between. If that happens, it might take a long time after the checkpoint
record until the standby sees the running-xacts record, so it might take
a long time until the standby can open up for connections.

In general, I'd like to remove as many as possible of those cases where
the standby starts up, and can't open up for connections. It makes the
standby a lot less useful if you can't rely on it being open. So I'd
like to make it so that the standby can *always* open up. There's
currently three cases where that can happen:

1. If the subxid cache has overflown.

2. If there's no running-xacts record after the checkpoint record for
some reason. For example, one was written but not archive yet, or
because the master crashed before it was written.

3. If too many AccessExclusiveLocks was being held.

Case 3 should be pretty easy to handle. Just need to WAL log all the
AccessExclusiveLocks, perhaps as separate WAL records (we already have a
new WAL record type for logging locks) if we're worried about the
running-xacts record growing too large. I think we could handle case 2
if we wrote the running-xacts record *before* the checkpoint record.
Then it would be always between the REDO pointer of the checkpoint
record, and the checkpoint record itself, so it would always be seen by
the WAL recovery. To handle case 1, we could scan pg_subtrans. It would
add some amount of code and would add some more work to taking the
running-xacts snapshot, but it could be done.

This isn't absolutely necessary for the first version, but it's
something to keep in mind...

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

pgsql-hackers by date:

From: Robert Haas
Date: 15 November 2009, 08:36:18
Subject: Re: commitfest patch move unavailable

From: Peter Eisentraut
Date: 15 November 2009, 09:38:28
Subject: Re: Python 3.1 support

Re: Hot standby, race condition between recovery snapshot and commit - Mailing list pgsql-hackers

Previous

Next