Re: tracking commit timestamps - Mailing list pgsql-hackers

From Heikki Linnakangas
Subject Re: tracking commit timestamps
Date
Msg-id 52A6E55D.6020603@vmware.com
Whole thread Raw
In response to tracking commit timestamps  (Alvaro Herrera <alvherre@2ndquadrant.com>)
Responses Re: tracking commit timestamps
Re: tracking commit timestamps
List pgsql-hackers
On 10/23/2013 01:16 AM, Alvaro Herrera wrote:
> There has been some interest in keeping track of timestamp of
> transaction commits.  This patch implements that.
>
> There are some seemingly curious choices here.  First, this module can
> be disabled, and in fact it's turned off by default.  At startup, we
> verify whether it's enabled, and create the necessary SLRU segments if
> so.  And if the server is started with this disabled, we set the oldest
> value we know about to avoid trying to read the commit TS of
> transactions of which we didn't keep record.  The ability to turn this
> off is there to avoid imposing the overhead on systems that don't need
> this feature.
>
> Another thing of note is that we allow for some extra data alongside the
> timestamp proper.  This might be useful for a replication system that
> wants to keep track of the origin node ID of a committed transaction,
> for example.  Exactly what will we do with the bit space we have is
> unclear, so I have kept it generic and called it "commit extra data".
>
> This offers the chance for outside modules to set the commit TS of a
> transaction; there is support for WAL-logging such values.  But the core
> user of the feature (RecordTransactionCommit) doesn't use it, because
> xact.c's WAL logging itself is enough.  For systems that are replicating
> transactions from remote nodes, it is useful.
>
> We also keep track of the latest committed transaction.  This is
> supposed to be useful to calculate replication lag.

Generally speaking, I'm not in favor of adding dead code, even if it 
might be useful to someone in the future. For one, it's going to get 
zero testing. Once someone comes up with an actual use case, let's add 
that stuff at that point. Otherwise there's a good chance that we build 
something that's almost but not quite useful.

Speaking of the functionality this does offer, it seems pretty limited. 
A commit timestamp is nice, but it isn't very interesting on its own. 
You really also want to know what the transaction did, who ran it, etc. 
ISTM some kind of a auditing or log-parsing system that could tell you 
all that would be much more useful, but this patch doesn't get us any 
closer to that.

Does this handle XID wraparound correctly? SLRU has a maximum of 64k 
segments with 32 SLRU pages each. With 12 bytes per each commit entry, 
that's not enough to hold the timestamp and "commit extra data" of the 
whole 2^31 XID range: (8192 * 32 * 65536) / 12 = 1431655765. And that's 
with the default page size, with smaller pages you run into the limit 
quicker.

It would be nice to teach SLRU machinery how to deal with more than 64k 
segments. SSI code in twophase.c ran into the same limit, and all you 
get is a warning there.

- Heikki



pgsql-hackers by date:

Previous
From: Andres Freund
Date:
Subject: Re: Time-Delayed Standbys
Next
From: Andres Freund
Date:
Subject: Re: tracking commit timestamps