Re: TransactionXmin != MyProc->xmin - Mailing list pgsql-hackers

From Andres Freund
Subject Re: TransactionXmin != MyProc->xmin
Date
Msg-id xcypt6khyzfdztv5x52e4jyrxnu33sq6ei3mks4s6gflhcipjy@yfxofe5dhzm4
Whole thread Raw
Responses Re: TransactionXmin != MyProc->xmin
List pgsql-hackers
Hi,

On 2024-12-12 20:16:39 +0200, Heikki Linnakangas wrote:
> The comment in GetSnapshotData() defines transactionXmin like this:
> 
> > TransactionXmin: the oldest xmin of any snapshot in use in the current
> > transaction (this is the same as MyProc->xmin).
> However, we don't update TransactionXmin when we update MyProc->xmin in
> SnapshotResetXmin(). So TransactionXmin can be older than MyProc->xmin.

Oops, good catch.


> A straightforward fix is to ensure that TransactionXmin is updated whenever
> MyProc->xmin is:
> 
> diff --git a/src/backend/utils/time/snapmgr.c
> b/src/backend/utils/time/snapmgr.c
> index a1a0c2adeb6..f59830abd21 100644
> --- a/src/backend/utils/time/snapmgr.c
> +++ b/src/backend/utils/time/snapmgr.c
> @@ -883,7 +883,7 @@ SnapshotResetXmin(void)
>                                          pairingheap_first(&RegisteredSnapshots));
> 
>      if (TransactionIdPrecedes(MyProc->xmin, minSnapshot->xmin))
> -        MyProc->xmin = minSnapshot->xmin;
> +        MyProc->xmin = TransactionXmin = minSnapshot->xmin;
>  }
> 
>  /*
> 
> Anyone see a reason not to do that?

Seems to make sense.

But shouldn't we reset TransactionXmin in the pairingheap_is_empty() case as
well?

Perhaps we should have some assertions ensuring TransactionXmin has a valid
value in some places?



> There are a two other places where we set MyProc->xmin without updating
> TransactionXmin:
> 
> 1. In ProcessStandbyHSFeedbackMessage(). AFAICS that's OK because walsender
> doesn't use TransactionXmin for anything.

It's worth noting that a walsender connection can do normal query processing
these days if connected to a database....


> 2. In SnapBuildInitialSnapshot(). I suppose that's also OK because the
> TransactionXmin isn't used. I don't quite remember when that function might
> be called though.

It's used to export a snapshot after creating a logical slot. The snapshot can
be used to sync the existing data, before starting to apply future changes.

I don't see a need to modify TransactionXmin here, it's not a normal snapshot,
and as a comment in the function says, we rely on the slot's xmin to protect
against relevant rows being removed.


> In any case, I propose that we set TransactionXmin in all of those cases as
> well, so that TransactionXmin is always the equal to MyProc->xmin. Maybe
> even rename it to MyProcXmin to make that more clear.

I'm not sure it's really right to do that. If we don't hold a snapshot, what
makes sure that we then call SnapshotResetXmin() or such to reset
TransactionXmin?

Greetings,

Andres Freund



pgsql-hackers by date:

Previous
From: Jeff Davis
Date:
Subject: Re: Support regular expressions with nondeterministic collations
Next
From: Heikki Linnakangas
Date:
Subject: Re: TransactionXmin != MyProc->xmin