Re: [HACKERS] A misconception about the meaning of 'volatile' in GetNewTransactionId? - Mailing list pgsql-hackers

From Thomas Munro
Subject Re: [HACKERS] A misconception about the meaning of 'volatile' in GetNewTransactionId?
Msg-id CAEepm=2AnkH3iFWbUkZ74f0n2wAjWJJgGHtwTTGLpY=7j9Scgw@mail.gmail.com
In response to Re: [HACKERS] A misconception about the meaning of 'volatile' in GetNewTransactionId?  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
On Sun, Apr 30, 2017 at 1:19 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Thomas Munro <thomas.munro@enterprisedb.com> writes:
>> I was reading xact.c and noticed this block:
>> ...
>> Isn't this insufficient on non-TSO systems like POWER and Arm?
>
> Yeah, I think you're right.  That code probably predates our support
> for memory barriers, so "volatile" was the best we could do at the
> time --- but as you say, it doesn't fix hardware-level rearrangements.

Here is an experimental patch, for discussion only, to drop some
apparently useless volatile qualifiers and introduce a write barrier
when extending the array and a corresponding read barrier when
scanning or copying the array from other processes.
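
A minimal sketch of that pairing, not the attached patch: the owner stores the new element, issues a write barrier, and only then publishes the larger count; a reader loads the count, issues a read barrier, and only then reads the elements. C11 fences stand in for pg_write_barrier() and pg_read_barrier() here, and every name (xid_cache, xid_cache_add, xid_cache_copy, MAX_CACHED_XIDS) is hypothetical rather than taken from the PostgreSQL source.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdint.h>

#define MAX_CACHED_XIDS 64          /* hypothetical capacity */

typedef uint32_t xid_t;             /* stand-in for TransactionId */

typedef struct
{
    xid_t       xids[MAX_CACHED_XIDS];
    atomic_int  nxids;              /* number of valid entries in xids[] */
} xid_cache;

/* Owner only: append one xid, then publish the new length. */
static void
xid_cache_add(xid_cache *cache, xid_t xid)
{
    int n = atomic_load_explicit(&cache->nxids, memory_order_relaxed);

    assert(n < MAX_CACHED_XIDS);
    cache->xids[n] = xid;
    /* Stand-in for pg_write_barrier(): element store before length store. */
    atomic_thread_fence(memory_order_release);
    atomic_store_explicit(&cache->nxids, n + 1, memory_order_relaxed);
}

/* Any reader: copy the currently published entries; returns how many. */
static int
xid_cache_copy(xid_cache *cache, xid_t *dst)
{
    int n = atomic_load_explicit(&cache->nxids, memory_order_relaxed);

    /* Stand-in for pg_read_barrier(): length load before element loads. */
    atomic_thread_fence(memory_order_acquire);
    for (int i = 0; i < n; i++)
        dst[i] = cache->xids[i];
    return n;
}
```

With this ordering a reader that observes the new count should also observe the element stored before it, which is the guarantee that volatile alone does not give on non-TSO hardware.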

I wonder about this code that shrinks the array:

#define XidCacheRemove(i) \
        do { \
                MyProc->subxids.xids[i] = MyProc->subxids.xids[MyPgXact->nxids - 1]; \
                MyPgXact->nxids--; \
        } while (0)

If a concurrent process saw the decremented nxids value before seeing
the effect of xids[i] = xids[final], then it would miss an arbitrary
running subtransaction (not the aborting subtransaction being removed
from the array, but whichever xid had the bad luck to be in the final
position).  In the patch I added pg_write_barrier(), but I suspect
that this might not really be a problem because of higher-level
interlocking that I'm missing: this code makes no mention of the
problem and doesn't (ab)use volatile qualifiers like the code that
extends the array.  If my guess about the hazard were right, then with
neither a compiler barrier/volatile nor a memory barrier this code
could be broken even under TSO assumptions, at the whim of the
compiler.

-- 
Thomas Munro
http://www.enterprisedb.com

Attachment
