Thread: Spinlock is missing when updating two_phase of ReplicationSlot

Spinlock is missing when updating two_phase of ReplicationSlot

From
Masahiko Sawada
Date:
Hi,

I realized that in CreateDecodingContext() function, we update both
slot->data.two_phase and two_phase_at without acquiring the spinlock:

    /* Mark slot to allow two_phase decoding if not already marked */
    if (ctx->twophase && !slot->data.two_phase)
    {
        slot->data.two_phase = true;
        slot->data.two_phase_at = start_lsn;
        ReplicationSlotMarkDirty();
        ReplicationSlotSave();
        SnapBuildSetTwoPhaseAt(ctx->snapshot_builder, start_lsn);
    }

I think we should acquire the spinlock when updating fields of the
replication slot even by its owner. Otherwise readers could see
inconsistent results. Looking at another place where we update
two_phase_at, we acquire the spinlock:

    SpinLockAcquire(&slot->mutex);
    slot->data.confirmed_flush = ctx->reader->EndRecPtr;
    if (slot->data.two_phase)
        slot->data.two_phase_at = ctx->reader->EndRecPtr;
    SpinLockRelease(&slot->mutex);

It seems to me an oversight of commit a8fd13cab0b. I've attached the
small patch to fix it.

Regards,

-- 
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com

Attachment

Re: Spinlock is missing when updating two_phase of ReplicationSlot

From
Michael Paquier
Date:
On Wed, Jan 11, 2023 at 11:07:05AM +0900, Masahiko Sawada wrote:
> I think we should acquire the spinlock when updating fields of the
> replication slot even by its owner. Otherwise readers could see
> inconsistent results. Looking at another place where we update
> two_phase_at, we acquire the spinlock:
>
>     SpinLockAcquire(&slot->mutex);
>     slot->data.confirmed_flush = ctx->reader->EndRecPtr;
>     if (slot->data.two_phase)
>         slot->data.two_phase_at = ctx->reader->EndRecPtr;
>     SpinLockRelease(&slot->mutex);
>
> It seems to me an oversight of commit a8fd13cab0b. I've attached the
> small patch to fix it.

Looks right to me, the paths updating the data related to the slots
are careful about that, even when it comes to fetching a slot from
MyReplicationSlot.  I have been looking around the slot code to see if
there are other inconsistencies, and did not notice anything standing
out.  Will fix..
--
Michael

Attachment

Re: Spinlock is missing when updating two_phase of ReplicationSlot

From
Michael Paquier
Date:
On Wed, Jan 11, 2023 at 02:36:17PM +0900, Michael Paquier wrote:
> Looks right to me, the paths updating the data related to the slots
> are careful about that, even when it comes to fetching a slot from
> MyReplicationSlot.  I have been looking around the slot code to see if
> there are other inconsistencies, and did not notice anything standing
> out.  Will fix..

And done, thanks!
--
Michael

Attachment

Re: Spinlock is missing when updating two_phase of ReplicationSlot

From
Masahiko Sawada
Date:
On Thu, Jan 12, 2023 at 1:42 PM Michael Paquier <michael@paquier.xyz> wrote:
>
> On Wed, Jan 11, 2023 at 02:36:17PM +0900, Michael Paquier wrote:
> > Looks right to me, the paths updating the data related to the slots
> > are careful about that, even when it comes to fetching a slot from
> > MyReplicationSlot.  I have been looking around the slot code to see if
> > there are other inconsistencies, and did not notice anything standing
> > out.  Will fix..
>
> And done, thanks!

Thank you!

Regards,

-- 
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com