Re: Conflict detection for update_deleted in logical replication - Mailing list pgsql-hackers

From Dilip Kumar
Subject Re: Conflict detection for update_deleted in logical replication
Date
Msg-id CAFiTN-u_ZgStqnH_twTWVM1BLtguYC+yo-XrLo2QjBuRDXXTFA@mail.gmail.com
Whole thread Raw
In response to RE: Conflict detection for update_deleted in logical replication  ("Zhijie Hou (Fujitsu)" <houzj.fnst@fujitsu.com>)
List pgsql-hackers
On Fri, Aug 1, 2025 at 9:16 PM Zhijie Hou (Fujitsu)
<houzj.fnst@fujitsu.com> wrote:
>
> On Friday, August 1, 2025 7:42 PM Dilip Kumar <dilipbalaut@gmail.com> wrote:
> >
> > On Fri, Aug 1, 2025 at 5:02 PM Zhijie Hou (Fujitsu) <houzj.fnst@fujitsu.com>
> > wrote:
> > >
> > > > 2.
> > > >
> > > > +          If set to <literal>true</literal>, the detection of
> > > > +          <xref linkend="conflict-update-deleted"/> is enabled, and
> > > > + a
> > > > physical
> > > > +          replication slot named
> > > > <quote><literal>pg_conflict_detection</literal></quote>
> > > >            created on the subscriber to prevent the conflict information
> > from
> > > >            being removed.
> > > >
> > > > "to prevent the conflict information from being removed." should be
> > > > rewritten as "to prevent removal of tuple required for conflict detection"
> > >
> > > It appears the document you commented is already committed. I think
> > > the intention was to make a general statement that neither dead tuples
> > > nor commit timestamp data would be removed.
> >
> > Okay got it, so instead of "conflict information" should we say "information for
> > detecting conflicts" or "conflict detection information", conflict information
> > looks like we want to prevent the information about the conflict which has
> > already happened, instead we are preventing information which are required
> > for detecting the conflict, does this make sense?
>
> It makes sense to me, so changed.
>
> >
> > I know this is already committed, but actually this is part of the whole patch set
> > so we can always improvise it.
> >
> > > >
> > > > 3.
> > > > +   /* Return if the commit timestamp data is not available */
> > > > +   if (!track_commit_timestamp)
> > > > +       return false;
> > > >
> > > > Shouldn't caller should take care of this?  I mean if the
> > > > 'retaindeadtuples' and 'track_commit_timestamp' is not set then
> > > > caller shouldn't even call this function.
> > >
> > > I feel moving the checks into a single central function would
> > > streamline the caller, reducing code duplication. So, maybe we could
> > > move the retaindeadtuple check into this function as well for consistency.
> > Thoughts ?
> >
> > Fine with either way, actually I wanted both the check 'retaindeadtuple' and
> > 'track_commit_timestamp' at the same place.
>
> Thanks for confirming. Here is V56 patch set which addressed all the
> comments including the comments from Amit[1] and Shveta[2].
>
> I have merged V55-0002 into 0001 and updated the list of author
> and reviewers based on my knowledge.

Now this LGTM, I would suggest some modification in the commit
message, I think the purpose of this improvement is not very clear to
me, it says "supports detecting update_deleted conflicts during update
operations". But I think we can be more clearer on when we want to
detect such conflicts, here is my proposal, feel free to edit/revise/

"This commit improves conflict detection for updates. When an update
arrives for a row that has been locally deleted, the subscriber now
performs a secondary scan to check for the recently deleted tuple.
This check uses the old column values from remote tuple to find
a match among tuples that are visible to 'SnapshotAny' but not yet
vacuumed away.

If a matching, deleted tuple is found, an update_deleted conflict is
raised.  This provides a more robust and accurate conflict resolution
process, preventing unexpected behavior by correctly identifying cases
where a remote update clashes with a local deletion."

--
Regards,
Dilip Kumar
Google



pgsql-hackers by date:

Previous
From: Michael Paquier
Date:
Subject: Dead code with short varlenas in toast_save_datum()
Next
From: "David G. Johnston"
Date:
Subject: Re: implement CAST(expr AS type FORMAT 'template')