RE: Conflict detection for update_deleted in logical replication - Mailing list pgsql-hackers

From Hayato Kuroda (Fujitsu)
Subject RE: Conflict detection for update_deleted in logical replication
Date
Msg-id TYAPR01MB5692A6AC6EF2E22A2C9D099AF52A2@TYAPR01MB5692.jpnprd01.prod.outlook.com
Whole thread Raw
In response to Re: Conflict detection for update_deleted in logical replication  (Masahiko Sawada <sawada.mshk@gmail.com>)
Responses Re: Conflict detection for update_deleted in logical replication
Re: Conflict detection for update_deleted in logical replication
RE: Conflict detection for update_deleted in logical replication
List pgsql-hackers
Dear Hou,

Thanks for updating the patch! Here are my comments mainly for 0001.

01. protocol.sgml

I think the ordering of attributes in "Primary status update" seems not correct.
The second entry is LSN, not the oldest running xid.

02. maybe_advance_nonremovable_xid

```
+        case RCI_REQUEST_PUBLISHER_STATUS:
+            request_publisher_status(data);
+            break;
```

I think the part is not reachable because the transit
RCI_REQUEST_PUBLISHER_STATUS->RCI_WAIT_FOR_PUBLISHER_STATUS is done in
get_candidate_xid()->request_publisher_status().
Can we remove this?

03. RetainConflictInfoData

```
+    Timestamp   xid_advance_attempt_time;   /* when the candidate_xid is
+                                             * decided */
+    Timestamp   reply_time;     /* when the publisher responds with status */
+
+} RetainConflictInfoData;
```

The datatype should be TimestampTz.

04. get_candidate_xid

```
+    if (!TimestampDifferenceExceeds(data->xid_advance_attempt_time, now,
+                                    wal_receiver_status_interval * 1000))
+        return;
```

I think data->xid_advance_attempt_time can be accessed without the initialization
at the first try. I've found the patch could not pass test for 32-bit build
due to the reason.


05. request_publisher_status

```
+    if (!reply_message)
+    {
+        MemoryContext oldctx = MemoryContextSwitchTo(ApplyContext);
+
+        reply_message = makeStringInfo();
+        MemoryContextSwitchTo(oldctx);
+    }
+    else
+        resetStringInfo(reply_message);
```

Same lines exist in two functions: can we provide an inline function?

06. wait_for_publisher_status

```
+    if (!FullTransactionIdIsValid(data->last_phase_at))
+        data->last_phase_at = FullTransactionIdFromEpochAndXid(data->remote_epoch,
+                                                               data->remote_nextxid);
+
```

Not sure, is there a possibility that data->last_phase_at is valid here? It is initialized
just before transiting to RCI_WAIT_FOR_PUBLISHER_STATUS.

07. wait_for_publisher_status

I think all calculations and checking in the function can be done even on the
walsender. Based on this, I come up with an idea to reduce the message size:
walsender can just send a status (boolean) whether there are any running transactions
instead of oldest xid, next xid and their epoch. Or, it is more important to reduce the
amount of calc. on publisher side?

Best regards,
Hayato Kuroda
FUJITSU LIMITED


pgsql-hackers by date:

Previous
From: vignesh C
Date:
Subject: Re: Disallow UPDATE/DELETE on table with unpublished generated column as REPLICA IDENTITY
Next
From: Amit Kapila
Date:
Subject: Re: Conflict detection for update_deleted in logical replication