On Wed, Feb 10, 2021 at 12:08 AM Robert Haas <robertmhaas@gmail.com> wrote: > > On Tue, Feb 9, 2021 at 6:57 AM Amit Kapila <amit.kapila16@gmail.com> wrote: > > I think similar happens without any of the work done in PG-14 as well > > if we restart the apply worker before the commit completes on the > > subscriber. After the restart, we will send the start_decoding_at > > point based on some previous commit which will make publisher send the > > entire transaction again. I don't think restart of WAL sender or WAL > > receiver is such a common thing. It can only happen due to some bug in > > code or user wishes to stop the nodes or some crash happened. > > Really? My impression is that the logical replication protocol is > supposed to be designed in such a way that once a transaction is > successfully confirmed, it won't be sent again. Now if something is > not confirmed then it has to be sent again. But if it is confirmed > then it shouldn't happen. >
If by successfully confirmed, you mean that once the subscriber node has received, it won't be sent again then as far as I know that is not true. We rely on the flush location sent by the subscriber to advance the decoding locations. We update the flush locations after we apply the transaction's commit successfully. Also, after the restart, we use the replication origin's last flush location as a point from where we need the transactions and the origin's progress is updated at commit time.
OTOH, If by successfully confirmed, you mean that once the subscriber has applied the complete transaction (including commit), then you are right that it won't be sent again.
I think we need to treat a prepared transaction slightly different from an uncommitted transaction when sending downstream. We need to send a whole uncommitted transaction downstream again because previously applied changes must have been aborted and hence lost by the downstream and thus it needs to get all of those again. But when a downstream prepares a transaction, even if it's not committed, those changes are not lost even after restart of a walsender. If the downstream confirms that it has "flushed" PREPARE, there is no need to send all the changes again.