Re: PATCH: logical_work_mem and logical streaming of largein-progress transactions - Mailing list pgsql-hackers
From | Amit Kapila |
---|---|
Subject | Re: PATCH: logical_work_mem and logical streaming of largein-progress transactions |
Date | |
Msg-id | CAA4eK1JUyU32AMYNYT=xD5y4P1zq8uyWNbHz=aQ8Be5sVp0UBw@mail.gmail.com Whole thread Raw |
In response to | Re: PATCH: logical_work_mem and logical streaming of largein-progress transactions (Dilip Kumar <dilipbalaut@gmail.com>) |
Responses |
Re: PATCH: logical_work_mem and logical streaming of largein-progress transactions
Re: PATCH: logical_work_mem and logical streaming of largein-progress transactions |
List | pgsql-hackers |
On Fri, May 15, 2020 at 2:47 PM Dilip Kumar <dilipbalaut@gmail.com> wrote: > > > 6. I think it will be good if we can provide an example of streaming > > changes via test_decoding at > > https://www.postgresql.org/docs/devel/test-decoding.html. I think we > > can also explain there why the user is not expected to see the actual > > data in the stream. > > I have a few problems to solve here. > - With streaming transaction also shall we show the actual values or > we shall do like it is currently in the patch > (appendStringInfo(ctx->out, "streaming change for TXN %u", > txn->xid);). I think we should show the actual values instead of what > we are doing now. > I think why we don't want to display the tuple at this stage is because it is not clear by this time if the transaction will commit or abort. I am not sure if displaying the contents of aborted transactions is a good idea but if there is a reason for doing so, we can do it later as well. > - In the example we can not show a real example, because of the > in-progress transaction to show the changes, we might have to > implement a lot of tuple. I think we can show the partial output? > I think we can display what API will actually display, what is the confusion here. I have a few more comments on the previous version of patch v20-0005-Implement-streaming-mode-in-ReorderBuffer. If you have fixed any, then leave those and fix others. Review comments: ------------------------------ 1. @@ -1762,10 +1952,16 @@ ReorderBufferCommit(ReorderBuffer *rb, TransactionId xid, } case REORDER_BUFFER_CHANGE_MESSAGE: - rb->message(rb, txn, change->lsn, true, - change->data.msg.prefix, - change->data.msg.message_size, - change->data.msg.message); + if (streaming) + rb->stream_message(rb, txn, change->lsn, true, + change->data.msg.prefix, + change->data.msg.message_size, + change->data.msg.message); + else + rb->message(rb, txn, change->lsn, true, + change->data.msg.prefix, + change->data.msg.message_size, + change->data.msg.message); Don't we need to set any_data_sent flag while streaming messages as we do for other types of changes? 2. + if (streaming) + { + /* + * Set the last of the stream as the final lsn before calling + * stream stop. + */ + if (!XLogRecPtrIsInvalid(prev_lsn)) + txn->final_lsn = prev_lsn; + rb->stream_stop(rb, txn); + } I am not sure if it is good to use final_lsn for this purpose. See comments for this variable in reorderbuffer.h. Basically, it is used for a specific purpose on different occasions. Now, if we want to start using it for a new purpose, we need to study its interaction with all other places and update the comments as well. Can we pass an additional parameter to stream_stop() instead? 3. + /* remember the command ID and snapshot for the streaming run */ + txn->command_id = command_id; + + /* Avoid copying if it's already copied. */ + if (snapshot_now->copied) + txn->snapshot_now = snapshot_now; + else + txn->snapshot_now = ReorderBufferCopySnap(rb, snapshot_now, + txn, command_id); This code is used at two different places, can we try to keep this in a single function. 4. In ReorderBufferProcessTXN(), the patch is calling stream_stop in both the try and catch block. If there is an error after calling it in a try block, we might call it again via catch. I think that will lead to sending a stop message twice. Won't that be a problem? See the usage of iterstate in the catch block, we have made it safe from a similar problem. 5. + if (streaming) + { + /* Discard the changes that we just streamed. */ + ReorderBufferTruncateTXN(rb, txn); - PG_RE_THROW(); + /* Re-throw only if it's not an abort. */ + if (errdata->sqlerrcode != ERRCODE_TRANSACTION_ROLLBACK) + { + MemoryContextSwitchTo(ecxt); + PG_RE_THROW(); + } + else + { + FlushErrorState(); + FreeErrorData(errdata); + errdata = NULL; + I think here we can write few comments on why we are doing error-code specific handling, basically, explain a bit about concurrent abort handling and or refer to the part of comments where it is explained. 6. PG_CATCH(); { + MemoryContext ecxt = MemoryContextSwitchTo(ccxt); + ErrorData *errdata = CopyErrorData(); I don't understand the usage of memory context in this part of the code. Basically, you are switching to CurrentMemoryContext here, do some error handling and then again reset back to some random context before rethrowing the error. If there is some purpose for it, then it might be better if you can write a few comments to explain the same. 7. +ReorderBufferCommit() { .. + /* + * If the transaction was (partially) streamed, we need to commit it in a + * 'streamed' way. That is, we first stream the remaining part of the + * transaction, and then invoke stream_commit message. + * + * XXX Called after everything (origin ID and LSN, ...) is stored in the + * transaction, so we don't pass that directly. + * + * XXX Somewhat hackish redirection, perhaps needs to be refactored? + */ + if (rbtxn_is_streamed(txn)) + { + ReorderBufferStreamCommit(rb, txn); + return; + } + .. } "XXX Somewhat hackish redirection, perhaps needs to be refactored?" What kind of refactoring we can do here? To me, it looks okay. 8. @@ -2295,6 +2677,13 @@ ReorderBufferXidSetCatalogChanges(ReorderBuffer *rb, TransactionId xid, txn = ReorderBufferTXNByXid(rb, xid, true, NULL, lsn, true); txn->txn_flags |= RBTXN_HAS_CATALOG_CHANGES; + + /* + * TOCHECK: Mark toplevel transaction as having catalog changes too + * if one of its children has. + */ + if (txn->toptxn != NULL) + txn->toptxn->txn_flags |= RBTXN_HAS_CATALOG_CHANGES; } Why are we marking top transaction here? -- With Regards, Amit Kapila. EnterpriseDB: http://www.enterprisedb.com
pgsql-hackers by date: