Re: Column Filtering in Logical Replication - Mailing list pgsql-hackers
From | Tomas Vondra |
---|---|
Subject | Re: Column Filtering in Logical Replication |
Date | |
Msg-id | 43c15aa8-aa15-ca0f-40e4-3be68d98df05@enterprisedb.com Whole thread Raw |
In response to | Re: Column Filtering in Logical Replication (Amit Kapila <amit.kapila16@gmail.com>) |
List | pgsql-hackers |
On 3/11/22 03:46, Amit Kapila wrote: > On Fri, Mar 11, 2022 at 12:44 AM Tomas Vondra > <tomas.vondra@enterprisedb.com> wrote: >> >> On 3/10/22 04:09, Amit Kapila wrote: >>> On Wed, Mar 9, 2022 at 3:33 PM Amit Kapila <amit.kapila16@gmail.com> wrote: >>>> >>>> On Mon, Mar 7, 2022 at 8:48 PM Tomas Vondra >>>> <tomas.vondra@enterprisedb.com> wrote: >>>> >>>>> OK, I reworked this to do the same thing as the row filtering patch. >>>>> >>>> >>>> Thanks, I'll check this. >>>> >>> >>> Some assorted comments: >>> ===================== >>> 1. We don't need to send a column list for the old tuple in case of an >>> update (similar to delete). It is not required to apply a column >>> filter for those cases because we ensure that RI must be part of the >>> column list for updates and deletes. >> >> I'm not sure which part of the code does this refer to? >> > > The below part: > @@ -464,11 +473,11 @@ logicalrep_write_update(StringInfo out, > TransactionId xid, Relation rel, > pq_sendbyte(out, 'O'); /* old tuple follows */ > else > pq_sendbyte(out, 'K'); /* old key follows */ > - logicalrep_write_tuple(out, rel, oldslot, binary); > + logicalrep_write_tuple(out, rel, oldslot, binary, columns); > } > > I think here instead of columns, the patch needs to send NULL as it is > already doing in logicalrep_write_delete. > Hmmm, yeah. In practice it doesn't really matter, because NULL means "send all columns" so it actually relaxes the check. But we only send the RI keys, which is a subset of the column filter. But will fix. >>> 2. >>> + /* >>> + * Check if all columns referenced in the column filter are part of >>> + * the REPLICA IDENTITY index or not. >>> >>> I think this comment is reverse. The rule we follow here is that >>> attributes that are part of RI must be there in a specified column >>> list. This is used at two places in the patch. >> >> Yeah, you're right. Will fix. >> >>> 3. get_rel_sync_entry() >>> { >>> /* XXX is there a danger of memory leak here? beware */ >>> + oldctx = MemoryContextSwitchTo(CacheMemoryContext); >>> + for (int i = 0; i < nelems; i++) >>> ... >>> } >>> >>> Similar to the row filter, I think we need to use >>> entry->cache_expr_cxt to allocate this. There are other usages of >>> CacheMemoryContext in this part of the code but I think those need to >>> be also changed and we can do that as a separate patch. If we do the >>> suggested change then we don't need to separately free columns. >> >> I agree a shorter-lived context would be better than CacheMemoryContext, >> but "expr" seems to indicate it's for the expression only, so maybe we >> should rename that. >> > > Yeah, we can do that. How about rel_entry_cxt or something like that? > The idea is that eventually, we should move a few other things of > RelSyncEntry like attrmap where we are using CacheMemoryContext under > this context. > Yeah, rel_entry_cxt sounds fine I guess ... >> But do we really want a memory context for every >> single entry? >> > > Any other better idea? > No, I think you're right - it'd be hard/impossible to keep track of all the memory allocated for expression/estate. It'd be fine for the columns, because that's just a bitmap, but not for the expressions. regards -- Tomas Vondra EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
pgsql-hackers by date: