RE: Data is copied twice when specifying both child and parent table in publication - Mailing list pgsql-hackers

From wangw.fnst@fujitsu.com
Subject RE: Data is copied twice when specifying both child and parent table in publication
Date
Msg-id OS3PR01MB62751C2FA25E23E24AB6980B9EC69@OS3PR01MB6275.jpnprd01.prod.outlook.com
Whole thread Raw
In response to RE: Data is copied twice when specifying both child and parent table in publication  ("shiy.fnst@fujitsu.com" <shiy.fnst@fujitsu.com>)
Responses RE: Data is copied twice when specifying both child and parent table in publication  ("osumi.takamichi@fujitsu.com" <osumi.takamichi@fujitsu.com>)
List pgsql-hackers
On Tue, Apr 28, 2022 9:22 AM Shi, Yu/侍 雨 <shiy.fnst@cn.fujitsu.com> wrote:
> Thanks for your patches.
> 
> Here's a comment on the patch for REL14.
Thanks for your comments.

> +    appendStringInfo(&cmd, "SELECT DISTINCT ns.nspname, c.relname\n"
> +                     " FROM
> pg_catalog.pg_publication_tables t\n"
> +                     "      JOIN pg_catalog.pg_namespace
> ns\n"
> +                     "         ON ns.nspname =
> t.schemaname\n"
> +                     "      JOIN pg_catalog.pg_class c\n"
> +                     "         ON c.relname = t.tablename AND
> c.relnamespace = ns.oid\n"
> +                     " WHERE t.pubname IN (%s)\n"
> +                     " AND (c.relispartition IS FALSE\n"
> +                     "      OR NOT EXISTS\n"
> +                     "        ( SELECT 1 FROM
> pg_partition_ancestors(c.oid) as relid\n"
> +                     "          WHERE relid IN\n"
> +                     "            (SELECT DISTINCT (schemaname
> || '.' || tablename)::regclass::oid\n"
> +                     "             FROM
> pg_catalog.pg_publication_tables t\n"
> +                     "             WHERE t.pubname IN (%s))\n"
> +                     "          AND relid != c.oid))\n",
> +                     pub_names.data, pub_names.data);
> 
> I think we can use an alias like 'pa' for pg_partition_ancestors, and modify the
> SQL as follows.
> 
> +    appendStringInfo(&cmd, "SELECT DISTINCT ns.nspname, c.relname\n"
> +                     " FROM
> pg_catalog.pg_publication_tables t\n"
> +                     "      JOIN pg_catalog.pg_namespace
> ns\n"
> +                     "         ON ns.nspname =
> t.schemaname\n"
> +                     "      JOIN pg_catalog.pg_class c\n"
> +                     "         ON c.relname = t.tablename AND
> c.relnamespace = ns.oid\n"
> +                     " WHERE t.pubname IN (%s)\n"
> +                     " AND (c.relispartition IS FALSE\n"
> +                     "      OR NOT EXISTS\n"
> +                     "        ( SELECT 1 FROM
> pg_partition_ancestors(c.oid) pa\n"
> +                     "          WHERE pa.relid IN\n"
> +                     "            (SELECT DISTINCT
> (t.schemaname || '.' || t.tablename)::regclass::oid\n"
> +                     "             FROM
> pg_catalog.pg_publication_tables t\n"
> +                     "             WHERE t.pubname IN (%s))\n"
> +                     "          AND pa.relid != c.oid))\n",
> +                     pub_names.data, pub_names.data);
Fix it.

In addition, I try to modify the approach for the HEAD.
I enhance the API of function pg_get_publication_tables. Change the parameter
type from 'text' to 'any'. Then we can use this function to get tables from one
publication or an array of publications. Any thoughts on this approach?

Attach new patches.
The patch for HEAD:
1. Modify the approach. Enhance the API of function pg_get_publication_tables to
handle one publication or an array of publications.
The patch for REL14:
1. Improve the table sync SQL. [suggestions by Shi yu]

Regards,
Wang wei

Attachment

pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: failures in t/031_recovery_conflict.pl on CI
Next
From: Michael Paquier
Date:
Subject: Re: Rewriting the test of pg_upgrade as a TAP test - take three - remastered set