Re: speed up a logical replica setup - Mailing list pgsql-hackers

From Noah Misch
Subject Re: speed up a logical replica setup
Date
Msg-id 20240623062157.97.nmisch@google.com
Whole thread Raw
In response to Re: speed up a logical replica setup  (Peter Eisentraut <peter@eisentraut.org>)
Responses Re: speed up a logical replica setup
RE: speed up a logical replica setup
Re: speed up a logical replica setup
List pgsql-hackers
On Mon, Mar 25, 2024 at 12:55:39PM +0100, Peter Eisentraut wrote:
> I have committed your version v33.

> commit d44032d

> --- /dev/null
> +++ b/src/bin/pg_basebackup/pg_createsubscriber.c

> +static char *
> +concat_conninfo_dbname(const char *conninfo, const char *dbname)
> +{
> +    PQExpBuffer buf = createPQExpBuffer();
> +    char       *ret;
> +
> +    Assert(conninfo != NULL);
> +
> +    appendPQExpBufferStr(buf, conninfo);
> +    appendPQExpBuffer(buf, " dbname=%s", dbname);

pg_createsubscriber fails on a dbname containing a space.  Use
appendConnStrVal() here and for other params in get_sub_conninfo().  See the
CVE-2016-5424 commits for more background.  For one way to test this scenario,
see generate_db() in the pg_upgrade test suite.

> +static char *
> +create_logical_replication_slot(PGconn *conn, struct LogicalRepInfo *dbinfo)
> +{
> +    PQExpBuffer str = createPQExpBuffer();
> +    PGresult   *res = NULL;
> +    const char *slot_name = dbinfo->replslotname;
> +    char       *slot_name_esc;
> +    char       *lsn = NULL;
> +
> +    Assert(conn != NULL);
> +
> +    pg_log_info("creating the replication slot \"%s\" on database \"%s\"",
> +                slot_name, dbinfo->dbname);
> +
> +    slot_name_esc = PQescapeLiteral(conn, slot_name, strlen(slot_name));
> +
> +    appendPQExpBuffer(str,
> +                      "SELECT lsn FROM pg_catalog.pg_create_logical_replication_slot(%s, 'pgoutput', false, false,
false)",

This is passing twophase=false, but the patch does not mention prepared
transactions.  Is the intent to not support workloads containing prepared
transactions?  If so, the documentation should say that, and the tool likely
should warn on startup if max_prepared_transactions != 0.

> +static void
> +create_publication(PGconn *conn, struct LogicalRepInfo *dbinfo)
> +{

> +    appendPQExpBuffer(str, "CREATE PUBLICATION %s FOR ALL TABLES",
> +                      ipubname_esc);

This tool's documentation says it "guarantees that no transaction will be
lost."  I tried to determine whether achieving that will require something
like the fix from
https://postgr.es/m/flat/de52b282-1166-1180-45a2-8d8917ca74c6@enterprisedb.com.
(Not exactly the fix from that thread, since that thread has not discussed the
FOR ALL TABLES version of its race condition.)  I don't know.  On the one
hand, pg_createsubscriber benefits from creating a logical slot after creating
the publication.  That snapbuild.c process will wait for running XIDs.  On the
other hand, an INSERT/UPDATE/DELETE acquires its RowExclusiveLock and builds
its relcache entry before assigning an XID, so perhaps the snapbuild.c process
isn't enough to prevent that thread's race condition.  What do you think?



pgsql-hackers by date:

Previous
From: "David G. Johnston"
Date:
Subject: Re: Unable parse a comment in gram.y
Next
From: "Joel Jacobson"
Date:
Subject: Re: Add pg_get_acl() function get the ACL for a database object