Home > mailing lists

Re: [GENERAL] Config for fast huge cascaded updates - Mailing list pgsql-general

From	Craig de Stigter
Subject	Re: [GENERAL] Config for fast huge cascaded updates
Date	July 2, 2017 23:15:15
Msg-id	CAF1M8pdL83C6=jP8CVM+RNwY9eWvOFN-ftmpFUpPZaa7+6c_Jw@mail.gmail.com Whole thread Raw
In response to	Re: [GENERAL] Config for fast huge cascaded updates (Andrew Sullivan <ajs@crankycanuck.ca>)
List	pgsql-general

Tree view

Thanks everyone. Sorry for the late reply.

Do you have indexes on all the referencing columns?

I had thought so, but it turns out no, and this appears to be the main cause of the slowness. After adding a couple of extra indexes in the bigger tables, things are going much more smoothly.

write the whole thing into a new SQL schema

This is a really interesting approach I hadn't thought of! We can currently afford a little bit of downtime, but it's helpful to keep this in mind if we ever do this kind of thing again in future.

The two changes we've made are:

Add a few indexes so that the cascades operate more efficiently
Move some of the tables (whose ID values don't matter so much to our app) into a separate migration, which can be run before we take down the site. Then only the tables whose IDs matter to the app/user are done while the site is down.

With those changes it looks like we can fit the downtime into the window we have. Thanks for all the advice, much appreciated!

On 28 June 2017 at 01:28, Andrew Sullivan <ajs@crankycanuck.ca> wrote:

On Mon, Jun 26, 2017 at 07:26:08PM -0700, Joshua D. Drake wrote:

> Alternatively, and ONLY do this if you take a backup right before hand, you
> can set the table unlogged, make the changes and assuming success, make the
> table logged again. That will great increase the write speed and reduce wal
> segment churn.

Note that this is not for just that table, but for all of the
implicated ones because of the CASCADE statements. It sounds like the
OP is basically rewriting a significant chunk of the entire database,
so nothing is going to be super fast: all those CASCADEs have to fire
and all those other tables need to be updated too.

> However, if that fails, the table is dead. You will have to reload it from
> backup.

Right, and that goes for all the affected tables.

Best regards,

A

--
Andrew Sullivan
ajs@crankycanuck.ca

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Regards,

Craig

Developer

Koordinates

+64 21 256 9488 / koordinates.com / @koordinates

pgsql-general by date:

From: Alvaro Aguayo Garcia-Rada
Date: 02 July 2017, 22:28:54
Subject: [GENERAL] pglogical trouble on BDR cluster

From: Steven Chang
Date: 03 July 2017, 04:08:28
Subject: Re: [GENERAL] duplicate key value violates unique constraint andduplicated records

Re: [GENERAL] Config for fast huge cascaded updates - Mailing list pgsql-general

Previous

Next