Home > mailing lists

Re: A concurrent VACUUM FULL? - Mailing list pgsql-hackers

From	DINESH NAIR
Subject	Re: A concurrent VACUUM FULL?
Date	June 30 18:23:13
Msg-id	PN4P287MB4381BAC1AFF24AC8A3CC6D449C46A@PN4P287MB4381.INDP287.PROD.OUTLOOK.COM Whole thread Raw
In response to	A concurrent VACUUM FULL? (Erik Nordström <erik@timescale.com>)
Responses	Re: A concurrent VACUUM FULL?
List	pgsql-hackers

Tree view

Hi Eric,

Currently the first suggested approach "Rebuild indexes on the temporary heap in TX1 and then swap in the new indexes along with the new heap in TX2." sound good.

It would be great if we are able to perform concurrent reads and writes.

In OLTP environments will it lead to slowing of the queries or query performance issues !!!!

Thanks

Dinesh Nair

From: Erik Nordström <erik@timescale.com>
Sent: Monday, June 30, 2025 3:19 PM
To: PostgreSQL Hackers <pgsql-hackers@lists.postgresql.org>
Subject: A concurrent VACUUM FULL?

You don't often get email from erik@timescale.com. Learn why this is important

Caution: This email was sent from an external source. Please verify the sender’s identity before clicking links or opening attachments.

Hi hackers,

I've been looking at the code for CLUSTER/VACUUM FULL, and whether it is possible to do a concurrent version of it using a multi-transactional approach similar to concurrent reindexing and partition detach.

The idea would be to hold weaker locks in TX1 when doing the heap rewrite (essentially allow reads but prevent writes), and then do the actual heap swap in a second TX2 transaction.

Having experimented a bit with this approach, I found that reindexing is an issue because that happens after the new heap has been swapped in. The current reindex during a heap swap effectively blocks reads so if one starts a new transaction after swapping heaps, it will block reads for a long time.

This made me think about two ways to handle this:

1. Rebuild indexes on the temporary heap in TX1 and then swap in the new indexes along with the new heap in TX2.

2. Do a concurrent index rebuild after the heap swap.

Of the two approaches above, (2) seems easiest to implement, but the downside is that indexes would be invalid while indexes are rebuilt. Therefore, (1) seems to be the more desirable one because all the heavy lifting would be done in TX1 on the temporary heap.

Does anyone have a sense of whether approach (1) is feasible or whether there are any major blockers?

Is this worth pursuing at all or am I missing something?

Best regards,

Erik

Database Architect, Timescale

pgsql-hackers by date:

From: Sami Imseih
Date: 30 June, 18:19:10
Subject: Re: pg_get_multixact_members not documented

From: Andres Freund
Date: 30 June, 19:27:10
Subject: Re: postmaster uses more CPU in 18 beta1 with io_method=io_uring

Re: A concurrent VACUUM FULL? - Mailing list pgsql-hackers

Previous

Next