Home > mailing lists

Re: New VACUUM FULL - Mailing list pgsql-hackers

From	Takahiro Itagaki
Subject	Re: New VACUUM FULL
Date	December 22, 2009 05:10:56
Msg-id	20091222181141.8B8E.52131E4D@oss.ntt.co.jp Whole thread Raw
In response to	Re: New VACUUM FULL (Simon Riggs <simon@2ndQuadrant.com>)
Responses	Re: New VACUUM FULL Re: New VACUUM FULL
List	pgsql-hackers

Tree view

Simon Riggs <simon@2ndQuadrant.com> wrote:

> I think we
> either need to implement that or document that vacuum will not skip
> all-visible pages when running VACUUM FULL.

All-visible does not always mean "filled enough", no?  We will need to
check both visibility and fillfactor when we optimize copying tuples.

> Old VACUUM FULL was substantially faster than this on tables that had
> nothing to remove.

Yeah, that's why traditional VACUUM FULL is still kept as INPLACE vacuum.

> Please can you arrange for the cluster operation to skip rebuilding
> indexes if the table is not reduced in size? 

Do you think we should dispose the rewritten table when we find the
VACUUM FULL (or CLUSTER) is useless?  We could save the time to reindex,
but we've already consumed time to rewrite tables. It will be still
slower than VACUUM FULL INPLACE because it is read-only.

Instead, I'd suggest to have conditional database-wide maintenance
commands, something like:   VACUUM FULL WHERE <the table is fragmented>

We don't have to support the feature by SQL syntax; it could be done
in client tools.  How about pg_maintain or pg_ctl maintain that cleans
up each relation with appropriate command? We could replace vacuumdb,
clusterdb, and reindexdb with it then.

> Part of the reason why these happen is that the code hasn't been
> refactored much at all from its original use for cluster. There are
> almost zero comments to explain the additional use of this code for
> VACUUM, or at least to explain it still all works even when there is no
> index.

Ok, I'll check for additional comments. The flow of code might be still
confusable because vacuum() calls cluster(), but we need major replacement
of codes to refactor it. I'm not sure we need the code rewrites for it.

Also, I think we need additional messages for VACUUM VERBOSE
(and CLUSTER VERBOSE), though it might be adjusted in another patch.

Regards,
---
Takahiro Itagaki
NTT Open Source Software Center

pgsql-hackers by date:

From: Simon Riggs
Date: 22 December 2009, 04:38:05
Subject: Re: alpha3 release schedule?

From: Martin Pihlak
Date: 22 December 2009, 05:30:54
Subject: Re: fdw validation function vs zero catalog id

Re: New VACUUM FULL - Mailing list pgsql-hackers

Previous

Next