Re: Index bloat of 4x - Mailing list pgsql-general

From Stefan Kaltenbrunner
Subject Re: Index bloat of 4x
Date
Msg-id 45AE833B.60407@kaltenbrunner.cc
Whole thread Raw
In response to Re: Index bloat of 4x  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-general
Tom Lane wrote:
> Bill Moran <wmoran@collaborativefusion.com> writes:
>> The entire database was around 28M prior to the upgrades, etc.  Immediately
>> after the upgrades, it was ~270M.  Following a vacuum full, it dropped to
>> 165M.  Following a database-wide reindex, it dropped to 30M.
>
> As Alvaro said, vacuum full doesn't shrink indexes but in fact bloats them.
> (Worst case, they could double in size, if the vacuum moves every row;
> there's an intermediate state where there have to be index entries for
> both old and new copies of each moved row, to ensure things are
> consistent if the vacuum crashes right there.)
>
> So the above doesn't sound too unlikely.  Perhaps we should recommend
> vac full + reindex as standard cleanup procedure.  Longer term, maybe
> teach vac full to do an automatic reindex if it's moved more than X% of
> the rows.  Or forget the current vac full implementation entirely, and
> go over to something acting more like CLUSTER ...

we have to recommend the CLUSTER "way" to fix overly bloated databases
quite often to people on IRC because vacuum full is unreasonably slow on
highly fragmented databases.
Doing something like that internally for vacuum full sounds like a
reasonable idea except for the additional disk usage during the process
which might cause issues for people ...



Stefan

pgsql-general by date:

Previous
From: "John D. Burger"
Date:
Subject: Re: Building web sites using a database
Next
From: Tom Lane
Date:
Subject: Re: pg_dump without oids