Re: Index bloat of 4x - Mailing list pgsql-general

From Tom Lane
Subject Re: Index bloat of 4x
Date
Msg-id 24871.1169052974@sss.pgh.pa.us
Whole thread Raw
In response to Re: Index bloat of 4x  (Bill Moran <wmoran@collaborativefusion.com>)
Responses Re: Index bloat of 4x  (Stefan Kaltenbrunner <stefan@kaltenbrunner.cc>)
Re: Index bloat of 4x  (Vivek Khera <vivek@khera.org>)
Re: Index bloat of 4x  (Bruce Momjian <bruce@momjian.us>)
List pgsql-general
Bill Moran <wmoran@collaborativefusion.com> writes:
> The entire database was around 28M prior to the upgrades, etc.  Immediately
> after the upgrades, it was ~270M.  Following a vacuum full, it dropped to
> 165M.  Following a database-wide reindex, it dropped to 30M.

As Alvaro said, vacuum full doesn't shrink indexes but in fact bloats them.
(Worst case, they could double in size, if the vacuum moves every row;
there's an intermediate state where there have to be index entries for
both old and new copies of each moved row, to ensure things are
consistent if the vacuum crashes right there.)

So the above doesn't sound too unlikely.  Perhaps we should recommend
vac full + reindex as standard cleanup procedure.  Longer term, maybe
teach vac full to do an automatic reindex if it's moved more than X% of
the rows.  Or forget the current vac full implementation entirely, and
go over to something acting more like CLUSTER ...

            regards, tom lane

pgsql-general by date:

Previous
From: Richard Huxton
Date:
Subject: Re: SELECT INTO TEMPORARY problem
Next
From: Alvaro Herrera
Date:
Subject: Re: Index bloat of 4x