Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine - Mailing list pgsql-performance

From Tom Lane
Subject Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine
Date
Msg-id 26178.1271371697@sss.pgh.pa.us
Whole thread Raw
In response to Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine  (Josh Berkus <josh@agliodbs.com>)
Responses Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine  (Josh Berkus <josh@agliodbs.com>)
List pgsql-performance
Josh Berkus <josh@agliodbs.com> writes:
>> But it shouldn't
>> be sleeping after each page with normal cost_delay parameters, should it?

> Right, that's why I find this puzzling.  If the problem was easier to
> reproduce it would be easier to analyze.

The behavior would be explained if VacuumCostLimit were getting set to
zero (or some unreasonably small value) in the autovac worker process.
I looked at the autovac code that manages that, and it seems complicated
enough that a bug wouldn't surprise me in the least.

I especially note that wi_cost_limit is explicitly initialized to zero,
rather than something sane; and that table_recheck_autovac falls back to
setting vac_cost_limit from the previous value of VacuumCostLimit
... which is NOT constant but in general is left over from the
previously processed table.  One should also keep in mind that SIGHUP
processing might reload VacuumCostLimit from GUC values.  So I think
that area needs a closer look.

Josh, are you sure that both servers are identical in terms of both
GUC-related and per-table autovacuum settings?

            regards, tom lane

pgsql-performance by date:

Previous
From: Greg Smith
Date:
Subject: Re: 8.3.9 - latency spikes with Linux (and tuning for consistently low latency)
Next
From: Josh Berkus
Date:
Subject: Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine