Thread: Re: [HACKERS] Autovacuum loose ends
On Thu, Jul 14, 2005 at 10:52:56AM -0400, Tom Lane wrote:
> I've applied Alvaro's latest integrated-autovacuum patch.  There are
> still a number of loose ends to be dealt with before beta, though:

Ok, here's a patch that deals with some of this:

- The stat collector is modified so as to keep shared relations separate
  from regular ones.  Also, backends send messages for them separately.
  Autovacuum takes advantage of this, so it correctly identifies the
  appropriate time to operate on a shared relation, irrespective of the
  database in which it was modified.  Note however that it uses each
  database's pg_autovacuum settings, which means a shared relation could
  be vacuumed sooner in one database than in another, but I don't think
  that's a problem.

- Temp tables are completely ignored.

- pg_statistic is completely ignored.

- Databases with no stat entry are still ignored, except that they are
  checked for Xid wraparound like any other.  The "oldest" one is chosen
  for vacuum in a particular autovacuum run.

- A database-wide vacuum forces a pg_database flat-file update, so that
  the wraparound check actually works.

- The postmaster's main loop sleeps Min(60, autovacuum_naptime), in
  order to be able to pick naptimes smaller than 60 seconds.  In order
  not to make the loop a busy-wait, I forced a minimum of 1 on that GUC
  var.

Some comments:

- Now that we have a real Xid wraparound check, we could go back to
  ignoring any table with no stat entry, which was the original coding.
  There's no danger of wraparound, and no work would be done on a table
  that doesn't have any activity.  We have to consider what happens at
  stat reset -- AFAICS there's no problem, because as soon as the table
  sees some activity, it will be picked up by pgstat.  However, it would
  be bad if stats are reset right after some heavy activity on a table.
  Maybe the only thing we need is documentation.

- datallowconn is still ignored.  Now it's safe to do so, because we
  have a real Xid wraparound check.  Changing it requires extending the
  pg_database flat-file (should be fairly easy).

- There are stat messages emitted for a database-wide vacuum, just like
  any other.  This means that all tables in the database would end up in
  pgstat, and also all databases, including those with datallowconn =
  false.  This may not be good.  I'm not sure what exactly to do about
  it.  Do we want to disallow such stats?  Disable message sending (or
  collecting) in some circumstances?

- I haven't done anything yet w.r.t. the custom vacuum_delay nor the
  sleep scale factor.

- There are still no docs.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"Because Kim did nothing -- but did it, mind you, with extraordinary
success" ("Kim", Kipling)
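[Editor's note: the naptime clamping described above can be sketched as
follows.  This is a minimal illustration with hypothetical names; the
actual postmaster loop is C code and is not reproduced here.]

```python
def autovacuum_sleep_seconds(autovacuum_naptime: int) -> int:
    """Sketch of the sleep the postmaster's main loop would compute."""
    # The GUC is forced to a minimum of 1 second so the loop
    # never degenerates into a busy-wait.
    naptime = max(1, autovacuum_naptime)
    # The loop sleeps Min(60, autovacuum_naptime), so naptimes
    # smaller than 60 seconds actually take effect.
    return min(60, naptime)

print(autovacuum_sleep_seconds(30))   # a 30s naptime is honored
print(autovacuum_sleep_seconds(600))  # capped at 60s between wakeups
```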
Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
> - pg_statistic is completely ignored.

... pg_statistic still needs vacuuming, surely.  It's only ANALYZE
that you can/should skip for it.

> - The postmaster's main loop sleeps Min(60, autovacuum_naptime), in
>   order to be able to pick naptimes smaller than 60 seconds.  In order
>   not to make the loop a busy-wait, I forced a minimum of 1 to that
>   GUC var.

Hmm, I wonder whether the minimum shouldn't be 10.  Or even 60.

> - Now that we have a real Xid wraparound check, we could go back to
>   having any table with no stat entry ignored, which was the original
>   coding.  There's no danger of wraparound, and there'd be no work
>   done to a table that doesn't have any activity.

Agreed.

> We have to consider what happens at stat reset -- AFAICS there's no
> problem, because as soon as the table sees some activity, it will be
> picked up by pgstat.  However, it would be bad if stats are reset
> right after some heavy activity on a table.  Maybe the only thing we
> need is documentation.

What's the use-case for having the stat reset feature at all?

> - datallowconn is still ignored.  Now it's safe to do so, because we
>   have a real Xid wraparound check.  Changing it requires extending
>   the pg_database flat-file (should be fairly easy).

I think this is all right, as long as a database that shows no stats
traffic is only connected to when it needs to be vacuumed for XID wrap
prevention purposes.

> - There are stat messages emitted for a database-wide vacuum, just
>   like any other.  This means that all tables in the database would
>   end up in pgstat; and also all databases, including those with
>   datallowconn = false.  This may not be good.  I'm not sure what
>   exactly to do about it.  Do we want to disallow such stats?
>   Disable message sending (or collecting) in some circumstances?

Needs thought...

			regards, tom lane
On Sun, Jul 24, 2005 at 02:33:38PM -0400, Tom Lane wrote:
> Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
>> - pg_statistic is completely ignored.
>
> ... pg_statistic still needs vacuuming, surely.  It's only ANALYZE
> that you can/should skip for it.

Sorry, yes, it's ignored only for analyze.

>> - The postmaster's main loop sleeps Min(60, autovacuum_naptime), in
>>   order to be able to pick naptimes smaller than 60 seconds.  In
>>   order not to make the loop a busy-wait, I forced a minimum of 1 to
>>   that GUC var.
>
> Hmm, I wonder whether the minimum shouldn't be 10.  Or even 60.

It's ok with me.  What do other people think?

>> We have to consider what happens at stat reset -- AFAICS there's no
>> problem, because as soon as the table sees some activity, it will be
>> picked up by pgstat.  However, it would be bad if stats are reset
>> right after some heavy activity on a table.  Maybe the only thing we
>> need is documentation.
>
> What's the use-case for having the stat reset feature at all?

I don't know.  Maybe the people who added it can tell?

>> - There are stat messages emitted for a database-wide vacuum, just
>>   like any other.  This means that all tables in the database would
>>   end up in pgstat; and also all databases, including those with
>>   datallowconn = false.  This may not be good.  I'm not sure what
>>   exactly to do about it.  Do we want to disallow such stats?
>>   Disable message sending (or collecting) in some circumstances?
>
> Needs thought...

Ok.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"I call it GNU/Linux.  Except the GNU/ is silent."  (Ben Reiter)
>> We have to consider what happens at stat reset -- AFAICS there's no
>> problem, because as soon as the table sees some activity, it will be
>> picked up by pgstat.  However, it would be bad if stats are reset
>> right after some heavy activity on a table.  Maybe the only thing we
>> need is documentation.
>
> What's the use-case for having the stat reset feature at all?

I believe I was the root cause of the pg_stat_reset() function.  The
idea at the time was that if you decide to do a round of index
optimisation, you want to be able to search for unused indexes and
heavily seq-scanned tables.

If you reset the stats you have 'clean' data to work with.  For
instance, you can get 24 hours of clean stats data.

Chris
Alvaro Herrera wrote:
> On Sun, Jul 24, 2005 at 02:33:38PM -0400, Tom Lane wrote:
>> Hmm, I wonder whether the minimum shouldn't be 10.  Or even 60.
>
> It's ok with me.  What do other people think?

Effectively, this is going to be the minimum amount of "down time" for
autovacuum between checking databases, right?  So if the minimum is 10
seconds, and I have six databases, then it will check each database at
most once per minute?

If so, then I'm not sure what to think: if I have a few hundred
databases, 10s might be too long.

>> What's the use-case for having the stat reset feature at all?
>
> I don't know.  Maybe the people who added it can tell?

I don't know either, but this brings up another question: stats
wraparound.  The n_tup_ins/upd/del columns in the stats system are
defined as bigint; what happens when the total number of upd, for
example, exceeds the capacity of bigint, or overflows to negative?
Anyone have any idea?

Matt
"Matthew T. O'Connor" <matthew@zeut.net> writes:
> I don't know either, but this brings up another question: stats
> wraparound.  The n_tup_ins/upd/del columns in the stats system are
> defined as bigint; what happens when the total number of upd, for
> example, exceeds the capacity of bigint, or overflows to negative?
> Anyone have any idea?

We'll all be safely dead, for one thing ;-)

At one update per nanosecond, it'd take approximately 300 years to wrap
a 64-bit counter.  Somehow I don't have a problem with the idea that
Postgres would need to be rebooted that often.  We'd want to fix the
32-bit nature of XIDs long before 64-bit stats counters get to be a
real-world issue ...

			regards, tom lane
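[Editor's note: Tom's back-of-the-envelope figure checks out.  A quick
sketch of the arithmetic, for a signed 64-bit counter (i.e. bigint going
negative at 2**63) incremented once per nanosecond:]

```python
def years_to_overflow_signed64(updates_per_second: float) -> float:
    """Years until a counter exceeds 2**63 - 1 and wraps negative."""
    seconds_per_year = 365.25 * 24 * 3600  # Julian year in seconds
    return 2**63 / (updates_per_second * seconds_per_year)

# One update per nanosecond = 1e9 updates per second.
years = years_to_overflow_signed64(1e9)
print(round(years))  # roughly 292 years -- "approximately 300"
```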
Tom Lane wrote:
> "Matthew T. O'Connor" <matthew@zeut.net> writes:
>> I don't know either, but this brings up another question: stats
>> wraparound.
>
> We'll all be safely dead, for one thing ;-)
>
> At one update per nanosecond, it'd take approximately 300 years to
> wrap a 64-bit counter.  Somehow I don't have a problem with the idea
> that Postgres would need to be rebooted that often.  We'd want to fix
> the 32-bit nature of XIDs long before 64-bit stats counters get to be
> a real-world issue ...

*sigh*  Sorry, I should have done a little math before I asked that
question.....
On Mon, Jul 25, 2005 at 09:31:15AM +0800, Christopher Kings-Lynne wrote:
>>> We have to consider what happens at stat reset -- AFAICS there's no
>>> problem, because as soon as the table sees some activity, it will be
>>> picked up by pgstat.  However, it would be bad if stats are reset
>>> right after some heavy activity on a table.  Maybe the only thing we
>>> need is documentation.
>>
>> What's the use-case for having the stat reset feature at all?
>
> I believe I was the root cause of the pg_stat_reset() function.  The
> idea at the time was that if you decide to do a round of index
> optimisation, you want to be able to search for unused indexes and
> heavily seq-scanned tables.
>
> If you reset the stats you have 'clean' data to work with.  For
> instance, you can get 24 hours of clean stats data.

Ok, so there's a reason for having a manual stat reset.  However,
what's the rationale for clearing the stats at postmaster start?  In
fact I think it's actively bad, because you lose any data you had
before the postmaster stop/crash.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"I personally became interested in Linux while I was dating an English
major who wouldn't know an operating system if it walked up and bit
him."  (Val Henson)
Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
> Ok, so there's a reason for having a manual stat reset.  However,
> what's the rationale for clearing the stats at postmaster start?  In
> fact I think it's actively bad, because you lose any data you had
> before the postmaster stop/crash.

We probably *should* drop the stats file if any WAL replay activity
occurs, because the stats file could be out of sync with reality ---
this is particularly important in a PITR recovery situation, where the
stats file is likely to be WAY out of sync.  (Maybe only clobber it in
PITR mode?)  I agree that the argument for doing it in a normal restart
is pretty weak.

			regards, tom lane
I thought I'd run a couple of tests to see if it would be helpful,
against CVS from Aug 3, 2005.

Here's a run with autovacuum turned off:
http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/42/
5186.55 notpm

Autovacuum on with default settings:
http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/38/
5462.23 notpm

Would it help more to try a series of parameter changes?

Mark
On Fri, 12 Aug 2005 17:49:41 -0400
Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:

> On Fri, Aug 12, 2005 at 10:49:43AM -0700, Mark Wong wrote:
>> I thought I'd run a couple of tests to see if it would be helpful,
>> against CVS from Aug 3, 2005.
>>
>> Here's a run with autovacuum turned off:
>> http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/42/
>> 5186.55 notpm
>>
>> Autovacuum on with default settings:
>> http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/38/
>> 5462.23 notpm
>
> Just noticed what seems to be a bug: in
>
> http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/42/db/index_info.input
>
> plot "index_info.data" using 1:2 title "i_customer" with lines, \
>      "index_info.data" using 1:2 title "i_orders" with lines, \
>      "index_info.data" using 1:3 title "pk_customer" with lines, \
>      "index_info.data" using 1:4 title "pk_district" with lines, \
>      "index_info.data" using 1:5 title "pk_item" with lines, \
>      "index_info.data" using 1:6 title "pk_new_order" with lines, \
>      "index_info.data" using 1:7 title "pk_order_line" with lines, \
>      "index_info.data" using 1:8 title "pk_orders" with lines, \
>      "index_info.data" using 1:9 title "pk_stock" with lines, \
>      "index_info.data" using 1:11 title "pk_warehouse" with lines
>
> Notice how the subindexes are wrong ... I think it should be 1:3 for
> i_orders, no?  Apparently indexes_scan.data has the same problem.

Whoops!  I think I fixed it for real now, and the charts should be
updated.  It was broken slightly more previously.

> It caught my attention that the pk_warehouse index seems to have very
> different usage in the two runs in index_info, but in indexes_scan
> they seem similar.

Thanks,
Mark
On Fri, Aug 12, 2005 at 10:49:43AM -0700, Mark Wong wrote:
> I thought I'd run a couple of tests to see if it would be helpful,
> against CVS from Aug 3, 2005.
>
> Here's a run with autovacuum turned off:
> http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/42/
> 5186.55 notpm
>
> Autovacuum on with default settings:
> http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/38/
> 5462.23 notpm

Just noticed what seems to be a bug: in

http://www.testing.osdl.org/projects/dbt2dev/results/dev4-015/42/db/index_info.input

plot "index_info.data" using 1:2 title "i_customer" with lines, \
     "index_info.data" using 1:2 title "i_orders" with lines, \
     "index_info.data" using 1:3 title "pk_customer" with lines, \
     "index_info.data" using 1:4 title "pk_district" with lines, \
     "index_info.data" using 1:5 title "pk_item" with lines, \
     "index_info.data" using 1:6 title "pk_new_order" with lines, \
     "index_info.data" using 1:7 title "pk_order_line" with lines, \
     "index_info.data" using 1:8 title "pk_orders" with lines, \
     "index_info.data" using 1:9 title "pk_stock" with lines, \
     "index_info.data" using 1:11 title "pk_warehouse" with lines

Notice how the subindexes are wrong ... I think it should be 1:3 for
i_orders, no?  Apparently indexes_scan.data has the same problem.

It caught my attention that the pk_warehouse index seems to have very
different usage in the two runs in index_info, but in indexes_scan they
seem similar.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"Women are like slingshots: the more resistance they have, the farther
you can go with them" (Jonas Nightingale, Leap of Faith)
On Fri, Aug 12, 2005 at 03:16:04PM -0700, Mark Wong wrote:
> On Fri, 12 Aug 2005 17:49:41 -0400
> Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:
>
>> Notice how the subindexes are wrong ... I think it should be 1:3 for
>> i_orders, no?  Apparently indexes_scan.data has the same problem.
>
> Whoops!  I think I fixed it for real now, and the charts should be
> updated.  It was broken slightly more previously.

Hmm, did you fix the 42 case only?  The other one is broken too ...

Also, it seems the "tran_lock.out" file captured the wrong input -- I
think you mean "WHERE transactionid IS NULL" in the query instead of
"WHERE transaction IS NULL".

I wonder what the big down-spikes (?) at minutes ~45 and ~85 correspond
to.  Are those checkpoints?  The IO vmstat chart would indicate that, I
think.

Anyway, it's interesting to see the performance go up with autovacuum
on.  I certainly didn't expect that in this kind of test.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"We need no flags / We recognize no borders" (Jorge González)
On Fri, 12 Aug 2005 18:42:09 -0400
Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:

> On Fri, Aug 12, 2005 at 03:16:04PM -0700, Mark Wong wrote:
>> On Fri, 12 Aug 2005 17:49:41 -0400
>> Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:
>>
>>> Notice how the subindexes are wrong ... I think it should be 1:3
>>> for i_orders, no?  Apparently indexes_scan.data has the same
>>> problem.
>>
>> Whoops!  I think I fixed it for real now, and the charts should be
>> updated.  It was broken slightly more previously.
>
> Hmm, did you fix the 42 case only?  The other one is broken too ...

The other dev4-015 cases should be fixed too.

> Also, it seems the "tran_lock.out" file captured the wrong input -- I
> think you mean "WHERE transactionid IS NULL" in the query instead of
> "WHERE transaction IS NULL".

Hmm, ok, I can try that in a future test run.  I'm not very familiar
with this table; what's the difference between transaction and
transactionid?

> I wonder what the big down-spikes (?) at minutes ~45 and ~85
> correspond to.  Are those checkpoints?  The IO vmstat chart would
> indicate that, I think.

That's correct, those should be checkpoints.

> Anyway, it's interesting to see the performance go up with autovacuum
> on.  I certainly didn't expect that in this kind of test.

I think in Mary's case it was hurting, but she's running the workload
dramatically differently.  I think she was planning to revisit that
after we sort out what's going on with the grouped WAL writes.

Mark
On Fri, Aug 12, 2005 at 03:49:57PM -0700, Mark Wong wrote:
> On Fri, 12 Aug 2005 18:42:09 -0400
> Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:
>> Also, it seems the "tran_lock.out" file captured the wrong input --
>> I think you mean "WHERE transactionid IS NULL" in the query instead
>> of "WHERE transaction IS NULL".
>
> Hmm, ok, I can try that in a future test run.  I'm not very familiar
> with this table; what's the difference between transaction and
> transactionid?

transaction is the Xid of the transaction holding or waiting for the
lock.  transactionid is not null in the case where the lock is on a
TransactionId.  I guess it depends on what you want, though -- now that
I think about it, capturing only transaction locks is very likely not
what you want.

http://developer.postgresql.org/docs/postgres/view-pg-locks.html

I wonder why you have that condition, though.  I don't think
"transaction" can ever be NULL in that view.

-- 
Alvaro Herrera (<alvherre[a]alvh.no-ip.org>)
"There are two times in a man's life when he should not speculate:
when he can afford it and when he cannot" (Mark Twain)