Thread: Autovacuum in the backend
One goal for 8.1 is to move /contrib/pg_autovacuum into the backend. I think it has to be done in four stages:

 o move it into the backend and have it start/stop automatically
 o move the autovacuum configuration parameters into postgresql.conf
 o modify the code to use the backend API for error recovery
 o modify the code to use the backend API utilities, like hashes

Who would like to get started on this? It seems pretty straightforward.

---------------------------------------------------------------------------

Tom Lane wrote:
> "Thomas F. O'Connell" <tfo@sitening.com> writes:
> > Honestly, I'd prefer to see pg_autovacuum improved to do O(n) rather than O(n^2) table activity. At this point, though, I'm probably not too likely to have much time to hack pg_autovacuum before 8.1 is released, although if it doesn't become integrated by beta feature freeze, I might give it a shot.
>
> This would be vastly easier to fix if the code were integrated into the backend first. In the backend environment you could just keep the info in a dynahash.c hashtable instead of in a linear list. On the client side, you have to roll your own hashing (or adapt dynahash to life outside the backend environment).
>
> regards, tom lane

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
Bruce Momjian <pgman@candle.pha.pa.us> writes:
> One goal for 8.1 is to move /contrib/pg_autovacuum into the backend. I think it has to be done in four stages:
> o move it into the backend and have it start/stop automatically
> o move the autovacuum configuration parameters into postgresql.conf
> o modify the code to use the backend API for error recovery
> o modify the code to use the backend API utilities, like hashes
> Who would like to get started on this? It seems pretty straightforward.

A small problem here is that until you get at least to step 3 (backend-standard error handling), none of it is going to be acceptable to commit. So I don't entirely buy Bruce's notion of bite-size pieces of work. You can certainly work on it in that fashion, but it's not going into 8.1 unless most of the above is done by the end of the month.

regards, tom lane
"Bruce Momjian" <pgman@candle.pha.pa.us> writes > > One goal for 8.1 is to move /contrib/pg_autovacuum in to the backend. I > think it has to be done in four stages: > > o move it into the backend and have it start/stop automatically The start/stop routine is quite like Bgwriter. It requires PgStats to be turned on. If it aborts unexpectedly, hopefully we could restart it. Shall we have a RequestVacuum() to pass the control to this process so to avoid possible redundant vacuums from user side? > o move the autovacuum configuration parameters into postgresql.conf There are some correlations of GUC parameters in order to incorporate it: * stats_start_collector = true * stats_row_level = true add a parameter to let user pass in the configuration parameters: * autovacuum_command = "-s 100 -S 1 ..." So if autovacuum_command is given, we will automatically set the upper two parameters true? > o modify the code to use the backend API for error recovery > o modify the code to use the backend API utilities, like hashes Change "connect/disconnect server" to "start/stop autovacuum process"; Change "execute query" to "backend APIs"; Change "list" to "hash"; Need think more to handle various error conditions ... > > Who would like to get started on this? It seems pretty straight-forward. > I'd like to give it a try. Regards, Qingqing
"Tom Lane" <tgl@sss.pgh.pa.us> writes > > A small problem here is that until you get at least to step 3 > (backend-standard error handling), none of it is going to be acceptable > to commit. So I don't entirely buy Bruce's notion of bite-size pieces > of work. You can certainly work on it in that fashion, but it's not > going into 8.1 unless most of the above is done by the end of the month. > Scared ... Regards, Qingqing
Qingqing Zhou wrote:
> The start/stop routine is quite like Bgwriter's. It requires PgStats to be turned on.

Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the stats collector?

-Neil
"Neil Conway" <neilc@samurai.com> writes > > Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the > stats collector? > I don't understand. Currently the basic logic of pg_autovacuum is to use the pg_stat_all_tables numbers like n_tup_upd, n_tup_del to determine if a relation need to be vacuumed. How to use FSM to get these information? Regards, Qingqing
On T, 2005-06-14 at 21:23 -0400, Bruce Momjian wrote:
> One goal for 8.1 is to move /contrib/pg_autovacuum into the backend. I think it has to be done in four stages:
>
> o move it into the backend and have it start/stop automatically
> o move the autovacuum configuration parameters into postgresql.conf
> o modify the code to use the backend API for error recovery
> o modify the code to use the backend API utilities, like hashes
>
> Who would like to get started on this? It seems pretty straightforward.

Can autovacuum yet be configured _not_ to run vacuum during some hours, or above some load? Even better would be to stop or pause a long-running vacuum if load goes above some limit.

If it goes into the backend before the above is done, it should at least be possible to switch it off completely.

-- 
Hannu Krosing <hannu@skype.net>
On K, 2005-06-15 at 13:41 +0800, Qingqing Zhou wrote:
> I don't understand. Currently the basic logic of pg_autovacuum is to use the pg_stat_all_tables numbers like n_tup_upd and n_tup_del to determine whether a relation needs to be vacuumed. How would we use the FSM to get this information?

One probably can't use the FSM as it is, since the FSM is filled in by vacuum, and that creates a circular dependency.

But it would be very nice to have something _similar_ to the FSM, say a DSM (dead space map), which is filled in when a tuple is marked as "dead for all running backends". That could be used to implement a vacuum which visits only those pages that actually contain removable tuples.

-- 
Hannu Krosing <hannu@skype.net>
"Qingqing Zhou" <zhouqq@cs.toronto.edu> writes > > The start/stop routine is quite like Bgwriter. > I just realized that the non-standard backend can't execute any SQL command. If so, how would the background pg_autovacuum do "vacuum"? It could be forked more like a standard backend, but this is obviously not a good idea, since we don't know which database this process will sit on. A possible solution is that backgroud pg_autovacuum could fork another process to connect to postmaster as an ordinary backend each time it feels that a "vacuum" is needed. Any ideas? Regards, Qingqing
Hannu Krosing wrote:
> One probably can't use the FSM as it is, since the FSM is filled in by vacuum, and that creates a circular dependency.
>
> But it would be very nice to have something _similar_ to the FSM, say a DSM (dead space map), which is filled in when a tuple is marked as "dead for all running backends". That could be used to implement a vacuum which visits only those pages that actually contain removable tuples.

Yes, those are step five. The TODO list has:

 * Auto-vacuum
   o Move into the backend code
   o Scan the buffer cache to find free space or use background writer
   o Use free-space map information to guide refilling
   o Do VACUUM FULL if table is nearly empty?

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
I am going to start working on it. I am concerned it is a big job.

I will post questions as I find them, and the one below is a good one.

---------------------------------------------------------------------------

Qingqing Zhou wrote:
> I just realized that a non-standard backend can't execute any SQL command. If so, how would the background pg_autovacuum do "vacuum"? It could be forked more like a standard backend, but that is obviously not a good idea, since we don't know which database this process will sit on.
>
> A possible solution is that the background pg_autovacuum could fork another process to connect to the postmaster as an ordinary backend each time it feels that a "vacuum" is needed.
>
> Any ideas?

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
On Wed, Jun 15, 2005 at 11:23:20AM +0800, Qingqing Zhou wrote:
> > Who would like to get started on this? It seems pretty straightforward.
>
> I'd like to give it a try.

I'm on it. I have Matthew's patch, updated to current sources, and I'm working on cleaning it up to address all known concerns. I expect to be able to have something for patches early next week, which can be discussed.

-- 
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Postgres is bloatware by design: it was built to house PhD theses." (Joey Hellerstein, SIGMOD annual conference 2002)
Alvaro Herrera wrote:
> I'm on it. I have Matthew's patch, updated to current sources, and I'm working on cleaning it up to address all known concerns. I expect to be able to have something for patches early next week, which can be discussed.

Oh, excellent. Thanks. Please look at the patch I just applied to pg_autovacuum today and merge that into what you have. Great!

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
Qingqing,

> add a parameter to let the user pass in the configuration parameters:
> * autovacuum_command = "-s 100 -S 1 ..."

Um, can we have these as separate GUCs and not lumped together as a string? i.e.:

autovacuum_frequency = 60   # seconds, 0 = disable
autovacuum_vacuum_threshold = 200
autovacuum_vacuum_multiple = 0.5
autovacuum_analyze_threshold = 100
autovacuum_analyze_multiple = 0.4

AV should be disabled by default. It should also automatically use the global vacuum_delay settings.

> But it would be very nice to have something _similar_ to the FSM, say a DSM (dead space map), which is filled in when a tuple is marked as "dead for all running backends". That could be used to implement a vacuum which visits only those pages that actually contain removable tuples.

Speaking of FSM improvements, it would be **really** useful to have a pg_stats view that let you know how full the FSM was, overall. Something like:

pg_stats_fsm_usage
fsm_relations  fsm_relations_used  fsm_pages  fsm_pages_used
1000           312                 200000     11579

This would allow for other schemes of vacuum automation.

-- 
Josh Berkus
Aglio Database Solutions
San Francisco
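For reference, these per-table thresholds would plug into the vacuum decision roughly as in the existing contrib module: a base threshold plus a multiple of the table size, compared against the update/delete counters. A minimal C sketch, with all names treated as assumptions rather than final GUCs:

    #include <stdbool.h>

    static double autovacuum_vacuum_threshold = 200.0;  /* base threshold */
    static double autovacuum_vacuum_multiple  = 0.5;    /* fraction of table size */

    typedef struct TableStats
    {
        double reltuples;   /* estimated live tuples (pg_class.reltuples) */
        double n_tup_upd;   /* updates since last vacuum */
        double n_tup_del;   /* deletes since last vacuum */
    } TableStats;

    static bool
    table_needs_vacuum(const TableStats *t)
    {
        /* base threshold plus a fraction of the table's size */
        double threshold = autovacuum_vacuum_threshold +
                           autovacuum_vacuum_multiple * t->reltuples;

        return (t->n_tup_upd + t->n_tup_del) > threshold;
    }

The analyze decision would be analogous, using the analyze thresholds and also counting inserts.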
Bruce Momjian wrote:
> Yes, those are step five. The TODO list has:
>
>  * Auto-vacuum
>    o Move into the backend code
>    o Scan the buffer cache to find free space or use background writer
>    o Use free-space map information to guide refilling
>    o Do VACUUM FULL if table is nearly empty?

I think the two can be used in conjunction with one another, and perhaps one day it could be used with or without the stats system. Integrating FSM information has to be helpful, but it also isn't going to tell us when to do an ANALYZE due to lots of inserts; also, the FSM (if not big enough) is lossy and might not be tracking all the tables. So I think that for 8.1, using only stats would be OK.
Bruce Momjian wrote:
> I am going to start working on it. I am concerned it is a big job.
>
> I will post questions as I find them, and the one below is a good one.

It is a big question; at least, it's the main stumbling block I had, and it's the one that kept my work from being integrated into 8.0 (my work side-stepped the issue by using libpq to connect to the server to fire off commands).

> Qingqing Zhou wrote:
> > I just realized that a non-standard backend can't execute any SQL command. If so, how would the background pg_autovacuum do "vacuum"? It could be forked more like a standard backend, but that is obviously not a good idea, since we don't know which database this process will sit on.
> >
> > A possible solution is that the background pg_autovacuum could fork another process to connect to the postmaster as an ordinary backend each time it feels that a "vacuum" is needed.
Josh Berkus wrote:
> um, can we have these as separate GUCs and not lumped together as a string? i.e.:
> autovacuum_frequency = 60   # seconds, 0 = disable
> autovacuum_vacuum_threshold = 200
> autovacuum_vacuum_multiple = 0.5
> autovacuum_analyze_threshold = 100
> autovacuum_analyze_multiple = 0.4
>
> AV should be disabled by default. It should also automatically use the global vacuum_delay settings.

Agreed, disabled by default (at least for 8.1; perhaps a topic of conversation for 8.2+). Yes, it should obey the global vacuum_delay settings, and yes, it should have its own GUCs as you suggested (all of this was the case with the patch that I submitted for 8.0, which Alvaro is now working on).

> Speaking of FSM improvements, it would be **really** useful to have a pg_stats view that let you know how full the FSM was, overall.
>
> This would allow for other schemes of vacuum automation.

Interesting; perhaps if FSM data is exported to the stats system, autovacuum could use that. What might be best is both a view that shows overall FSM information, and also exporting FSM information on a per-table basis, perhaps as additional columns added to existing stats tables.
Hannu Krosing wrote:
> Can autovacuum yet be configured _not_ to run vacuum during some hours or above some load?

That is certainly a goal; hopefully it will get done for 8.1. The actual design I had in mind (based on prior discussion on -hackers) is to allow a maintenance window that would have lower vacuum thresholds; this way, only the tables that really need it will get vacuumed during the day.

> Even better - to stop or pause a long-running vacuum if load goes above some limit.

I don't think the current implementation of VACUUM can support that. I believe that all the work will get rolled back if it gets canceled.

Perhaps a decent solution would be to have autovacuum increase the vacuum delay settings dynamically based on system load average. That way, if a vacuum starts and the system starts to get busy, the autovacuum daemon can increase the vacuum delay settings, and VACUUM would honor this while running.

> If it goes into the backend before the above is done, it should at least be possible to switch it off completely.

Absolutely. In fact it will not only have the option to turn it off, it will be off by default.
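A minimal sketch of the load-based backoff described above, assuming getloadavg() is available (it is on Linux and the BSDs); the helper name and the proportional scaling rule are assumptions, while vacuum_cost_delay is the real cost-based delay GUC:

    #include <stdlib.h>

    extern int vacuum_cost_delay;   /* the real GUC, in milliseconds */

    static void
    adjust_vacuum_delay(int base_delay_ms, double load_limit)
    {
        double load[1];

        /* back off proportionally when the 1-minute load exceeds the limit */
        if (getloadavg(load, 1) == 1 && load[0] > load_limit)
            vacuum_cost_delay = (int) (base_delay_ms * (load[0] / load_limit));
        else
            vacuum_cost_delay = base_delay_ms;
    }

Note that, as Bruce points out in the next message, this only helps if the running VACUUM honors GUC changes while it works.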
Matthew T. O'Connor wrote:
> I don't think the current implementation of VACUUM can support that. I believe that all the work will get rolled back if it gets canceled.
>
> Perhaps a decent solution would be to have autovacuum increase the vacuum delay settings dynamically based on system load average. That way, if a vacuum starts and the system starts to get busy, the autovacuum daemon can increase the vacuum delay settings, and VACUUM would honor this while running.

I would like to have the GUC variables be honored while the system is running; that would allow administrators to make changes from scripts.

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
On Wed, 15 Jun 2005, Bruce Momjian wrote:
> I am going to start working on it. I am concerned it is a big job.
>
> I will post questions as I find them, and the one below is a good one.

I'm wondering if effort is being misdirected here. I remember when Mark Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing significant performance loss -- I think on the order of 30% to 40% (I will try and dig up a link to the results).

I think these results can be dramatically improved if the focus is on a more effective vacuum.

In January I was in Toronto with Jan, Tom and others, and some ideas about vacuum were being discussed. The basic idea is that when we dirty pages, we set a bit in a bitmap to say that the page has been dirtied. A convenient place to do this is when we are writing dirty buffers out to disk. In many situations, this can happen inside the bgwriter, meaning that there should be little contention for this bitmap. Of course, individual backends may be writing pages out and would have to account for the dirty pages at that point.

Now, this bitmap can occur on a per heap segment basis (ie, per 1 GB heap file). You only need 2 pages for the bitmap to represent all the pages in the segment, which is fairly nice. When vacuum is run, instead of visiting every page, it would see which pages have been dirtied in the bitmap and visit only those pages. With large tables and small numbers of modified tuples/pages, the effect this change would have would be pretty impressive.

This also means that we could effectively implement some of the ideas which are being floated around, such as having vacuum run only for a short time period.

One problem is whether or not we have to guarantee that we account for every dirtied page. I think that would be difficult in the presence of a crash. One idea Neil mentioned is that on a crash, we could set all pages in the bitmap to dirty, and the first vacuum would effectively be a vacuum full. The alternative is to say that we don't guarantee that this type of vacuum is completely comprehensive and that it isn't a replacement for vacuum full.

Thoughts? Comments?

Thanks,

Gavin
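To make the arithmetic concrete: with 8 KB pages, a 1 GB heap segment holds 131072 pages, so one bit per page takes 16 KB -- exactly the two pages mentioned above. A minimal C sketch of such a per-segment bitmap; all names are illustrative, not proposed backend API:

    #include <stdbool.h>
    #include <stdint.h>

    #define BLCKSZ        8192
    #define SEGMENT_PAGES ((1024 * 1024 * 1024) / BLCKSZ)   /* 131072 */

    typedef struct SegmentDirtyMap
    {
        uint8_t bits[SEGMENT_PAGES / 8];    /* 16384 bytes = two 8 KB pages */
    } SegmentDirtyMap;

    /* set when a dirty buffer belonging to this segment is written out */
    static void
    mark_page_dirtied(SegmentDirtyMap *map, uint32_t pageno)
    {
        map->bits[pageno / 8] |= (uint8_t) (1 << (pageno % 8));
    }

    /* vacuum visits only pages whose bit is set, then resets the map */
    static bool
    page_was_dirtied(const SegmentDirtyMap *map, uint32_t pageno)
    {
        return (map->bits[pageno / 8] & (1 << (pageno % 8))) != 0;
    }

Crash safety remains the open question raised above: setting every bit after recovery makes the next vacuum a full pass, which is the conservative fallback.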
>> um, can we have these as separate GUCs and not lumped together as a string? i.e.:
>> autovacuum_frequency = 60   # seconds, 0 = disable
>> autovacuum_vacuum_threshold = 200
>> autovacuum_vacuum_multiple = 0.5
>> autovacuum_analyze_threshold = 100
>> autovacuum_analyze_multiple = 0.4
>>
>> AV should be disabled by default. It should also automatically use the global vacuum_delay settings.
>
> Agreed, disabled by default (at least for 8.1; perhaps a topic of conversation for 8.2+). Yes, it should obey the global vacuum_delay settings, and yes, it should have its own GUCs as you suggested (all of this was the case with the patch that I submitted for 8.0, which Alvaro is now working on).

I think it should be on by default :) Let's not ship software in a default configuration that we KNOW will go to hell. How about we aim for a setup where, if someone doesn't modify their postgresql.conf, they will have no problems. Wasn't that the aim of defaulting shared_buffers to 1000 if we can?

Chris
I totally agree. I think we know pg_autovacuum is just a short-term solution. What we need is for someone to take the lead on this. The TODO list has the ideas documented. There is no reason the background writer could not load the FSM directly with free pages.

---------------------------------------------------------------------------

Gavin Sherry wrote:
> I'm wondering if effort is being misdirected here. I remember when Mark Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing significant performance loss -- I think on the order of 30% to 40% (I will try and dig up a link to the results).
>
> I think these results can be dramatically improved if the focus is on a more effective vacuum.
>
> [snip -- dirty-page bitmap proposal, quoted in full above]
>
> Thoughts? Comments?

-- 
Bruce Momjian | http://candle.pha.pa.us | pgman@candle.pha.pa.us
A question for interested parties. I'm thinking of handling the user/password issue by reading the flat files (the copies of pg_shadow, pg_database, etc).

The only thing that I'd need to modify is to add the datdba field to pg_database, so we can figure out an appropriate user for vacuuming each database.

What do people think?

-- 
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Accept honors and applause and you will lose your freedom"
On Thu, Jun 16, 2005 at 11:07:31AM +1000, Gavin Sherry wrote:
> I'm wondering if effort is being misdirected here. I remember when Mark Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing significant performance loss -- I think on the order of 30% to 40% (I will try and dig up a link to the results).

I think those are orthogonal issues. One is fixing whatever performance issues there are because of VACUUM. Note that the fact that Mark was having such a drop in performance with autovacuum only means that at the enormous load under which the OSDL tests are run, autovacuum is not the best solution. Not everybody runs with that sort of load anyway. (In fact lots of people don't.)

So, one issue is that at high loads, there are improvements to be made to VACUUM. The other issue is to get VACUUM to run in the first place, which is what autovacuum addresses.

I can easily predict that we will make adjustments and improvements to VACUUM in the future, but I'm not so sure that will happen before 8.1 feature-freezes. I have more confidence that we can integrate autovacuum for 8.1, which will be a leap forward.

-- 
Alvaro Herrera (<alvherre[a]surnet.cl>)
"I ask to be given the Nobel for humanitarian reasons" (Nicanor Parra)
Alvaro Herrera wrote:
> A question for interested parties. I'm thinking of handling the user/password issue by reading the flat files (the copies of pg_shadow, pg_database, etc).
>
> The only thing that I'd need to modify is to add the datdba field to pg_database, so we can figure out an appropriate user for vacuuming each database.
>
> What do people think?

I probably don't understand all the issues involved here, but reading pg_shadow by hand seems problematic. Do you constantly re-read it? What happens when a new user is added, etc.? Can't autovacuum run as a super-user that can vacuum anything?
On Thu, 16 Jun 2005 12:54 pm, Alvaro Herrera wrote:
> I think those are orthogonal issues. One is fixing whatever performance issues there are because of VACUUM. Note that the fact that Mark was having such a drop in performance with autovacuum only means that at the enormous load under which the OSDL tests are run, autovacuum is not the best solution. Not everybody runs with that sort of load anyway. (In fact lots of people don't.)

I agree here. There have been a couple of patches for improvements to VACUUM rejected in the past, e.g. partial vacuum. If we have autovacuum in the backend, it doesn't matter about people vacuuming the wrong part of a file; the system will manage it.

I'd also say there are a much greater number of people who will be able to fiddle with an implemented autovac to improve its performance and load. However, I think there are fewer people who can complete what Alvaro is doing.

> So, one issue is that at high loads, there are improvements to be made to VACUUM. The other issue is to get VACUUM to run in the first place, which is what autovacuum addresses.

There are plenty of ideas to shoot around here. Like:

- only run one iteration of a vacuum so you only clean indexes once, then stop the vacuum till the next cycle
- create the dead space map stuff with the bgwriter
- make sure you have individual table analyze and vacuum stats, so vacuum can be flexible to different tables

Some of the autovac issues we have seen recently, like O(n^2) behavior with many tables, will go away by being in the backend. So not everything will perform the same after the integration.

> I can easily predict that we will make adjustments and improvements to VACUUM in the future, but I'm not so sure that will happen before 8.1 feature-freezes. I have more confidence that we can integrate autovacuum for 8.1, which will be a leap forward.

The big leap will be to get it in the backend, which will, as Chris KL suggested, stop people shooting themselves in the foot. I'm not sure what strict rules are imposed by feature freeze, but there may be time for others to make some improvements before 8.1. We have also looked at this for at least two releases now. If it doesn't get in now, it will just get in for 8.2, and no improvements till 8.2.

Regards

Russell Smith
Gavin, People,

> I'm wondering if effort is being misdirected here. I remember when Mark Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing significant performance loss -- I think on the order of 30% to 40% (I will try and dig up a link to the results).

It wasn't quite that bad, and the automated DBT2 is deceptive; the test doesn't run for long enough for *not* vacuuming to be a problem. For a real test, you'd need to do a 24-hour or 48-hour DBT2 run.

Not that I don't agree that we need a less I/O-intense alternative to VACUUM, but it seems unlikely that we could actually do this, or even agree on a spec, before feature freeze. Whereas integrated AV is something we *could* do, and is widely desired.

If we do integrated AV, it should only be turned on by default at a relatively low level. And wasn't there an issue on Windows with AV not working?

-- 
Josh Berkus
Aglio Database Solutions
San Francisco
On Wed, Jun 15, 2005 at 11:42:17PM -0400, Matthew T. O'Connor wrote:
> I probably don't understand all the issues involved here, but reading pg_shadow by hand seems problematic. Do you constantly re-read it? What happens when a new user is added, etc.?

You don't read the pg_shadow table. Rather, you read the pg_user file, which is a plain-text file representing the information in pg_shadow. It's kept up to date by backends that modify user information. Likewise for pg_database and pg_group.

> Can't autovacuum run as a super-user that can vacuum anything?

That'd be another way to do it, maybe simpler. Currently I'm working on separating this into two parts though, one being a shlib and the other the standard postmaster-launched backend process. So I don't have to address this issue right now. It just bothered me to need a separate file with username and password, and the corresponding code to read it.

One issue I do have to deal with right now is how many autovacuum processes we want to be running. The current approach is to have one autovacuum process. Two possible options would be to have one per database, or one per tablespace. What do people think? I'm leaning towards the simpler option myself, but I'd like to hear more opinions. Particularly since one-per-database makes the code a lot simpler as far as I can see, because the shlib only needs to worry about issuing VACUUM commands; with the other approaches, the shlib has to manage everything (keep the pg_autovacuum table up to date, figure out when vacuums are needed, etc.)

The main problem with one-per-database is that we wouldn't have a (simple) way of coordinating vacuums so that they don't compete for I/O. That's why I thought of the one-per-tablespace approach, though that one is the most complex of all.

-- 
Alvaro Herrera (<alvherre[a]surnet.cl>)
"A poet is a world enclosed in a man" (Victor Hugo)
On Wed, 15 Jun 2005, Alvaro Herrera wrote:
> I think those are orthogonal issues. One is fixing whatever performance issues there are because of VACUUM. Note that the fact that Mark was having such a drop in performance with autovacuum only means that at the enormous load under which the OSDL tests are run, autovacuum is not the best solution. Not everybody runs with that sort of load anyway. (In fact lots of people don't.)

I agree.

> So, one issue is that at high loads, there are improvements to be made to VACUUM. The other issue is to get VACUUM to run in the first place, which is what autovacuum addresses.
>
> I can easily predict that we will make adjustments and improvements to VACUUM in the future, but I'm not so sure that will happen before 8.1 feature-freezes. I have more confidence that we can integrate autovacuum for 8.1, which will be a leap forward.

I guess my main concern is that we'll have a solution to the problem of dead tuples which is only half way there. It is only an incremental improvement upon the contrib module, and it solves only one real problem: users do not read up on VACUUM or autovacuum. This is at the expense of making it appear to be suitable for the general user base when it isn't, in my opinion. That isn't the fault of autovacuum, but is a function of the cost of ordinary vacuum.

Thanks,

Gavin
> The big leap will be to get it in the backend, which will, as Chris KL suggested, stop people shooting themselves in the foot. I'm not sure what strict rules are imposed by feature freeze, but there may be time for others to make some improvements before 8.1. We have also looked at this for at least two releases now. If it doesn't get in now, it will just get in for 8.2, and no improvements till 8.2.

Just my own two cents. First, I am not knocking the work that has been done on autovacuum. I am sure that it was a leap on its own to get it to work. However, I will say that I just don't see the reason for it. Vacuum, especially in the 8.x series, isn't that bad. Heck, if you actually manage your catalog, even on large databases it can be reasonable. Yes, it takes a little **gasp** administrative maintenance to run vacuum at just that right time, on just those right tables, but...

Anyway -- it seems it may be beneficial to focus the efforts somewhere else. The only reason I wanted to know if it was going to be in the backend last week was because I needed to know if I was going to have to document it in the new book.

Sincerely,

Joshua D. Drake
Alvaro,

> One issue I do have to deal with right now is how many autovacuum processes we want to be running. The current approach is to have one autovacuum process. Two possible options would be to have one per database, or one per tablespace. What do people think?

I'd vote for one, period, for the cluster, if you can manage that. Let's stick to simple for now. Most users have their database on a single disk or array, so multiple concurrent vacuums will compete for I/O regardless of different databases.

-- 
Josh Berkus
Aglio Database Solutions
San Francisco
Josh,

> Just my own two cents. First, I am not knocking the work that has been done on autovacuum. I am sure that it was a leap on its own to get it to work. However, I will say that I just don't see the reason for it.

I've personally seen at least a dozen user requests for "autovacuum in the backend", and had this conversation about 1,100 times:

NB: "After a week, my database got really slow."
Me: "How often are you running VACUUM ANALYZE?"
NB: "Running what?"

-- 
Josh Berkus
Aglio Database Solutions
San Francisco
Josh Berkus wrote:
> Not that I don't agree that we need a less I/O-intense alternative to VACUUM, but it seems unlikely that we could actually do this, or even agree on a spec, before feature freeze.

I don't see the need to rush anything in before the feature freeze.

> Whereas integrated AV is something we *could* do, and is widely desired.

I don't see why. IMHO the current autovacuum approach is far from optimal. If "integrated autovacuum" just means taking the same approach and building it into the backend, how does that significantly improve matters? (I find it difficult to take seriously answers like "it lets us use the backend's hash table implementation".) It _does_ mean there is more of an implicit stamp of PGDG approval for pg_autovacuum, which is something I personally wouldn't want to give to the current design.

-Neil
Alvaro Herrera wrote:
> One issue I do have to deal with right now is how many autovacuum processes we want to be running. The current approach is to have one autovacuum process. Two possible options would be to have one per database, or one per tablespace. What do people think?

Why do we need more than one pg_autovacuum process? (Note that this need not necessarily imply only one concurrent VACUUM, as you can use non-blocking connections in libpq.)

-Neil
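For reference, the non-blocking libpq pattern referred to above looks roughly like this; the helper names are made up and error handling is omitted, but PQsendQuery/PQconsumeInput/PQisBusy/PQgetResult are the real asynchronous API:

    #include <stdio.h>
    #include <libpq-fe.h>

    /* kick off a VACUUM without blocking the autovacuum loop */
    static void
    start_vacuum(PGconn *conn, const char *table)
    {
        char query[256];

        snprintf(query, sizeof(query), "VACUUM ANALYZE %s;", table);
        PQsendQuery(conn, query);   /* queues the command and returns */
    }

    /* poll a connection that has a VACUUM in flight; returns 1 when finished */
    static int
    vacuum_done(PGconn *conn)
    {
        PGresult *res;

        if (!PQconsumeInput(conn) || PQisBusy(conn))
            return 0;               /* still running (or read error) */

        while ((res = PQgetResult(conn)) != NULL)
            PQclear(res);           /* drain all results */
        return 1;
    }

One process multiplexing several such connections could keep one VACUUM going per database or tablespace without extra worker processes.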
> I've personally seen at least a dozen user requests for "autovacuum in the backend", and had this conversation about 1,100 times:
>
> NB: "After a week, my database got really slow."
> Me: "How often are you running VACUUM ANALYZE?"
> NB: "Running what?"

Me too. Just hang out in #postgresql for a while :)
On Thu, Jun 16, 2005 at 02:09:47PM +1000, Neil Conway wrote:
> Why do we need more than one pg_autovacuum process?

The only reason I considered it is that you can use the regular catalog-management routines to handle the new pg_autovacuum system catalog. With a single process, we need to issue SQL queries. This is very ugly IMHO.

-- 
Alvaro Herrera (<alvherre[a]surnet.cl>)
"He who admits he is a coward has courage" (Fernandel)
Joshua D. Drake wrote:
> Just my own two cents. First, I am not knocking the work that has been done on autovacuum. I am sure that it was a leap on its own to get it to work. However, I will say that I just don't see the reason for it.

The major reasons for autovacuum as I see it are as follows:

* Reduces administrative overhead of having to keep track of what tables need to be vacuumed how often
* Reduces the total amount of time the system spends vacuuming, since it only vacuums when needed
* Keeps stats up-to-date automatically
* Eliminates newbie confusion
* Eliminates one of the criticisms that the public has against PostgreSQL (justified or not)

Also, as VACUUM improves, autovacuum will improve with it.
Gavin Sherry wrote:
> I guess my main concern is that we'll have a solution to the problem of dead tuples which is only half way there. It is only an incremental improvement upon the contrib module, and it solves only one real problem: users do not read up on VACUUM or autovacuum. This is at the expense of making it appear to be suitable for the general user base when it isn't, in my opinion. That isn't the fault of autovacuum, but is a function of the cost of ordinary vacuum.

Would you mind expounding on why you think autovacuum isn't suitable for the general public? I know it's not a silver bullet, but I think in general it will be helpful for most people.
Neil Conway wrote:
> Josh Berkus wrote:
> > Whereas integrated AV is something we *could* do, and is widely desired.
>
> I don't see why. IMHO the current autovacuum approach is far from optimal. If "integrated autovacuum" just means taking the same approach and building it into the backend, how does that significantly improve matters? (I find it difficult to take seriously answers like "it lets us use the backend's hash table implementation".) It _does_ mean there is more of an implicit stamp of PGDG approval for pg_autovacuum, which is something I personally wouldn't want to give to the current design.

The reason to integrate it has nothing to do with the hash implementation; it has to do with making autovacuum more accessible to the masses, and more importantly, it provides a solution (not necessarily the best solution) to the vacuum problem, which I believe is a real problem for PostgreSQL. Integrating it into the backend also allows autovacuum to be better than it is now: using the backend logging functions, storing per-table thresholds, solving the O(n^2) problem, fixing the start-up and shutdown issues, and more.

I agree that if autovacuum becomes a long-term solution then we should also integrate FSM information etc... What else is lacking in the current design? Or, more specifically, what else would have to be done before you would consider giving it the PGDG stamp of approval?

Matthew
Alvaro Herrera wrote:
> The only reason I considered it is that you can use the regular catalog-management routines to handle the new pg_autovacuum system catalog. With a single process, we need to issue SQL queries. This is very ugly IMHO.

It was always my intention to have VACUUM and ANALYZE update the new autovacuum system table; I just never got around to making that happen.

Personally, I would vote for simplicity for now; that is, only one autovacuum process, and allow it to issue only one VACUUM command at any given time. Something more complicated sounds to me like a second-generation optimisation.
Neil Conway wrote:
> Why do we need more than one pg_autovacuum process? (Note that this need not necessarily imply only one concurrent VACUUM, as you can use non-blocking connections in libpq.)

Part of the backend integration work Alvaro is doing is teaching autovacuum to do its work without libpq.
On Thu, 2005-06-16 at 00:44 -0400, Matthew T. O'Connor wrote:
> * Eliminates newbie confusion

Ignore everything else. This one is the clincher. Someone doing serious database work is going to read the docs to find out about backup/restore processes and basic tuning; they'll run across the disable switch for autovacuum soon enough. The jack-of-all-trades IT guy who is running some minor work but doesn't know much about databases in general won't have as many hurdles to climb.

Besides, vacuum off by default potentially makes for huge files and takes forever to reclaim space (cluster, vacuum full, etc.). Vacuum on by default means that, worst case, they turn it off and instantly their I/O load decreases.
Alvaro Herrera <alvherre@surnet.cl> writes:
> A question for interested parties. I'm thinking of handling the user/password issue by reading the flat files (the copies of pg_shadow, pg_database, etc).

Er, what "user/password issue"? Context please.

> The only thing that I'd need to modify is to add the datdba field to pg_database, so we can figure out an appropriate user for vacuuming each database.

The datdba is not necessarily a superuser, and therefore is absolutely not the right answer for any question related to autovacuum. But in any case, I would expect that an integrated-into-the-backend autovac implementation would be operating at a level below any permission checks --- so this question shouldn't be relevant anyway.

regards, tom lane
On Thu, 16 Jun 2005, Matthew T. O'Connor wrote:
> Would you mind expounding on why you think autovacuum isn't suitable for the general public? I know it's not a silver bullet, but I think in general it will be helpful for most people.

As I said, this is largely the fault of VACUUM. The main thing I'd like to see is a complete solution to the problem. I'm not picking on autovacuum. However, I will elaborate a little on why I think autovacuum should not be a feature of the backend:

1) The main argument so far is that autovacuum will ensure that users who do not read the maintenance section of the manual will not notice a deterioration of performance. This means that we anticipate autovacuum being on by default, which suggests that the default autovacuum configuration will not need tuning. I do not think that will be the case.

2) By no fault of its own, autovacuum's level of granularity is the table level. For people dealing with non-trivial amounts of data (and we're not talking gigabytes or terabytes here), this is a serious drawback. Vacuum at peak times can cause very intense IO bursts -- even with the enhancements in 8.0. I don't think the solution to the problem is to give users the impression that it is solved, and then vacuum their tables during peak periods. I cannot stress this enough.

3) autovacuum on by default means row-level stats are on by default. This will have a non-trivial performance impact on users, IMHO. For right or wrong, our users take the postgresql.conf defaults pretty seriously, and this level of stats collection could and will remain enabled in some non-trivial percentage of users who turn autovacuum off (consider many users' reluctance to change shared_buffers in previous releases). To quote from the README:

"The overhead of the stats system has been shown to be significant under certain workloads. For instance, a tight loop of queries performing 'select 1' was found to run nearly 30% slower when row-level stats were enabled."

I'm not one for "select 1" benchmarks, but this is a problem that hasn't even been mentioned, as far as I recall.

4) Related to this, I guess, is that a user's FSM settings might be completely inappropriate. The 'Just read the manual' or 'Just read the logs' argument doesn't cut it, because the main argument for autovacuum in the backend is that people do not and will not.

5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If we're telling users about VACUUM less often than we are now, there are bound to be bloating issues (see 4).

I guess the main point is, if something major like this ships in the backend, it says to users that the problem has gone away. pg_autovacuum is a good contrib-style solution: it addresses a problem users have and attempts to solve it the way other users might try to solve it. When you consider it in the backend, it looks like a workaround. I think users are better served by solving the real problem.

Gavin
> 2) By no fault of its own, autovacuum's level of granularity is the table level. For people dealing with non-trivial amounts of data (and we're not talking gigabytes or terabytes here), this is a serious drawback. Vacuum at peak times can cause very intense IO bursts -- even with the enhancements in 8.0. I don't think the solution to the problem is to give users the impression that it is solved, and then vacuum their tables during peak periods. I cannot stress this enough.

I completely agree with Gavin - integrating this kind of thing into the background writer, or integrating it with the FSM, would be the ideal solution. I guess everybody who has already vacuumed a 2 TB relation will agree here. VACUUM is not a problem for small "my cat Minka" databases. However, it has been a real problem on large, heavy-load databases. I have even seen people splitting large tables and joining them with a view to avoid long vacuums and long CREATE INDEX operations (I am not joking - this is serious).

PostgreSQL is used more and more on really large boxes; this is an increasing problem. Gavin's approach using a vacuum bitmap seems to be a good one. An alternative would be to have some sort of vacuum queue containing a set of pages which are reported by the writing process (= background writer or backends).

best regards,

hans

-- 
Cybertec Geschwinde u Schoenig
Schoengrabern 134, A-2020 Hollabrunn, Austria
Tel: +43/664/393 39 74
www.cybertec.at, www.postgresql.at
> If we do integrated AV, it should only be turned on by default at a relatively low level. And wasn't there an issue on Windows with AV not working?

AFAIK, it works. But the fact that you need to provide it with a userid/password combo makes it a lot harder to install as a service. And it's not installed by default by the installer, for that reason (and I think maybe others? Dave, I think you're the one who said no-service-by-default?)

//Magnus
On Thu, 16 Jun 2005, Hans-Jürgen Schönig wrote:
> I completely agree with Gavin - integrating this kind of thing into the background writer, or integrating it with the FSM, would be the ideal solution.
>
> I guess everybody who has already vacuumed a 2 TB relation will agree here. VACUUM is not a problem for small "my cat Minka" databases. However, it has been a real problem on large, heavy-load databases. I have even seen people splitting large tables and joining them with a view to avoid long vacuums and long CREATE INDEX operations (I am not joking - this is serious).

I think this gets away from my point a little. People with 2 TB tables can take care of themselves, as can people who've taken the time to partition their tables to speed up vacuum. I'm more concerned about the majority of people who fall in the middle -- between the hobbyist and the high-end data centre.

Thanks,

Gavin
Gavin Sherry wrote:
> I think this gets away from my point a little. People with 2 TB tables can take care of themselves, as can people who've taken the time to partition their tables to speed up vacuum. I'm more concerned about the majority of people who fall in the middle -- between the hobbyist and the high-end data centre.

I think your approach will help all of them. If we had some sort of autovacuum (which is packaged with most distros anyway - having it in the core is nice as well) and a mechanism to improve reallocation / vacuum speed, we would have solved all the problems. I do think that the 2 TB people can take care of themselves; the question is, however, whether the database can do what they want ...

thanks a lot,

hans
> -----Original Message-----
> From: Magnus Hagander [mailto:mha@sollentuna.net]
> Sent: 16 June 2005 10:15
> To: Josh Berkus; Gavin Sherry
> Cc: Bruce Momjian; pgsql-hackers@postgresql.org; Dave Page
> Subject: RE: [HACKERS] Autovacuum in the backend
>
> AFAIK, it works. But the fact that you need to provide it with a userid/password combo makes it a lot harder to install as a service. And it's not installed by default by the installer, for that reason (and I think maybe others? Dave, I think you're the one who said no-service-by-default?)

Yes, 'cos there was no easy way to do it securely when I did it without a lot of mucking about to set up a .pgpass file in the service user's account. It does work perfectly well however, and did so long before PostgreSQL itself did :-)

Regards, Dave
Gavin Sherry said:
> I think this gets away from my point a little. People with 2 TB tables can take care of themselves, as can people who've taken the time to partition their tables to speed up vacuum. I'm more concerned about the majority of people who fall in the middle -- between the hobbyist and the high-end data centre.

My only problem with what you say is the implication that we should not incorporate AV into the backend until these things have been solved. This would be one step down a long road, and that's how it should be positioned.

I am very concerned that with feature freeze two weeks away, we seem to be in a similar position to where we were a year ago. I know we don't even promise anything, but certainly I and others believed that work was being done to get AV into the backend in 8.1. Not doing this because we think it could be lots better would not give people a good impression of our processes. I certainly don't think it will make matters worse, especially if it's not on by default.

cheers

andrew
Andrew Dunstan wrote: >Gavin Sherry said: > > >>I think this gets away from my point a little. People with 2 TB tables >>can take care of themselves, as can people who've taken the time to >>partition their tables to speed up vacuum. I'm more concerned about the >>majority of people who fall in the middle -- between the hobbyist and >>the high end data centre. >> >> >> > >My only problem with what you say is the implication that we should not incorporate AV into >the backend until these things have been solved. This would be one step down >a long road, and that's how it should be positioned. > > Right, I think if VACUUM is improved then the semantics of AV in the backend might change, but I think there will always be a need for some maintenance, and a daemon that monitors the maintenance needs of your database and fires off appropriate maintenance commands for you is good. No, it doesn't solve all problems, but I think it solves a lot of problems for a lot of people. Besides, VACUUM isn't the only thing AV does; it also does ANALYZE to keep your stats up-to-date, and it watches for XID wraparound. It could also look for REINDEX opportunities and who knows what else in the future. >I am very concerned that with Feature Freeze 2 weeks away we seem to be in a >similar position to where we were a year ago. I know we don't even promise >anything, but certainly I and others believed that work was being done to >get AV into the backend in 8.1. Not doing this because we think it could be >lots better would not give people a good impression of our processes. I >certainly don't think it will make matters worse, especially if it's not on >by default. > > I agree. Also, some people in this thread have been making noises about wanting AV on by default. This might be nice, but I am still leaning towards off by default, at least in 8.1.
Gavin Sherry wrote: >On Thu, 16 Jun 2005, Matthew T. O'Connor wrote: > > >>Would you mind expounding on why you think autovacuum isn't suitable for >>the general public? I know it's not a silver bullet, but I think in >>general, it will be helpful for most people. >> >> > >As I said, this is largely the fault of VACUUM. The main thing I'd like to >see is a complete solution to the problem. I'm not picking on autovacuum. >However, I will elaborate a little on why I think autovacuum should not >be a feature of the backend: > > Don't worry, I don't think you are picking on AV. >1) The main argument so far is that autovacuum will ensure that users who >do not read the maintenance section of the manual will not notice a >deterioration of performance. This means that we anticipate autovacuum >being on by default. This suggests that the default autovacuum >configuration will not need tuning. I do not think that will be the case. > > I disagree with this. I think the newbie protection benefits of AV are not its primary goal, though I do think it's an important one. The main thing AV brings is the ability to control bloating in your database and keep your stats up-to-date no matter what your workload. It is possible for an Admin to set up cron scripts to run VACUUM or ANALYZE on particularly needy tables at appropriate intervals, but I guarantee that the cron script is going to fire either too many or too few VACUUMs. Also, when the workload changes, or a new table is added, the Admin then needs to update his cron scripts. This all goes away with AV, and I believe this is a much bigger goal than the newbie problem. >2) By no fault of its own, autovacuum's level of granularity is the table >level. For people dealing with non-trivial amounts of data (and we're not >talking gigabytes or terabytes here), this is a serious drawback. Vacuum >at peak times can cause very intense IO bursts -- even with the >enhancements in 8.0. I don't think the solution to the problem is to give >users the impression that it is solved and then vacuum their tables during >peak periods. I cannot stress this enough. > > I agree this is a major problem with VACUUM, but I also think it's a different problem. One advantage of integrated AV is that you will be able to set per-table thresholds, which include the ability to turn off AV for any given table. If you are running a database with tables this big, I think you will be able to figure out how to customize integrated AV to your needs. >3) autovacuum on by default means row level stats are on by default. This >will have a non-trivial performance impact on users, IMHO. For right or >wrong, our users take the postgresql.conf defaults pretty seriously and >this level of stats collection could and will remain enabled in some >non-trivial percentage of users who turn autovacuum off (consider many >users' reluctance to change shared_buffers in previous releases). To quote >from the README: > >"The overhead of the stats system has been shown to be significant under >certain workloads. For instance, a tight loop of queries performing >"select 1" was found to run nearly 30% slower when row-level stats were >enabled." > >I'm not one for "select 1" benchmarks but this is a problem that hasn't >even been mentioned, as far as I recall. > > I mentioned this in the README because I thought I should, not because I think it's a real problem in practice. I think a real production database doing queries that are any more complicated than "select 1" will probably not notice the difference.
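For reference, the per-table test contrib/pg_autovacuum applies boils down to a base value plus a scaling factor times the table's row count. A simplified sketch in C follows - the struct and function names are made up, this is not the actual contrib code, and the base/scale pairs correspond only loosely to its -v/-V and -a/-A options:

/*
 * Simplified sketch of pg_autovacuum's per-table decision (hypothetical
 * names, not the real code).
 */
#include <stdbool.h>
#include <stdio.h>

typedef struct TableActivity
{
    float reltuples;        /* row estimate from pg_class */
    long  n_tup_ins;        /* counters from the stats collector */
    long  n_tup_upd;
    long  n_tup_del;
} TableActivity;

static bool
needs_vacuum(const TableActivity *t, long base, float scale)
{
    long threshold = base + (long) (scale * t->reltuples);

    /* updates and deletes create dead tuples; inserts do not */
    return (t->n_tup_upd + t->n_tup_del) > threshold;
}

static bool
needs_analyze(const TableActivity *t, long base, float scale)
{
    long threshold = base + (long) (scale * t->reltuples);

    /* any row change can skew the optimizer statistics */
    return (t->n_tup_ins + t->n_tup_upd + t->n_tup_del) > threshold;
}

int
main(void)
{
    TableActivity t = { 10000.0f, 1000, 4000, 2000 };

    printf("vacuum: %d analyze: %d\n",
           needs_vacuum(&t, 1000, 0.4f), needs_analyze(&t, 500, 0.2f));
    return 0;
}

The point of the scaling factor is exactly the per-table argument above: a threshold proportional to table size fires often on small busy tables and rarely on huge mostly-static ones, which no fixed cron schedule can reproduce.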
>4) Related to this, I guess, is that a user's FSM settings might be >completely inappropriate. The 'Just read the manual' or 'Just read the >logs' argument doesn't cut it, because the main argument for autovacuum in >the backend is that people do not and will not. > > Agreed, it doesn't solve all problems, and I'm not arguing that the integration of AV makes PostgreSQL newbie-safe; it just helps reduce the newbie problem. Again, if the default FSM settings are inappropriate for a database then the user is probably doing something more complicated than a "my cat Minka" database and will need to learn some tuning skills anyway. >5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If >we're telling users about VACUUM less often than we are now, there's bound >to be bloating issues (see 4). > > Not totally true, regular VACUUM can shrink tables a little (I think it can only cut off free space at the end of the table, without moving data around). But if AV is on and the settings are reasonable, then a table shouldn't bloat much or at all. Also, I don't think we are telling people to VACUUM less; in fact tables that need it will usually get VACUUM'd more, we are just telling the users that if they turn AV on, they don't have to manage all the VACUUMing. >I guess the main point is, if something major like this ships in the >backend it says to users that the problem has gone away. pg_autovacuum is >a good contrib style solution: it addresses a problem users have and >attempts to solve it the way other users might try and solve it. When you >consider it in the backend, it looks like a workaround. I think users are >better served by solving the real problem. > Which problem goes away? The problem of users forgetting to VACUUM does go away; the problem of the VACUUM command being problematic on large tables doesn't, but that is a different question. My basic position is that even with integrated AV, there will always (or at least for the foreseeable future) be some maintenance that users will need to do to their databases by hand (or by cron), and AV does this better than cron does. When VACUUM is improved, the semantics of AV might change, but the maintenance work will still need to be done. Matt
Hans-Jürgen Schönig wrote: > I completely agree with Gavin - integrating this kind of thing into the > background writer or integrating it with FSM would be the ideal solution. Yes, AV should look at FSM data, and it will eventually. I'm not sure how you would integrate AV with the background writer, but again, if improvements are made to vacuum, AV might have to change along with them - I still think it will be needed, or at least helpful.
On Wed, Jun 15, 2005 at 09:07:30PM -0700, Josh Berkus wrote: > Josh, > > > Just my own two cents. First I am not knocking the work that has been done on > > autovacuum. I am sure that it was a leap on its own to get it to work. > > However I will say that I just don't see the reason for it. > > I've personally seen at least a dozen user requests for "autovacuum in the > backend", and had this conversation about 1,100 times: > > NB: "After a week, my database got really slow." > Me: "How often are you running VACUUM ANALYZE?" > NB: "Running what?" Yes, me too. I always understood autovacuum to be a way to avoid having newbies get burned by not vacuuming, and a way to simplify the maintenance of lower-traffic databases. I don't see people with high-traffic databases (relative to the hardware they're running on) ever using autovacuum with the current state of vacuum and autovacuum. If improvements to vacuum (unrelated to autovacuum) reduce the IO load, that would be a great thing, especially for those of us dealing with 24x7 databases. (I really like the dirty bitmap suggestion - it sounds like a clean way to reduce the amount of work needed.) If autovacuum were extended to allow more flexible scheduling (or even to be aware of the other IO going on) then it would be of wider use - but I think the real value of autovacuum is to make sure that new users (Windows...) don't have a bad experience when they first try PG. Cheers, Steve
Matthew T. O'Connor wrote: > > Right, I think if VACUUM is improved then the semantics of AV in the > backend might change, but I think there will always be a need for some > maintenance, and a daemon that monitors the maintenance needs of your > database and fires off appropriate maintenance commands for you is > good. No, it doesn't solve all problems, but I think it solves a lot of > problems for a lot of people. Besides, VACUUM isn't the only thing AV > does; it also does ANALYZE to keep your stats up-to-date, and it watches > for XID wraparound. It could also look for REINDEX opportunities and > who knows what else in the future. Dave, I wonder if we should aim to have pgAgent in the backend, which was one of the reasons why I considered converting it from C++ to pure C. There are many regular maintenance issues that AV can cover, some more it could cover, and many more we can't even think of right now. Having an SQL-executing agent freely at hand (consistent on _every_ platform, without cron/anacron/at/you-name-it dependencies) should be helpful for that. Regards, Andreas
On Thu, Jun 16, 2005 at 01:32:16AM -0400, Tom Lane wrote: > Alvaro Herrera <alvherre@surnet.cl> writes: > > A question for interested parties. I'm thinking of handling the > > user/password issue by reading the flat files (the copies of pg_shadow, > > pg_database, etc). > > Er, what "user/password issue"? Context please. > > > The only thing that I'd need to modify is add the datdba field to > > pg_database, so we can figure out an appropriate user for vacuuming each > > database. > > The datdba is not necessarily a superuser, and therefore is absolutely > not the right answer for any question related to autovacuum. But in > any case, I would expect that an integrated-into-the-backend autovac > implementation would be operating at a level below any permission checks > --- so this question shouldn't be relevant anyway. Ok, it seems things are quite a bit out of context. What I did was take Matthew's patch for integrating contrib pg_autovacuum into the postmaster. This patch was posted several times around July and August 2004. The patch had several issues, like an incorrect shutdown sequence, forcing libpq to be statically linked into the backend, not correctly using ereport(), and not using the backend's memory management infrastructure. There were several suggestions. One was to separate it into two parts: one would be a process launched by the postmaster, and the other a shared library, loaded by that process, which would in turn load libpq and issue SQL queries (including but not limited to VACUUM and ANALYZE queries) to a regular backend, using a regular connection. Now, the user/password issue is which user and password combination is used to connect to the regular backend. Matthew had created a password file, to be used in a similar fashion to libpq's password file. This works but has the drawback that the user has to set the file up correctly. What I'm proposing is using the flatfiles for this. Now, I'm hearing people don't like using libpq. This means the whole thing turns a lot more complicated; for one thing, because it will need to "connect" to every database in some fashion. Also, you want it to "skip" normal permission checks, which would be doable only if it's not using libpq. On the other hand, if there were multiple autovacuum processes, one per database, it'd all be much easier, without using libpq. Could we clarify what scenario people are envisioning? I don't want to waste time fixing code that in the end is going to be declared fundamentally flawed -- I'd rather work on shared dependencies. Some people say "keep it simple and have one process per cluster." I think they don't realize it's actually more complex, not the other way around. The only additional complexity is how to handle concurrent vacuuming, but the code turns out to be simpler because we have access to system catalogs and standard backend infrastructure in a simple fashion. A wholly separate question is what the autovacuum daemon should be doing. At present we only have "full vacuum", "vacuum" and "analyze". In the future this can be extended and autovacuum can launch partial vacuums, nappy vacuums, bitmapped vacuums, coffee-with-cream vacuums. But we need to start somewhere. -- Alvaro Herrera (<alvherre[a]surnet.cl>) "¿Qué importan los años? Lo que realmente importa es comprobar que a fin de cuentas la mejor edad de la vida es estar vivo" (Mafalda)
On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote: > 2) By no fault of its own, autovacuum's level of granularity is the table > level. For people dealing with non-trivial amounts of data (and we're not > talking gigabytes or terabytes here), this is a serious drawback. Vacuum > at peak times can cause very intense IO bursts -- even with the > enhancements in 8.0. I don't think the solution to the problem is to give > users the impression that it is solved and then vacuum their tables during > peak periods. I cannot stress this enough. People running systems with petabyte-sized tables can disable autovacuum for those tables, and leave it running for the rest. Then they can schedule whatever maintenance they see fit on their gigantic tables. Trying to run a database with more than a dozen gigabytes of data without expert advice (or at least reading the manual) would be extremely stupid anyway. -- Alvaro Herrera (<alvherre[a]surnet.cl>) "¿Cómo puedes confiar en algo que pagas y que no ves, y no confiar en algo que te dan y te lo muestran?" (Germán Poo)
Alvaro Herrera wrote: > On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote: > > >>2) By no fault of its own, autovacuum's level of granularity is the table >>level. For people dealing with non-trivial amounts of data (and we're not >>talking gigabytes or terabytes here), this is a serious drawback. Vacuum >>at peak times can cause very intense IO bursts -- even with the >>enhancements in 8.0. I don't think the solution to the problem is to give >>users the impression that it is solved and then vacuum their tables during >>peak periods. I cannot stress this enough. > > > People running systems with petabyte-sized tables can disable autovacuum > for those tables, and leave it running for the rest. Then they can > schedule whatever maintenance they see fit on their gigantic tables. > Trying to run a database with more than a dozen gigabytes of data > without expert advice (or at least reading the manual) would be > extremely stupid anyway. Professional advice won't help you here, because you still have to vacuum this giant table. This is especially critical in the case of 24x7 systems (which are quite frequent). In many cases there is no maintenance window anymore (e.g. a wastewater system runs 24x7). Reducing the impact of vacuum and "create index" would be important to many people. To me, improving vacuum is as important as Jan's bgwriter patch (which reduced the troubles people had with checkpoints). best regards, hans -- Cybertec Geschwinde u Schoenig Schoengrabern 134, A-2020 Hollabrunn, Austria Tel: +43/664/393 39 74 www.cybertec.at, www.postgresql.at
Alvaro Herrera <alvherre@surnet.cl> writes: > Now, I'm hearing people don't like using libpq. Yeah --- a libpq-based solution is not what I think of as integrated at all, because it cannot do anything that couldn't be done by the existing external autovacuum process. About all you can buy there is having the postmaster spawn the autovacuum process, which is slightly more convenient to use but doesn't buy any real new functionality. > Some people say "keep it simple and have one process per cluster." I > think they don't realize it's actually more complex, not the other way > around. Agreed. If you aren't connected to a specific database, then you cannot use any of the normal backend infrastructure for catalog access, which is pretty much a killer limitation. A simple approach would be a persistent autovac background process for each database, but I don't think that's likely to be acceptable because of the amount of resources tied up (PGPROC slots, open files, etc). One thing that might work is to have the postmaster spawn an autovac process every so often. The first thing the autovac child does is pick up the current statistics dump file (which it can find without being connected to any particular database). It looks through that to determine which database is most in need of work, then connects to that database and does some "reasonable" amount of work there, and finally quits. Awhile later the postmaster spawns another autovac process that can connect to a different database and do work there. This design would mean that the autovac process could not have any long-term state of its own: any long-term state would have to be in either system catalogs or the statistics. But I don't see that as a bad thing really --- exposing the state will be helpful from a debugging and administrative standpoint. regards, tom lane
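To make the proposed spawn-work-quit cycle concrete, here is a minimal stand-alone sketch of the control flow; the stats-file ranking and the vacuum work are stubbed out, and none of this is real postmaster code:

/*
 * Stand-alone sketch of the cycle described above: fork a short-lived
 * autovac child, let it pick the neediest database from the stats dump,
 * do a bounded amount of work, and exit. pick_target_db() and
 * do_vacuum_work() are stubs.
 */
#include <unistd.h>
#include <sys/types.h>
#include <sys/wait.h>

static int
pick_target_db(const char *statsfile)
{
    /* stub: parse the statistics dump and rank databases by need */
    (void) statsfile;
    return 1;
}

static void
do_vacuum_work(int dboid)
{
    /* stub: connect to the chosen database, vacuum, analyze */
    (void) dboid;
}

int
main(void)
{
    for (;;)
    {
        pid_t child = fork();

        if (child == 0)
        {
            /* autovac child: keeps no long-term state of its own */
            int db = pick_target_db("global/pgstat.stat");

            do_vacuum_work(db);
            _exit(0);
        }
        waitpid(child, NULL, 0);
        sleep(60);              /* "every so often" */
    }
}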
> -----Original Message----- > From: Andreas Pflug [mailto:pgadmin@pse-consulting.de] > Sent: 16 June 2005 15:14 > To: Dave Page > Cc: Matthew T. O'Connor; Andrew Dunstan; > swm@linuxworld.com.au; postgres@cybertec.at; > alvherre@surnet.cl; pgman@candle.pha.pa.us; > pgsql-hackers@postgresql.org > Subject: Re: [HACKERS] Autovacuum in the backend > > Dave, > > i wonder if we should aim to have pgAgent in the backend > which was one > of the reasons why I considered to have it converted from C++ > to pure C. In previous discussions on -hackers when ppl raised the idea of something like pgAgent being built into the backend, istm that the majority of people were against the idea. Regards, Dave.
Alvaro, > coffee-with-cream vacuums. I tried this and now my Hoover makes this horrible noise and smokes. ;-) All: Seriously, all: when I said that "users" were asking for Autovac in the backend (AVitB), I wasn't talking just about the newbies on #postgresql. I'm also talking about companies like Hyperic, and whole groups like postgresql.org.br. This is a feature that people want, and unless there's something fundamentally unstable about it, it seems really stupid to hold it back because we're planning VACUUM improvements for 8.2. AVitB has been on the TODO list for 2 versions. There's been 2 years to question its position there. Now people are bringing up objections when there's no time for discussion left? This stinks. -- Josh Berkus Aglio Database Solutions San Francisco
People, > AVitB has been on the TODO list for 2 versions. There's been 2 years to > question its position there. Now people are bringing up objections when > there's no time for discussion left? This stinks. Hmmm ... to be specific, I'm referring to the objections to the *idea* of AVitB, not the problems with the current patch. -- Josh Berkus Aglio Database Solutions San Francisco
Dave, > In previous discussions on -hackers when ppl raised the idea of > something like pgAgent being built into the backend, istm that the > majority of people were against the idea. Well, you're up against the minimalist approach to core PostgreSQL there. It would pretty much *have* to be an optional add-in, even if it was stored in pg_catalog. I can see a lot of uses for a back-end job scheduler myself, but it would need to go through the gauntlet of design criticism first <wry grin>. -- Josh Berkus Aglio Database Solutions San Francisco
Josh Berkus <josh@agliodbs.com> writes: > Seriously, all: when I said that "users" were asking for Autovac in the > backend (AVitB), I wasn't talking just the newbies on #postgresql. I'm also > talking companies like Hyperic, and whole groups like the postgresql.org.br. > This is a feature that people want, and unless there's something > fundamentally unstable about it, it seems really stupid to hold it back > because we're planning VACUUM improvements for 8.2. Agreed, and I don't see AVitB as standing in the way of any of those proposed improvements--it's just that AVitB has a chance of making it into 8.1, and none of the proposed improvements do. I don't see why people are objecting. Also, count me in the "turn it on by default" crowd--I'd rather not have newbies see unending file bloat from normal usage, it just looks bad. Anyone who plans to deploy for large databases and high loads needs to learn to tune (just as with any other database) and can make an informed decision about whether AV should be on or not. -Doug
Tom Lane wrote: >Alvaro Herrera <alvherre@surnet.cl> writes: > > >>Now, I'm hearing people don't like using libpq. >> >> > >Yeah --- a libpq-based solution is not what I think of as integrated at >all, because it cannot do anything that couldn't be done by the existing >external autovacuum process. About all you can buy there is having the >postmaster spawn the autovacuum process, which is slightly more >convenient to use but doesn't buy any real new functionality. > > Yes, libpq has to go; I thought this was clear, but perhaps I didn't say it clearly enough. Anyway, this was the stumbling block which prevented me from making more progress on autovacuum integration. >>Some people say "keep it simple and have one process per cluster." I >>think they don't realize it's actually more complex, not the other way >>around. >> >> > >A simple approach would be a persistent autovac background process for >each database, but I don't think that's likely to be acceptable because >of the amount of resources tied up (PGPROC slots, open files, etc). > > Agreed, this seems ugly. >One thing that might work is to have the postmaster spawn an autovac >process every so often. The first thing the autovac child does is pick >up the current statistics dump file (which it can find without being >connected to any particular database). It looks through that to >determine which database is most in need of work, then connects to that >database and does some "reasonable" amount of work there, and finally >quits. Awhile later the postmaster spawns another autovac process that >can connect to a different database and do work there. > > I don't think you can use a dump to determine who should be connected to next, since you don't really know what happened since the last time you exited. What was a priority 5 or 10 minutes ago might not be a priority now. >This design would mean that the autovac process could not have any >long-term state of its own: any long-term state would have to be in >either system catalogs or the statistics. But I don't see that as >a bad thing really --- exposing the state will be helpful from a >debugging and administrative standpoint. > This is not a problem, as my patch, which Alvaro has now taken over, already created a new system catalog for all autovac data, so autovac really doesn't contain any static persistent data. The rough design I had in mind was: 1) On startup the postmaster spawns the master autovacuum process 2) The master autovacuum process spawns backends to do the vacuuming work on a particular database 3) The master autovacuum waits for this process to exit, then spawns the next backend for the next database 4) Repeat this loop until all databases in the cluster have been checked, then sleep for a while, and start over again. I'm not sure if this is feasible, or if this special master autovacuum process would be able to fork off, or request that the postmaster fork off, an autovacuum process for a particular database in the cluster. Thoughts or comments? Matthew
"Matthew T. O'Connor" <matthew@zeut.net> writes: > I don't think you can use a dump to determine who should be connected to > next since you don't really know what happened since the last time you > exited. What was a priority 5 or 10 minutes ago might not be a priority > now. Well, the information necessary to make that decision has to be available from the statistics file. This doesn't seem like an insuperable problem. > The rough design I had in mind was: > 1) On startup postmaster spawns the master autovacuum process > 2) The master autovacuum process spawns backends to do the vacuuming > work on a particular database > 3) The master autovacuum waits for this process to exit, then spaws the > next backend for the next database > 4) Repeat this loop until all databases in the cluster have been > checked, then sleep for a while, and start over again. This is unworkable, I believe, because backends have to be direct children of the postmaster. I don't recall the details at the moment but there are IPC signaling reasons for it. > I'm not sure if this is feasible, or if this special master autovacuum > process would be able to fork off or request that the postmaster fork > off an autovacuum process for a particular database in the cluster. > Thoughts or comments? It's possible that we could add some signaling whereby the autovac master could request the postmaster to fork a child into a particular database. I'm not sure why this is a lot better than keeping the stats out where everyone can see them... regards, tom lane
Tom Lane wrote: >"Matthew T. O'Connor" <matthew@zeut.net> writes: > > >>I don't think you can use a dump to determine who should be connected to >>next, since you don't really know what happened since the last time you >>exited. What was a priority 5 or 10 minutes ago might not be a priority >>now. >> >> > >Well, the information necessary to make that decision has to be >available from the statistics file. This doesn't seem like an >insuperable problem. > > Interesting, so the postmaster would kick off an autovacuum process, which would read in data from the stats system by hand (it can do this because the stats system writes its data to flat files?). I don't know how complicated this might be, but perhaps a simpler method is to just have each autovacuum process write a file for itself noting what database it should connect to next. This would work fine assuming we want to continue to loop through all the databases in much the same fashion as pg_autovacuum currently does. >>The rough design I had in mind was: >>1) On startup the postmaster spawns the master autovacuum process >>2) The master autovacuum process spawns backends to do the vacuuming >>work on a particular database >>3) The master autovacuum waits for this process to exit, then spawns the >>next backend for the next database >>4) Repeat this loop until all databases in the cluster have been >>checked, then sleep for a while, and start over again. >> >> >It's possible that we could add some signaling whereby the autovac >master could request the postmaster to fork a child into a particular >database. I'm not sure why this is a lot better than keeping the >stats out where everyone can see them... > Ok.
On Thu, 2005-06-16 at 11:42 -0400, Tom Lane wrote: > Alvaro Herrera <alvherre@surnet.cl> writes: ... > > Some people say "keep it simple and have one process per cluster." I > > think they don't realize it's actually more complex, not the other way > > around. > > Agreed. If you aren't connected to a specific database, then you cannot > use any of the normal backend infrastructure for catalog access, which > is pretty much a killer limitation. > > A simple approach would be a persistent autovac background process for > each database, but I don't think that's likely to be acceptable because > of the amount of resources tied up (PGPROC slots, open files, etc). In this case it should also be configurable which databases will get their own AV processes. Also, there is probably no need to keep an AV process running very long after the last "real" backend for that database has closed, as there won't be any changes anyway. Having one AV process per DB will likely be a problem only for installations with very many single-user, always-connected databases, which I don't expect to be that many. And I also expect that soon (after my vacuums-don't-step-on-each-other patch goes in), there will be a need for running several vacuums in parallel on the same database (say, one with non-intrusive vacuum_page settings for a really large table and several more aggressive ones for fast-changing small tables at the same time); AFAICS this will also need several backends - at least one for each parallel vacuum. > One thing that might work is to have the postmaster spawn an autovac > process every so often. My fastest manual vacuum does its job in 5 sec and is repeated at 10 sec intervals - will this design be able to match this? > The first thing the autovac child does is pick > up the current statistics dump file (which it can find without being > connected to any particular database). It looks through that to > determine which database is most in need of work, then connects to that > database and does some "reasonable" amount of work there, and finally > quits. Awhile later the postmaster spawns another autovac process that > can connect to a different database and do work there. > > This design would mean that the autovac process could not have any > long-term state of its own: any long-term state would have to be in > either system catalogs or the statistics. But I don't see that as > a bad thing really --- exposing the state will be helpful from a > debugging and administrative standpoint. -- Hannu Krosing <hannu@skype.net>
Josh Berkus wrote: > Dave, > > >>In previous discussions on -hackers when ppl raised the idea of >>something like pgAgent being built into the backend, istm that the >>majority of people were against the idea. > > > Well, you're up against the minimalist approach to core PostgreSQL there. It > would pretty much *have* to be an optional add-in, even if it was stored in > pg_catalog. I can see a lot of uses for a back-end job scheduler myself, but > it would need to go through the gauntlet of design criticism first <wry > grin>. You want to scare me, don't you? :-) We have a growing zoo of daemons that can be regarded as tightly integrated server add-on processes (slony, autovac, pgAgent), and it would be really nice (say: win32 users are used to it, thus requiring it) to have a single point of control. Maybe a super daemon (in win32 probably pg_ctl), controlling the postmaster and all those helper processes (accessible through pgsql functions, of course), would be the solution. This keeps the kernel clean, separates backend shmem from helper processes, and enables control over all processes. Regards, Andreas
Andreas Pflug wrote: > > We're having a growing zoo of daemons that can be regarded as tightly > integrated server add-on processes (slony, autovac, pgAgent), and it > would be really nice (say: win32 users are used to it, thus requiring > it) to have a single point of control. > > Maybe a super daemon (in win32 probably pg_ctl), controlling > postmaster and all those helper processes (accessible through pgsql > functions, of course) would be the solition. This keeps the kernel > clean, separates backend shmem from helper processes and enables > control over all processes. > And this will be ready when? I thought we were discussing what could be done regarding AVitB between now and feature freeze for 8.1 in about 2 weeks. This surely doesn't come into that category. cheers andrew
On Thu, 16 Jun 2005, Alvaro Herrera wrote: > On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote: > > > 2) By no fault of its own, autovacuum's level of granularity is the table > > level. For people dealing with non-trivial amounts of data (and we're not > > talking gigabytes or terabytes here), this is a serious drawback. Vacuum > > at peak times can cause very intense IO bursts -- even with the > > enhancements in 8.0. I don't think the solution to the problem is to give > > users the impression that it is solved and then vacuum their tables during > > peak periods. I cannot stress this enough. > > People running systems with petabyte-sized tables can disable autovacuum > for those tables, and leave it running for the rest. Then they can > schedule whatever maintenance they see fit on their gigantic tables. > Trying to run a database with more than a dozen gigabytes of data > without expert advice (or at least reading the manual) would be > extremely stupid anyway. As I've said a few times, I'm not concerned about such users. I'm concerned about users with some busy tables of a few hundred megabytes. I still don't think VACUUM at arbitrary times on such tables is suitable. Thanks, Gavin
Andrew Dunstan wrote: > > > Andreas Pflug wrote: > >> >> We have a growing zoo of daemons that can be regarded as tightly >> integrated server add-on processes (slony, autovac, pgAgent), and it >> would be really nice (say: win32 users are used to it, thus requiring >> it) to have a single point of control. >> >> Maybe a super daemon (in win32 probably pg_ctl), controlling >> the postmaster and all those helper processes (accessible through pgsql >> functions, of course), would be the solution. This keeps the kernel >> clean, separates backend shmem from helper processes, and enables >> control over all processes. >> > > And this will be ready when? This is certainly 8.2 stuff. I'll be sufficiently glad if the instrumentation stuff that was posted pre-8.0, and left out in those days because committers ran out of time, makes it into 8.1... > I thought we were discussing what could be done regarding AVitB > between now and feature freeze for 8.1 in about 2 weeks. This surely > doesn't come into that category. I agree with former posters that we should have AV on by default, to have a system that performs correctly out of the box for smaller installations. Even a functionally cut-down version of AV running by default, which has to be stopped and replaced by a more sophisticated solution for high-performance installations, is better than what we have now. Regards, Andreas
Josh Berkus wrote: > Alvaro, > > >>coffee-with-cream vacuums. > > I tried this and now my Hoover makes this horrible noise and smokes. ;-) Probably related to the quality of American coffee ;). > All: > > Seriously, all: when I said that "users" were asking for Autovac in the > backend (AVitB), I wasn't talking just the newbies on #postgresql. I'm also > talking companies like Hyperic, and whole groups like the postgresql.org.br. > This is a feature that people want, and unless there's something > fundamentally unstable about it, it seems really stupid to hold it back > because we're planning VACUUM improvements for 8.2. > > AVitB has been on the TODO list for 2 versions. There's been 2 years to > question its position there. Now people are bringing up objections when > there's no time for discussion left? This stinks. Complete agreement from me. Incremental improvements are good - pointing out that there are some other incremental improvements that would also be good to make is not an argument for delaying the first set of incremental improvements. In our case, we want to be able to install postgres at dozens (ideally hundreds... no, thousands :) ) of customer sites, where the customers in general are not going to have anyone onsite who has a clue about postgres. The existing contrib autovacuum gives a good solution to setting things up to maintain the database in a reasonable state of health without need for further intervention from us. It's not perfect, of course, but if it means the difference between having to unleash our support team on a customer once a month and once a year, that's a good deal for us. Having it integrated into the backend will make it much easier for us, we (hopefully...) won't have to fiddle with extra startup scripts, and we'll have one fewer point of failure (eg some customer might accidentally turn off the separate pg_autovacuum daemon). Being able to customise the autovacuum parameters on a per-table basis is also attractive. Just my AUD0.02. I realise that keeping _our_ customers happy is not necessarily anyone else's priority. I'd like to be able to invest some coding time, but can't. I haven't even gotten around to completing Gavin's survey form (sorry Gav, I'll get to it soon, I hope! :)), so I can't demand to be listened to. But for what it's worth, Alvaro, please keep going, don't be dissuaded. Tim -- ----------------------------------------------- Tim Allen tim@proximity.com.au Proximity Pty Ltd http://www.proximity.com.au/
"Tom Lane" <tgl@sss.pgh.pa.us> writes: > > Yeah --- a libpq-based solution is not what I think of as integrated at > all, because it cannot do anything that couldn't be done by the existing > external autovacuum process. About all you can buy there is having the > postmaster spawn the autovacuum process, which is slightly more > convenient to use but doesn't buy any real new functionality. > One reason of not using lib-pq is that this one has to wait for the completion of each vacuum (we don't has async execution in libpq right?), but by signaling does not. But by signaling, we have to detect that if the forked backend successfully done its job. I am not sure how to easily incorporate this into current signaling framework. Regards, Qingqing
Gavin Sherry wrote: > In January I was in Toronto with Jan, Tom and others and some ideas about > vacuum were being discussed. The basic idea is that when we dirty pages, > we set a bit in a bitmap to say that the page has been dirtied. A > convenient place to do this is when we are writing dirty buffers out to > disk. In many situations, this can happen inside the bgwriter meaning that > there should be little contention for this bitmap. Of course, individual > backends may be writing pages out and would have to account for the > dirty pages at that point. > > Now this bitmap can occur on a per heap segment basis (ie, per 1 GB heap > file). You only need 2 pages for the bitmap to represent all the pages in > the segment, which is fairly nice. When vacuum is run, instead of visiting > every page, it would see which pages have been dirtied in the bitmap and > visit only those pages. With large tables and small numbers of modified > tuples/pages, the effect this change would have would be pretty > impressive. Added to TODO: * Create a bitmap of pages that need vacuuming Instead of sequentially scanning the entire table, have the background writer or some other process record pages that have expired rows, then VACUUM can look at just those pages rather than the entire table. In the event of a system crash, the bitmap would probably be invalidated. -- Bruce Momjian | http://candle.pha.pa.us pgman@candle.pha.pa.us | (610) 359-1001+ If your life is a hard drive, | 13 Roberts Road + Christ can be your backup. | Newtown Square, Pennsylvania 19073
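As a sanity check on the "2 pages per segment" figure quoted above, assuming the default 8 KB block size: a 1 GB segment holds 131072 pages, and one bit per page gives a 16 KB bitmap, exactly two 8 KB pages. In code:

/* Back-of-envelope check of the bitmap size, assuming 8 KB blocks. */
#include <stdio.h>

int
main(void)
{
    const long blcksz = 8192;                   /* default BLCKSZ */
    const long segment = 1024L * 1024L * 1024L; /* 1 GB heap segment */
    long pages = segment / blcksz;              /* 131072 pages */
    long bitmap_bytes = pages / 8;              /* 1 bit per page: 16384 */
    long bitmap_pages = (bitmap_bytes + blcksz - 1) / blcksz;   /* 2 */

    printf("%ld pages -> %ld bitmap bytes -> %ld bitmap pages\n",
           pages, bitmap_bytes, bitmap_pages);
    return 0;
}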
Gavin, For the record, I don't consider myself a PostgreSQL newbie, nor do I manage any 2 TB databases (much less tables), but I do have an unusual production use case: thousands (> 10,000) of tables, many of them inherited, and many of them with hundreds of thousands (a few with millions) of rows. Honestly, creating crontab vacuum management for this scenario would be a nightmare, and pg_autovacuum has been a godsend. Considering the recent revelations of O(n^2) iterations over table lists in the current versions and the stated and apparent ease with which this problem could be solved by integrating the basic functionality of pg_autovacuum into the backend, I can personally attest to there being real-world use cases that would benefit tremendously from integrated autovacuum. A few months ago, I attempted to solve the wrong problem by converting a hardcoded threshold into another command-line option. If I had spotted the O(n^2) problem, I might've spent the time working on it then instead of the new command-line option. I suppose it's possible that I'll head down this road anyway if it looks like integrated pg_autovacuum is going to be put on hold indefinitely after this discussion. Anyway, just wanted to throw out some food for thought for the practicality of a tool like pg_autovacuum. -- Thomas F. O'Connell Co-Founder, Information Architect Sitening, LLC Strategic Open Source: Open Your i™ http://www.sitening.com/ 110 30th Avenue North, Suite 6 Nashville, TN 37203-6320 615-260-0005 On Jun 16, 2005, at 5:22 PM, Gavin Sherry wrote: > On Thu, 16 Jun 2005, Alvaro Herrera wrote: > > >> On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote: >> >> >>> 2) By no fault of its own, autovacuum's level of granularity is >>> the table >>> level. For people dealing with non-trivial amounts of data (and >>> we're not >>> talking gigabytes or terabytes here), this is a serious drawback. >>> Vacuum >>> at peak times can cause very intense IO bursts -- even with the >>> enhancements in 8.0. I don't think the solution to the problem is >>> to give >>> users the impression that it is solved and then vacuum their >>> tables during >>> peak periods. I cannot stress this enough. >>> >> >> People running systems with petabyte-sized tables can disable >> autovacuum >> for those tables, and leave it running for the rest. Then they can >> schedule whatever maintenance they see fit on their gigantic tables. >> Trying to run a database with more than a dozen gigabytes of data >> without expert advice (or at least reading the manual) would be >> extremely stupid anyway. >> > > As I've said a few times, I'm not concerned about such users. I'm > concerned about users with some busy tables of a few hundred > megabytes. I > still don't think VACUUM at arbitary times on such tables is suitable. > > Thanks, > > Gavin
Josh Berkus wrote: > Josh, > > >>Just my own two cents. First I am not knocking the work that has been done on >>autovacuum. I am sure that it was a leap on its own to get it to work. >>However I will say that I just don't see the reason for it. > > > I've personally seen at least a dozen user requests for "autovacuum in the > backend", and had this conversation about 1,100 times: > > NB: "After a week, my database got really slow." > Me: "How often are you running VACUUM ANALYZE?" > NB: "Running what?" Can't argue with that except... RTFM ;). I am not saying it doesn't have validity. I am just saying that if you actually pay attention to PostgreSQL and maintain it, you don't need it ;) Sincerely, Joshua D. Drake >
Matthew T. O'Connor wrote: > Joshua D. Drake wrote: > >> Just my own two cents. First I am not knocking the work that has been done >> on autovacuum. I am sure that it was a leap on its own to get it to >> work. However I will say that I just don't see the reason for it. > > > The major reasons for autovacuum as I see it are as follows: > > * Reduces administrative overhead having to keep track of what tables > need to be vacuumed how often. Creates more overhead and thus reduces performance. > * Reduces the total amount of time the system spends vacuuming since it > only vacuums when needed. Can be easily done with cron. > * Keeps stats up-to-date automatically Which can be done with cron > * Eliminates newbie confusion RTFM > * Eliminates one of the criticisms that the public has against > PostgreSQL (justified or not) Agreed. Just so everyone knows from the get-go here: I am purposely playing a little devil's advocate. Autovacuum has some drawbacks. I think we should be **publicly** aware of them before we pursue integration. Heaven knows it would make my life easier if it were integrated, but anyway... Sincerely, Joshua D. Drake > > Also, as VACUUM improves, autovacuum will improve with it. >
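For concreteness, the cron-based alternative being argued over here usually looks something like the following /etc/crontab entries (illustrative only; the database and table names are made up):

# Nightly vacuum + analyze of everything, off-peak:
0 3 * * *    postgres  vacuumdb --all --analyze --quiet
# A busy table that needs much more frequent attention:
*/15 * * * * postgres  psql -d appdb -c "VACUUM ANALYZE hot_table"

The weakness both sides are circling is visible right in the snippet: the schedule and the table list are frozen at the moment the admin writes them, which is exactly the "workload changes" problem raised below.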
>> * Reduces the total amount of time the system spends vacuuming since >> it only vacuums when needed. > > > Can be easily done with cron. > >> * Keeps stats up-to-date automatically > > > Which can be done with cron > >> * Eliminates newbie confusion > > > RTFM > >> * Eliminates one of the criticisms that the public has against >> PostgreSQL (justified or not) > > > Agreed. A few weeks ago I set up a database with more than 1,800 tables (some complex business thing). Inventing a clever cron-vacuum strategy is almost impossible (or at least very painful). There should be a mechanism (fortunately there is pg_autovacuum) to make this a bit more practical. In the case of small databases this is not an issue. Small is always simple. Complex and large are the major challenges. best regards, hans -- Cybertec Geschwinde u Schoenig Schoengrabern 134, A-2020 Hollabrunn, Austria Tel: +43/664/393 39 74 www.cybertec.at, www.postgresql.at
Qingqing Zhou wrote: > "Tom Lane" <tgl@sss.pgh.pa.us> writes: > >>Yeah --- a libpq-based solution is not what I think of as integrated at >>all, because it cannot do anything that couldn't be done by the existing >>external autovacuum process. About all you can buy there is having the >>postmaster spawn the autovacuum process, which is slightly more >>convenient to use but doesn't buy any real new functionality. >> > > > One reason for not using libpq is that it has to wait for the > completion of each vacuum (we don't have async execution in libpq, right?), There *is* async execution in libpq, and it works. Regards, Andreas
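For the record, the asynchronous calls have been in libpq for a long time. A minimal sketch of an asynchronous VACUUM follows; the connection string is illustrative, and real code would select() on PQsocket(conn) rather than spinning on PQisBusy():

/* Minimal sketch of an asynchronous VACUUM via libpq. */
#include <stdio.h>
#include <libpq-fe.h>

int
main(void)
{
    PGconn   *conn = PQconnectdb("dbname=postgres");
    PGresult *res;

    if (PQstatus(conn) != CONNECTION_OK)
    {
        fprintf(stderr, "connection failed: %s", PQerrorMessage(conn));
        return 1;
    }

    if (!PQsendQuery(conn, "VACUUM ANALYZE"))
    {
        fprintf(stderr, "send failed: %s", PQerrorMessage(conn));
        PQfinish(conn);
        return 1;
    }

    /* the vacuum now runs without blocking us; do other work here */

    while (PQisBusy(conn))
    {
        if (!PQconsumeInput(conn))      /* see note above about select() */
            break;
    }

    while ((res = PQgetResult(conn)) != NULL)
    {
        if (PQresultStatus(res) != PGRES_COMMAND_OK)
            fprintf(stderr, "vacuum failed: %s", PQerrorMessage(conn));
        PQclear(res);
    }

    PQfinish(conn);
    return 0;
}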
> > The major reasons for autovacuum as I see it are as follows: > > > > * Reduces administrative overhead having to keep track of what tables > > need to be vacuumed how often. > > Creates more overhead and thus reduces performance. Or reduces vacuum overhead, because the vacuum strategy is much better than it was when you used cron. Especially as people get a chance to improve autovac. > > * Reduces the total amount of time the system spends vacuuming since it > > only vacuums when needed. > > Can be easily done with cron. Can you do partial table vacuums with cron? Can you work out the smartest time to vacuum with cron? I thought it just scheduled tasks at certain times. > > > * Keeps stats up-to-date automatically > > Which can be done with cron And what is the management strategy for adjusting analyze when things change that you weren't aware of? (eg, big table changes that were unexpected) > > > * Eliminates newbie confusion > > RTFM RTFM = MySQL in a lot of cases, to be honest. > > > * Eliminates one of the criticisms that the public has against > > PostgreSQL (justified or not) > > Agreed. This is really the same as the previous RTFM question/response. People criticise because vacuum is foreign to them, and for newbies that equals too hard - next db please. As much as it is a technical issue, it's an advocacy issue too. Plus we finally get XID wraparound protection. We decided that for 8.1 we needed some protection, which I think Tom committed. This again may be a newbie thing. But there are a lot of newbies out there then. We've seen this problem pop up a number of times on the lists and on IRC. And people say "Why didn't it tell me?" - RTFM is not exactly what they want to hear, given that they thought they had read the manual and just missed understanding that bit. > > Just so everyone knows from the get-go here: I am purposely playing a > little devil's advocate. Autovacuum has some drawbacks. I think we should > be **publicly** aware of them before we pursue integration. It does have a number of issues. But I feel the integration issue is being addressed with a very short-term view. There are a lot of patches, tweaks and changes that just can't be made until it is integrated. The usefulness of some of the vacuum ideas that have been presented in the past will then be able to become a reality. The dead space map is a perfect example. People have talked about it for most of the time I've been around. But until we have an integrated vacuum, none of that can really happen. > > Heaven knows it would make my life easier if it were integrated, but anyway... > I understand these are not necessarily Josh's views, but I thought I would offer comments on them. > Sincerely, > > Joshua D. Drake > Regards Russell Smith > > > > > > > Also, as VACUUM improves, autovacuum will improve with it. > > Or because of autovacuum, vacuum and autovacuum will improve.
> Added to TODO: > > * Create a bitmap of pages that need vacuuming > > Instead of sequentially scanning the entire table, have the background > writer or some other process record pages that have expired rows, then > VACUUM can look at just those pages rather than the entire table. In > the event of a system crash, the bitmap would probably be invalidated. > Further to this, is there any use case for allowing the FSM, or this DSM, to spill to disk if the space fills up? It would allow unusual changes to the db not to lose space. You could just load part of the overflow from the disk back into the FSM in memory and continue using free space. Regards Russell Smith
> >4) Related to this, I guess, is that a user's FSM settings might be > >completely inappropriate. The 'Just read the manual' or 'Just read the > >logs' argument doesn't cut it, because the main argument for autovacuum in > >the backend is that people do not and will not. > > > > > > Agreed, it doesn't solve all problems, and I'm not arguing that the > integration of AV makes PostgreSQL newbie-safe; it just helps reduce the > newbie problem. Again, if the default FSM settings are inappropriate > for a database then the user is probably doing something more > complicated than a "my cat Minka" database and will need to learn some > tuning skills anyway. > > >5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If > >we're telling users about VACUUM less often than we are now, there's bound > >to be bloating issues (see 4). > > > > > But what's stopping the implementation of a partial VACUUM FULL, where we lock the table, move enough blocks to shorten the relation so that there is, say, < 10% bloat, or whatever is appropriate for that table. Or even just shorten the table by a few blocks, and repeat the process when you have some time. > Not totally true, regular VACUUM can shrink tables a little (I think it > can only cut off free space at the end of the table, without moving data > around). But if AV is on and the settings are reasonable, then a table shouldn't bloat much or at all. Also, I don't > think we are telling people to VACUUM less; in fact tables that need it > will usually get VACUUM'd more, we are just telling the users that if > they turn AV on, they don't have to manage all the VACUUMing. Regards Russell Smith
On Fri, 17 Jun 2005 06:26 pm, Andreas Pflug wrote: > Qingqing Zhou wrote: > > "Tom Lane" <tgl@sss.pgh.pa.us> writes: > > > >>Yeah --- a libpq-based solution is not what I think of as integrated at > >>all, because it cannot do anything that couldn't be done by the existing > >>external autovacuum process. About all you can buy there is having the > >>postmaster spawn the autovacuum process, which is slightly more > >>convenient to use but doesn't buy any real new functionality. > >> > > > > > > One reason for not using libpq is that it has to wait for the > > completion of each vacuum (we don't have async execution in libpq, right?), > > There *is* async execution in libpq, and it works. I would have thought the main reason for not using libpq is that it locks you into only using commands that are available to all users via SQL. If you don't use libpq, you open up the ability to use functions that can make use of information available to the backend, and to run functions in ways that are not possible via SQL. Regards Russell Smith.
On Fri, 17 Jun 2005, Russell Smith wrote: > > Added to TODO: > > > > * Create a bitmap of pages that need vacuuming > > > > Instead of sequentially scanning the entire table, have the background > > writer or some other process record pages that have expired rows, then > > VACUUM can look at just those pages rather than the entire table. In > > the event of a system crash, the bitmap would probably be invalidated. > > > Further to this, is there any use case for allowing the FSM, or this DSM, to spill to disk > if the space fills up? It would allow unusual changes to the db > not to lose space. You could just load part of the overflow from the disk back > into the FSM in memory and continue using free space. FSM spilling to disk would be a problem. The reason is that when we need to allocate an empty page, we hit the FSM first. If that operation becomes disk bound, large updates and inserts are going to really suck from a performance point of view. The idea I discussed is disk backed, because it's the first few pages of every heap segment. This map doesn't mean that pages are free. It means they've been modified. Gavin
On Fri, 17 Jun 2005, Russell Smith wrote: > > >4) Related to this, I guess, is that a user's FSM settings might be > > >completely inappropriate. The 'Just read the manual' or 'Just read the > > >logs' argument doesn't cut it, because the main argument for autovacuum in > > >the backend is that people do not and will not. > > > > > > > > > > Agreed, it doesn't solve all problems, and I'm not arguing that the > > integration of AV makes PostgreSQL newbie-safe; it just helps reduce the > > newbie problem. Again, if the default FSM settings are inappropriate > > for a database then the user is probably doing something more > > complicated than a "my cat Minka" database and will need to learn some > > tuning skills anyway. > > > > >5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If > > >we're telling users about VACUUM less often than we are now, there's bound > > >to be bloating issues (see 4). > > > > > > > > > But what's stopping the implementation of a partial VACUUM FULL, where we lock the table, > move enough blocks to shorten the relation so that there is, say, < 10% bloat, or whatever is > appropriate for that table. Or even just shorten the table by a few blocks, and repeat the process > when you have some time. It's a question of where you start off from, again. You cannot just say 'I've vacuumed the first 100 pages' because it could well have changed underneath you. Gavin
Joshua D. Drake wrote: > Josh Berkus wrote: > >> I've personally seen at least a dozen user requests for "autovacuum >> in the backend", and had this conversation about 1,100 times: >> >> NB: "After a week, my database got really slow." >> Me: "How often are you running VACUUM ANALYZE?" >> NB: "Running what?" > > > Can't argue that except... RTFM ;). I am not saying it doesn't have a > validity. I am just saying that if you actually pay attention to > PostgreSQL and maintain it, you don't need it ;) I think everyone on this list would agree with you. The only reason I think the newbie protection is important (and I don't think it's the most important reason for autovacuum) is that perception is reality to some extent. Valid or not we still suffer from a reputation of being more complicated and slower than mysql. Steps towards reducing / eliminating that perception can only be good for us as I think lots of developers make their first database decision based solely on their perceptions and then just stick with what they know.
Joshua D. Drake wrote: > Matthew T. O'Connor wrote: > >> The major reasons for autovacuum as I see it are as follows: >> >> * Reduces administrative overhead having to keep track of what tables >> need to be vacuumed how often. > > > Creates more overhead and thus reduces performance. In the general case, I disagree. Overall, having your tables vacuumed and analyzed only when needed, and never when not needed, can only reduce system overhead. Granted, there are limitations in the contrib version of autovacuum, some of which go away in the integrated case. >> * Reduces the total amount of time the system spends vacuuming since >> it only vacuums when needed. > > Can be easily done with cron. Really? What happens when your load / usage patterns change? When a table is added that gets heavily used? >> * Keeps stats up-to-date automatically > > > Which can be done with cron Same response as above. >> * Eliminates newbie confusion > > > RTFM ;-) >> * Eliminates one of the criticisms that the public has against >> PostgreSQL (justified or not) > > > Agreed. > > > Just so everyone knows from the get-go here: I am purposely playing a > little devil's advocate. Autovacuum has some drawbacks. I think we should > be **publicly** aware of them before we pursue integration. Understood.
Russell Smith wrote:
>>> * Reduces the total amount of time the system spends vacuuming since
>>> it only vacuums when needed.
>>
>> Can be easily done with cron.
>
> Can you do partial table vacuums with cron? Can you work out the
> smartest time to vacuum with cron? I thought it just scheduled tasks at
> certain times.

To be fair, autovacuum can't do partial table vacuums either; in fact,
nothing can right now. Perhaps someday something like this will be
feasible.

>>> * Eliminates one of the criticisms that the public has against
>>> PostgreSQL (justified or not)
>>
>> Agreed.
>
> This is really the same as the previous RTFM question/response. People
> criticise because vacuum is foreign to them, and for newbies that equals
> "too hard, next db please". As much as it is a technical issue, it's an
> advocacy issue too.

This bullet point is absolutely an advocacy issue. Every developer that
says "next db please" will probably not come back to PostgreSQL for quite
some time, thus bolstering the userbase of the competition and reducing
the userbase of PostgreSQL.

> Plus we finally get XID wraparound protection. We finally decided that
> for 8.1 we needed some protection, which I think Tom committed. This
> again may be a newbie thing, but then there are a lot of newbies out
> there. We've seen this problem pop up a number of times on the lists and
> on IRC. And people say "Why didn't it tell me?". RTFM is exactly what
> they want to hear, or they thought they had read the manual and missed
> understanding that bit.

I think this point hasn't been stressed enough. With nested transactions
these days (not to mention faster hardware), I can see XID wraparound
becoming a much bigger issue.
Russell Smith wrote:
> On Fri, 17 Jun 2005 06:26 pm, Andreas Pflug wrote:
>> Qingqing Zhou wrote:
>>> One reason for not using libpq is that this one has to wait for the
>>> completion of each vacuum (we don't have async execution in libpq,
>>> right?)
>>
>> There *is* async execution in libpq, and it works.
>
> I would have thought the main reason for not using libpq is that it
> locks you into using only commands that are available to all users via
> SQL. If you don't use libpq, you open up the ability to use functions
> that can make use of information available to the backend, and also to
> run functions in a way that is not possible via SQL.

Right now we don't really want to fire off more than one VACUUM at a
time, since it would create a more substantial I/O storm than we already
have with vacuum. Perhaps with the introduction of vacuum delay settings
and tablespaces we could / should rethink this, but for now it's the
easiest way to go.

As for the standard SQL issue, FSM data (or anything else we might want)
could be exported via regular SQL through some type of superuser-only
system function, so that isn't really the issue. I don't remember all the
details, but you can look at the discussion from when my patch was
rejected (around July of 2004). People just didn't like including libpq
into the backend, for reasons I don't remember. I don't *think* this is
up for discussion; I *think* autovacuum has to work without libpq if it
is going to be accepted.

Matthew
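For reference, the asynchronous API Andreas mentions does exist in libpq.
A minimal sketch follows; the connection string is a placeholder, error
handling is simplified, and a real program would select() on PQsocket()
instead of spinning:

    /* Issue a VACUUM without blocking, then poll for completion. */
    #include <stdio.h>
    #include <libpq-fe.h>

    int main(void)
    {
        PGconn   *conn = PQconnectdb("dbname=test");  /* placeholder DSN */
        PGresult *res;

        if (PQstatus(conn) != CONNECTION_OK)
        {
            fprintf(stderr, "%s", PQerrorMessage(conn));
            return 1;
        }

        PQsendQuery(conn, "VACUUM ANALYZE");   /* returns immediately */

        while (PQisBusy(conn))
        {
            PQconsumeInput(conn);   /* other work could happen here */
        }

        while ((res = PQgetResult(conn)) != NULL)
            PQclear(res);           /* drain results until NULL */

        PQfinish(conn);
        return 0;
    }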
swm@linuxworld.com.au (Gavin Sherry) wrote:
> I guess the main point is, if something major like this ships in the
> backend it says to users that the problem has gone away. pg_autovacuum
> is a good contrib-style solution: it addresses a problem users have and
> attempts to solve it the way other users might try and solve it. When
> you consider it in the backend, it looks like a workaround. I think
> users are better served by solving the real problem.

Hear, hear!

It seems to me that the point in time at which it is *really* appropriate
to put this into the backend is when the new GUC variable
"dead_tuple_map_size" (akin to FSM) is introduced, and there is a new
sort of 'VACUUM DEAD TUPLES' command which goes through the DTPM (Dead
Tuple Page Map).

In THAT case, there would be the ability to do a VACUUM on the "dead
bits" of the table that consists of 50M rows without having to go through
the 49M rows that haven't been touched in months.

--
"cbbrowne","@","gmail.com" http://linuxfinances.info/info/languages.html
"I can't escape the sensation that I have already been thinking in Lisp
all my programming career, but forcing the ideas into the constraints of
bad languages, which explode those ideas into a bewildering array of
details, most of which are workarounds for the language." -- Kaz Kylheku
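A minimal sketch of the data structure being proposed, with every name
hypothetical: one bit per heap page, set whenever a tuple on that page
expires, so a targeted vacuum can visit only the marked pages.

    /* Hypothetical dead-tuple page map: a bitmap over heap pages.
     * Not real backend code; sizes and names are invented. */
    #include <stdio.h>
    #include <string.h>

    #define MAX_PAGES 4096

    static unsigned char dtpm[MAX_PAGES / 8];

    static void mark_dead(int page) { dtpm[page / 8] |= 1 << (page % 8); }
    static int  is_marked(int page) { return dtpm[page / 8] & (1 << (page % 8)); }

    int main(void)
    {
        /* only a few pages out of 4096 accumulate dead tuples */
        mark_dead(17);
        mark_dead(17);      /* setting a bit twice is harmless */
        mark_dead(2048);

        /* a "VACUUM DEAD TUPLES" pass visits only the marked pages */
        for (int p = 0; p < MAX_PAGES; p++)
            if (is_marked(p))
                printf("would vacuum page %d\n", p);

        memset(dtpm, 0, sizeof(dtpm));   /* clear the map afterwards */
        return 0;
    }

With a 50M-row table, this is the difference between scanning every page
and touching only the handful that changed, which is the point of the
50M-row example above.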
"Matthew T. O'Connor" <matthew@zeut.net> writes: > ... People just didn't like including libpq > into the backend for reasons I don't remember. One reason I can think of is that there would be global-symbol conflicts --- libpq has copies of some backend routines, but they are not identical. In any case, the argument that's being made here is that an integrated autovac would be simple and newbie-friendly. I think it's impossible for a libpq-using autovac to ever be invisible to the DBA, if only because he has to configure it with an appropriate username/password, make sure pg_hba.conf will actually let it into every database, etc. There are way too many foot-guns in that scenario for my taste. regards, tom lane
On Fri, 2005-06-17 at 00:03 -0700, Joshua D. Drake wrote:
> Matthew T. O'Connor wrote:
>> Joshua D. Drake wrote:
>>
>>> Just my own two cents. First, I am not knocking the work that has
>>> been done on autovacuum. I am sure that it was a leap on its own to
>>> get it to work. However, I will say that I just don't see the reason
>>> for it.
>>
>> The major reasons for autovacuum as I see it are as follows:
>>
>> * Reduces administrative overhead of having to keep track of what
>> tables need to be vacuumed how often.
>
> Creates more overhead and thus reduces performance.

Agreed, except I have a number of tables that are over-scheduled with
cron, because about once a week (at a random time) they go through heavy
enough churn to require the more frequent vacuum, but the rest of the
time they're fairly quiet. It's not ideal, but autovacuum was the
difference between a 10-minute vacuum running 24/7 and a 10-minute vacuum
on the table only during the two or so heavy hours, whenever they happen
to appear. In the case of unexpected or uneven loads, cron isn't
particularly useful.

--
Josh,

> Just so everyone knows from the get-go here: I am purposely playing a
> little devil's advocate.

Well, please stop it. We discussed AV over a year ago, when we ran out of
time to integrate it with 8.0. This discussion now is hindering any
discussion of what needs to be *done* to integrate it. This isn't a
debating society.

Folks, I'm sorry to be so grumpy about this, but so far 80% of the posts
on this thread have been re-arguing a discussion we had in 2004. Which
isn't helping Alvaro get anything done.

--
Josh Berkus
Aglio Database Solutions
San Francisco
Christopher Browne wrote:
> swm@linuxworld.com.au (Gavin Sherry) wrote:
>> I guess the main point is, if something major like this ships in the
>> backend it says to users that the problem has gone away. pg_autovacuum
>> is a good contrib-style solution: it addresses a problem users have
>> and attempts to solve it the way other users might try and solve it.
>> When you consider it in the backend, it looks like a workaround. I
>> think users are better served by solving the real problem.
>
> Hear, hear!
>
> It seems to me that the point in time at which it is *really*
> appropriate to put this into the backend is when the new GUC variable
> "dead_tuple_map_size" (akin to FSM) is introduced, and there is a new
> sort of 'VACUUM DEAD TUPLES' command which goes through the DTPM (Dead
> Tuple Page Map).
>
> In THAT case, there would be the ability to do a VACUUM on the "dead
> bits" of the table that consists of 50M rows without having to go
> through the 49M rows that haven't been touched in months.

This will make VACUUM less painful, but it doesn't eliminate the need /
desire for autovacuum. I agree this would be good, but I see it as a
separate issue.
On Fri, Jun 17, 2005 at 12:21:44PM -0400, Matthew T. O'Connor wrote:
<snip more stuff about how we need to track pages with dead tuples>
> This will make VACUUM less painful, but it doesn't eliminate the need /
> desire for autovacuum. I agree this would be good, but I see it as a
> separate issue.

Not only is it a separate issue, but there's also no way it could
possibly be done for 8.1, whereas autovacuum most likely will make it
into 8.1. Additionally, there are noted improvements that come about by
putting autovacuum in the backend instead of leaving it in contrib. And
as others have mentioned numerous times, any improvements made to vacuum
will help out autovacuum as well. There simply isn't a downside to
putting it in the backend that anyone's brought up.

Autovacuum was originally scheduled for 8.0. There have been plans to put
it in the backend for close to 2 years now. There's no reason at all to
push it out any farther.

--
Jim C. Nasby, Database Consultant               decibel@decibel.org
Give your computer some brain candy! www.distributed.net Team #1828

Windows: "Where do you want to go today?"
Linux: "Where do you want to go tomorrow?"
FreeBSD: "Are you guys coming, or what?"
Gavin Sherry wrote:
> On Wed, 15 Jun 2005, Bruce Momjian wrote:
>> I am going to start working on it. I am concerned it is a big job.
>> I will post questions as I find them, and the one below is a good one.
>
> I'm wondering if effort is being misdirected here. I remember when Mark
> Wong at OSDL was running pg_autovacuum during a dbt run; he was seeing
> significant performance loss -- I think on the order of 30% to 40% (I
> will try and dig up a link to the results).
>
> I think these results can be dramatically improved if the focus is on a
> more effective vacuum.

Let's look at what TODO has for vacuum and how autovacuum fits that:

Vacuum
======

* Improve speed with indexes

  For large table adjustments during vacuum, it is faster to reindex
  rather than update the index. This is something we should figure out
  how to do automatically.

* Reduce lock time by moving tuples with read lock, then write lock and
  truncate table

  Moved tuples are invisible to other backends so they don't require a
  write lock. However, the read lock promotion to write lock could lead
  to deadlock situations.

  The deadlock problem here seems bad.

* -Add a warning when the free space map is too small

  Done.

* Maintain a map of recently-expired rows

  This allows vacuum to target specific pages for possible free space
  without requiring a sequential scan. I think of this as a secondary
  free space map (FSM): instead of recording rows/pages that have free
  space, we record rows/pages that have expired rows which might be free
  for reuse once all transactions where they are visible have completed.

* Auto-fill the free space map by scanning the buffer cache or by
  checking pages written by the background writer

  This could be used to populate the secondary FSM above.

* Create a bitmap of pages that need vacuuming

  Instead of sequentially scanning the entire table, have the background
  writer or some other process record pages that have expired rows; then
  VACUUM can look at just those pages rather than the entire table. In
  the event of a system crash, the bitmap would probably be invalidated.

  This is an alternative to the FSM above: it tracks _all_ possible free
  space rather than just a limited amount like a secondary FSM in shared
  memory.

* Auto-vacuum
  o Move into the backend code
  o Use free-space map information to guide refilling
  o Do VACUUM FULL if table is nearly empty?

It seems no matter what TODO items we complete above, we will need some
type of automatic vacuum to direct the filling of the free space map. It
might be done using a different method than a sequential-scan vacuum, but
it will be needed, so we are good to integrate autovacuum now and then
improve how it does its job in future releases.

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073
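To make that last point concrete, here is a very rough sketch of what the
control loop of an integrated autovacuum might look like. Every function
and constant below is hypothetical; the real version would read the stats
collector's data, keep per-table state in a backend hash table, and call
vacuum directly through backend APIs (no libpq) rather than these stubs:

    /* Hypothetical control loop for an integrated autovacuum process;
     * the two stubs stand in for reading stats and calling vacuum. */
    #include <stdio.h>
    #include <unistd.h>

    #define NAPTIME_SECS 1      /* illustrative; a real naptime is longer */

    static int needs_attention(int table)
    {
        return table % 2 == 0;  /* stub for a real threshold test */
    }

    static void vacuum_table(int table)
    {
        printf("vacuuming table %d\n", table);  /* stub for vacuum() */
    }

    int main(void)
    {
        int ntables = 4;

        for (int round = 0; round < 2; round++)  /* a daemon loops forever */
        {
            for (int t = 0; t < ntables; t++)
                if (needs_attention(t))
                    vacuum_table(t);
            sleep(NAPTIME_SECS);
        }
        return 0;
    }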