Thread: Hot Standby tuning for btree_xlog_vacuum()

Hot Standby tuning for btree_xlog_vacuum()

From
Simon Riggs
Date:
Simple tuning of btree_xlog_vacuum() using an idea I had a while back,
just never implemented. XXX comments removed.

Allows us to avoid reading in blocks during VACUUM replay that are only
required for correctness of index scans.

Objections to commit?

--
 Simon Riggs           www.2ndQuadrant.com

Attachment

Re: Hot Standby tuning for btree_xlog_vacuum()

From
Tom Lane
Date:
Simon Riggs <simon@2ndQuadrant.com> writes:
> Objections to commit?

This is not the time to be hacking stuff like this.  You haven't even
demonstrated that there's a significant performance issue here.
        regards, tom lane


Re: Hot Standby tuning for btree_xlog_vacuum()

From
Jim Nasby
Date:
On Apr 29, 2010, at 3:20 PM, Tom Lane wrote:
> Simon Riggs <simon@2ndQuadrant.com> writes:
>> Objections to commit?
>
> This is not the time to be hacking stuff like this.  You haven't even
> demonstrated that there's a significant performance issue here.

I tend to agree that this point of the cycle isn't a good one to be making changes, but your performance statement
confusesme. If a fairly small patch means we can avoid un-necessary reads why shouldn't we avoid them? 
--
Jim C. Nasby, Database Architect                   jim@nasby.net
512.569.9461 (cell)                         http://jim.nasby.net




Re: Hot Standby tuning for btree_xlog_vacuum()

From
Tom Lane
Date:
Jim Nasby <decibel@decibel.org> writes:
> On Apr 29, 2010, at 3:20 PM, Tom Lane wrote:
>> This is not the time to be hacking stuff like this.  You haven't even
>> demonstrated that there's a significant performance issue here.

> I tend to agree that this point of the cycle isn't a good one to be making changes, but your performance statement
confusesme. If a fairly small patch means we can avoid un-necessary reads why shouldn't we avoid them?
 

Well, by "time of the cycle" I meant "the day before beta1".  I'm not
necessarily averse to making such a change at some point when it would
get more than no testing before hitting our long-suffering beta testers.
But I'd still want to see some evidence that there's a significant
performance improvement to be had.
        regards, tom lane


Re: Hot Standby tuning for btree_xlog_vacuum()

From
Simon Riggs
Date:
On Mon, 2010-05-17 at 16:10 -0400, Tom Lane wrote:
> Jim Nasby <decibel@decibel.org> writes:
> > On Apr 29, 2010, at 3:20 PM, Tom Lane wrote:
> >> This is not the time to be hacking stuff like this.  You haven't even
> >> demonstrated that there's a significant performance issue here.
> 
> > I tend to agree that this point of the cycle isn't a good one to be making changes, but your performance statement
confusesme. If a fairly small patch means we can avoid un-necessary reads why shouldn't we avoid them?
 
> 
> Well, by "time of the cycle" I meant "the day before beta1".  I'm not
> necessarily averse to making such a change at some point when it would
> get more than no testing before hitting our long-suffering beta testers.
> But I'd still want to see some evidence that there's a significant
> performance improvement to be had.

That patch only applies to one record type. However, since we've used
Greg's design of spidering out to each heap record that can usually mean
150-200 random I/Os per btree delete. That will take some time, perhaps
1s per WAL record of this type on a very large I/O bound table. That's
enough to give me cause for concern without performance measurements.

To derive such measurements we'd need to instrument each record type,
which we don't do right now either.

It might be easier to have a look at the patch and see if you think its
worth the fuss of measuring it. 

I don't think this is the patch that will correct the potential/
partially observed context switching issue, but we have yet to recreate
that in lab conditions.

-- Simon Riggs           www.2ndQuadrant.com



Re: Hot Standby tuning for btree_xlog_vacuum()

From
Robert Haas
Date:
On Thu, Apr 29, 2010 at 4:12 PM, Simon Riggs <simon@2ndquadrant.com> wrote:
> Simple tuning of btree_xlog_vacuum() using an idea I had a while back,
> just never implemented. XXX comments removed.
>
> Allows us to avoid reading in blocks during VACUUM replay that are only
> required for correctness of index scans.

Review:

1. The block comment in XLogConfirmBufferIsUnpinned appears to be
copied from somewhere else, and doesn't really seem appropriate for a
new function since it refers to "the original coding of this routine".I think you could just delete the parenthesized
portionof the
 
comment.

2. This bit from ConfirmBufferIsUnpinned looks odd to me.

+         /*
+          * Found it.  Now, pin/unpin the buffer to prove it's unpinned.
+          */
+         if (PinBuffer(buf, NULL))
+             UnpinBuffer(buf, false);

I don't think pinning and unpinning the buffer is sufficient to
provide that it isn't otherwise pinned.  If the buffer isn't in shared
buffers at all, it seems clear that no one can have it pinned.  But if
it's present in shared buffers, it seems like you still need
LockBufferForCleanup().

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise Postgres Company


Re: Hot Standby tuning for btree_xlog_vacuum()

From
Simon Riggs
Date:
Just wanted to say thanks for the review, since I haven't yet managed to
fix and commit this. I expect to later this month.

On Mon, 2010-09-27 at 23:06 -0400, Robert Haas wrote:
> On Thu, Apr 29, 2010 at 4:12 PM, Simon Riggs <simon@2ndquadrant.com> wrote:
> > Simple tuning of btree_xlog_vacuum() using an idea I had a while back,
> > just never implemented. XXX comments removed.
> >
> > Allows us to avoid reading in blocks during VACUUM replay that are only
> > required for correctness of index scans.
> 
> Review:
> 
> 1. The block comment in XLogConfirmBufferIsUnpinned appears to be
> copied from somewhere else, and doesn't really seem appropriate for a
> new function since it refers to "the original coding of this routine".
>  I think you could just delete the parenthesized portion of the
> comment.
> 
> 2. This bit from ConfirmBufferIsUnpinned looks odd to me.
> 
> +         /*
> +          * Found it.  Now, pin/unpin the buffer to prove it's unpinned.
> +          */
> +         if (PinBuffer(buf, NULL))
> +             UnpinBuffer(buf, false);
> 
> I don't think pinning and unpinning the buffer is sufficient to
> provide that it isn't otherwise pinned.  If the buffer isn't in shared
> buffers at all, it seems clear that no one can have it pinned.  But if
> it's present in shared buffers, it seems like you still need
> LockBufferForCleanup().

-- Simon Riggs           http://www.2ndQuadrant.com/books/PostgreSQL Development, 24x7 Support, Training and Services