Re: CLOG extension - Mailing list pgsql-hackers

From Simon Riggs
Subject Re: CLOG extension
Date
Msg-id CA+U5nMKmqnkLD_jJ2YwJ78Q_tsGz6upCLSom_8f54PM4RBWkUA@mail.gmail.com
Whole thread Raw
In response to Re: CLOG extension  (Robert Haas <robertmhaas@gmail.com>)
Responses Re: CLOG extension
List pgsql-hackers
On 4 May 2012 13:59, Robert Haas <robertmhaas@gmail.com> wrote:
> On Fri, May 4, 2012 at 3:35 AM, Simon Riggs <simon@2ndquadrant.com> wrote:
>> On Thu, May 3, 2012 at 9:56 PM, Robert Haas <robertmhaas@gmail.com> wrote:
>>> On Thu, May 3, 2012 at 3:20 PM, Simon Riggs <simon@2ndquadrant.com> wrote:
>>>> Your two paragraphs have roughly opposite arguments...
>>>>
>>>> Doing it every 32 pages would give you 30 seconds to complete the
>>>> fsync, if you kicked it off when half way through the previous file -
>>>> at current maximum rates. So there is utility in doing it in larger
>>>> chunks.
>>>
>>> Maybe, but I'd like to try changing one thing at a time.  If we change
>>> too much at once, it's likely to be hard to figure out where the
>>> improvement is coming from.  Moving the task to a background process
>>> is one improvement; doing it in larger chunks is another.  Those
>>> deserve independent testing.
>>
>> You gave a good argument why background pre-allocation wouldn't work
>> very well if we do it a page at a time. I believe you.
>
> Your confidence is sort of gratifying, but in this case I believe it's
> misplaced.  On more careful analysis, it seems that ExtendCLOG() does
> just two things: (1) evict a CLOG buffer and replace it with a zero'd
> page representing the new page and (2)  write an XLOG record for the
> change.  Apparently, "extending" CLOG doesn't actually involve
> extending anything on disk at all.  We rely on the future buffer
> eviction to do that, which is surprisingly different from the way
> relation extension is handled.
>
> So CLOG extension is normally fast, but occasionally something goes
> wrong.

I don't agree its normally fast.

WALInsert contention is high, so there is usually a long queue. As
we've discussed this can be done offline and and so (2) can completely
avoided in the main line. Considering that all new xids wait for this
action, any wait at all is bad and takes time to drain once it clears.

Evicting a clog has cost because the tail is almost always dirty when
we switch pages.

Doing both of those will ensure switch to new page requires zero wait time.

So you have the solution. Not sure what else you're looking for.

--
 Simon Riggs                   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


pgsql-hackers by date:

Previous
From: Robert Haas
Date:
Subject: Re: Future In-Core Replication
Next
From: Simon Riggs
Date:
Subject: Re: Future In-Core Replication