Re: RC2 and open issues - Mailing list pgsql-hackers

From Gavin Sherry
Subject Re: RC2 and open issues
Date
Msg-id Pine.LNX.4.58.0412211604100.24276@linuxworld.com.au
Whole thread Raw
In response to Re: RC2 and open issues  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: RC2 and open issues
Re: RC2 and open issues
List pgsql-hackers
On Mon, 20 Dec 2004, Tom Lane wrote:

> Bruce Momjian <pgman@candle.pha.pa.us> writes:
> > Tom Lane wrote:
> >> Exactly.  But 1% would be uselessly small with this definition.  Offhand
> >> I'd think something like 50% might be a starting point; maybe even more.
> >> What that says is that a page isn't a candidate to be written out by the
> >> bgwriter until it's fallen halfway down the LRU list.
>
> > So we are not scanning by buffer address but using the LRU list?  Are we
> > sure they are mostly dirty?
>
> No.  The entire point is to keep the LRU end of the list mostly clean.
>
> Now that you mention it, it might be interesting to try the approach of
> doing a clock scan on the buffer array and ignoring the ARC lists
> entirely.  That would be a fundamentally different way of envisioning
> what the bgwriter is supposed to do, though.  I think the main reason
> Jan didn't try that was he wanted to be sure the LRU page was usually
> clean so that backends would seldom end up doing writes for themselves
> when they needed to get a free buffer.

Neil and I spoke with Jan briefly last week and he mentioned a few
different approaches he'd been tossing over. Firstly, for alternative
runs, start X% on from the LRU, so that we aren't scanning clean buffers
all the time. Secondly, follow something like the approach you've
mentioned above but remember the offset. So, if we're scanning 10%, after
10 runs we will have written out all buffers.

I was also thinking of benchmarking the effect of changing the algorithm
in StrategyDirtyBufferList(): currently, for each iteration of the loop we
read a buffer from each of T1 and T2. I was wondering what effect reading
T1 first then T2 and vice versa would have on performance. I haven't
thought about this too hard, though, so it might be wrong headed.


>
> Maybe we need a hybrid approach: clean a few percent of the LRU end of
> the ARC list in order to keep backends from blocking on writes, plus run
> a clock scan to keep checkpoints from having to do much.  But that's way
> beyond what we have time for in the 8.0 cycle.

Definately.

>
>             regards, tom lane


Thanks,

Gavin


pgsql-hackers by date:

Previous
From: Bruce Momjian
Date:
Subject: Re: RC2 and open issues
Next
From: Bruce Momjian
Date:
Subject: Re: RC2 and open issues