On Fri, Oct 10, 2014 at 11:00 AM, Andres Freund <andres@2ndquadrant.com> wrote:
> On 2014-10-10 16:41:39 +0200, Andres Freund wrote:
>> FWIW, the profile always looks like
>> - 48.61% postgres postgres [.] s_lock
>> - s_lock
>> + 96.67% StrategyGetBuffer
>> + 1.19% UnpinBuffer
>> + 0.90% PinBuffer
>> + 0.70% hash_search_with_hash_value
>> + 3.11% postgres postgres [.] GetSnapshotData
>> + 2.47% postgres postgres [.] StrategyGetBuffer
>> + 1.93% postgres [kernel.kallsyms] [k] copy_user_generic_string
>> + 1.28% postgres postgres [.] hash_search_with_hash_value
>> - 1.27% postgres postgres [.] LWLockAttemptLock
>> - LWLockAttemptLock
>> - 97.78% LWLockAcquire
>> + 38.76% ReadBuffer_common
>> + 28.62% _bt_getbuf
>> + 8.59% _bt_relandgetbuf
>> + 6.25% GetSnapshotData
>> + 5.93% VirtualXactLockTableInsert
>> + 3.95% VirtualXactLockTableCleanup
>> + 2.35% index_fetch_heap
>> + 1.66% StartBufferIO
>> + 1.56% LockReleaseAll
>> + 1.55% _bt_next
>> + 0.78% LockAcquireExtended
>> + 1.47% _bt_next
>> + 0.75% _bt_relandgetbuf
>>
>> to me. Now that's with the client count 496, but it's similar with lower
>> counts.
>>
>> BTW, that profile *clearly* indicates we should make StrategyGetBuffer()
>> smarter.
>
> Which is nearly trivial now that atomics are in. Check out the attached
> WIP patch which eliminates the spinlock from StrategyGetBuffer() unless
> there are buffers on the freelist.
Is this safe?
+ victim = pg_atomic_fetch_add_u32(&StrategyControl->nextVictimBuffer, 1);
- if (++StrategyControl->nextVictimBuffer >= NBuffers)
+ buf = &BufferDescriptors[victim % NBuffers];
+
+ if (victim % NBuffers == 0)
<snip>
I don't think there's any guarantee that nextVictimBuffer can be kept
from wandering off the end. You could buy a little safety by CAS'ing
zero in instead, followed by the atomic increment. However, that's still
pretty dodgy IMO, and it requires two atomic ops, which will underperform
the spinlock in some situations.
I like Robert's idea to keep the spinlock but preallocate a small
number of buffers, say 8 or 16.
merlin