Re: Our "fallback" atomics implementation doesn't actually work - Mailing list pgsql-hackers

From	Andres Freund
Subject	Re: Our "fallback" atomics implementation doesn't actually work
Date	October 6, 2016 02:48:21
Msg-id	20161006024811.fcll3nnfmdn4qtpy@alap3.anarazel.de
In response to	Re: Our "fallback" atomics implementation doesn't actually work (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Our "fallback" atomics implementation doesn't actually work
List	pgsql-hackers

On 2016-10-05 15:02:09 -0400, Tom Lane wrote:
> Andres Freund <andres@anarazel.de> writes:
> > Without yet having analyzed this deeply, could it actually be that the
> > reason is that sem_post/wait aren't proper memory barriers?  On a glance
> > the symptoms look like values have been modified without proper locks...
>
> Hmm, possible ...

Hm. After a long battle of head vs. wall I think I see what the problem
is.  For the fallback atomics implementation I somehow had assumed that
pg_atomic_write_u32() doesn't need to lock, as it's just an unlocked
write.  But that's not true, because it has to cause a concurrent
pg_atomic_compare_exchange_u32() to fail.  The lack of this can leave a
"leftover" lockbit when UnlockBufHdr() occurs concurrently with an
operation using compare_exchange.
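
To make the race concrete, here is a minimal sketch of a lock-emulated
atomic.  It is not the attached patch and not the PostgreSQL source:
the emu_* names and the atomic_flag spinlock are assumptions made up for
illustration.  The emulated compare-exchange does its read, compare and
store under a lock, so a plain store that skips the lock can land
between the compare and the store and then be overwritten with a value
derived from the stale read.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical emulated atomic: a plain value guarded by a spinlock. */
typedef struct
{
    atomic_flag       lock;   /* emulation lock, not a buffer header lock */
    volatile uint32_t value;
} emu_atomic_u32;

void
emu_init(emu_atomic_u32 *p, uint32_t v)
{
    atomic_flag_clear(&p->lock);
    p->value = v;
}

static void
emu_lock(emu_atomic_u32 *p)
{
    /* spin until the flag was previously clear */
    while (atomic_flag_test_and_set_explicit(&p->lock, memory_order_acquire))
        ;
}

static void
emu_unlock(emu_atomic_u32 *p)
{
    atomic_flag_clear_explicit(&p->lock, memory_order_release);
}

/* Read, compare and store all happen while holding the emulation lock. */
bool
emu_compare_exchange(emu_atomic_u32 *p, uint32_t *expected, uint32_t newval)
{
    bool ok;

    emu_lock(p);
    ok = (p->value == *expected);
    if (ok)
        p->value = newval;
    else
        *expected = p->value;   /* report the value actually seen */
    emu_unlock(p);
    return ok;
}

/*
 * The write takes the same lock.  A bare "p->value = v" here could slip in
 * between the compare and the store above and be silently lost.
 */
void
emu_write(emu_atomic_u32 *p, uint32_t v)
{
    emu_lock(p);
    p->value = v;
    emu_unlock(p);
}
```

With the locked write, a compare-exchange that started from the old
value either completes before the write or observes the new value and
fails, which is the behaviour callers of compare_exchange rely on.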

For me the problem often takes a lot longer to reproduce (once only
after 40min).  Could you run with the attached patch and see whether
that fixes things for you?

Andres

Attachment

atomic-write-fallback.patch