| From: | Andres Freund <andres(at)anarazel(dot)de> |
|---|---|
| To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
| Cc: | pgsql-hackers(at)postgresql(dot)org |
| Subject: | Re: Our "fallback" atomics implementation doesn't actually work |
| Date: | 2016-10-06 02:48:11 |
| Message-ID: | 20161006024811.fcll3nnfmdn4qtpy@alap3.anarazel.de |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
On 2016-10-05 15:02:09 -0400, Tom Lane wrote:
> Andres Freund <andres(at)anarazel(dot)de> writes:
> > Without yet having analyzed this deeply, could it actually be that the
> > reason is that sem_post/wait aren't proper memory barriers? On a glance
> > the symptoms look like values have been modified without proper locks...
>
> Hmm, possible ...
Hm. After a long battle of head vs. wall I think I see what the problem
is. For the fallback atomics implementation I somehow had assumed that
pg_atomic_write_u32() doesn't need to lock, as it's just an unlocked
write. But that's not true, because it has to cause
pg_atomic_compare_exchange_u32 to fail. The lack of this can cause a
"leftover" lockbit, when UnlockBufHdr() occurs concurrently an operation
using compare_exchange.
For me the problem often takes a lot longer to reproduce (once only
after 40min), could you run with the attached patch, and see whether
that fixes things for you?
Andres
| Attachment | Content-Type | Size |
|---|---|---|
| atomic-write-fallback.patch | text/x-patch | 1.4 KB |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Tom Lane | 2016-10-06 03:04:50 | pgsql: Remove -Wl,-undefined,dynamic_lookup in macOS build. |
| Previous Message | Andres Freund | 2016-10-06 01:48:24 | Re: Hash tables in dynamic shared memory |