Missing extension locks in the nbtree code - Mailing list pgsql-hackers

From Andres Freund
Subject Missing extension locks in the nbtree code
Date
Msg-id 20150706212112.GC30359@alap3.anarazel.de
Responses Re: Missing extension locks in the nbtree code  (Andres Freund <andres@anarazel.de>)
List pgsql-hackers
Hi,

There have recently been too many reports of "unexpected data beyond
EOF in block %u of relation %s" for me to still think that it's likely
to be caused by a kernel bug. It's now been reproduced on at least
somewhat recent Linux and FreeBSD versions.

So I started looking around for causes. Not for the first time.

One, probably harmless, thing is that _bt_getroot() creates the initial
root page without an extension lock. That's not pretty, but it should
only happen on the first write and be safe due to the content lock on
the metapage. ISTM we should still not do that, but it's probably not
the explanation.

The fix is just to change
    if (fd == -1 || XLByteInSeg(change->lsn, curOpenSegNo))
into
    if (fd == -1 || !XLByteInSeg(change->lsn, curOpenSegNo))

The bug doesn't have any correctness implications afaics, just
performance. We read all the spilled files till the end, so even a change
spilled to the wrong segment gets picked up.
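
To make the effect of the inverted test a bit more concrete, here is a rough standalone sketch. It is not the PostgreSQL code: SEG_SIZE, seg_of(), in_seg(), SpillStats and simulate_spill() are made-up stand-ins for the segment arithmetic and the spill loop, and I'm assuming the body of the "if" simply reopens the file for the segment containing the current change. It only counts how often a file would be (re)opened and how many changes would land in a file belonging to another segment.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, tiny segment size so the numbers stay readable. */
#define SEG_SIZE 16u

typedef struct SpillStats
{
    int opens;      /* how many times a spill file was (re)opened */
    int misplaced;  /* changes written to a file for another segment */
} SpillStats;

/* Simplified stand-ins for the segment lookup and the in-segment check. */
static uint64_t seg_of(uint64_t lsn) { return lsn / SEG_SIZE; }
static bool in_seg(uint64_t lsn, uint64_t segno) { return seg_of(lsn) == segno; }

/*
 * Walk the changes in order. fixed == true uses the negated check,
 * fixed == false uses the check as it was before the fix.
 */
static SpillStats
simulate_spill(const uint64_t *lsns, size_t n, bool fixed)
{
    SpillStats st = {0, 0};
    int fd = -1;            /* -1 means "no spill file open yet" */
    uint64_t cur_seg = 0;

    assert(lsns != NULL || n == 0);

    for (size_t i = 0; i < n; i++)
    {
        bool same = in_seg(lsns[i], cur_seg);
        bool reopen = (fd == -1) || (fixed ? !same : same);

        if (reopen)
        {
            /* Assumed behaviour: switch to the segment of this change. */
            cur_seg = seg_of(lsns[i]);
            fd = 1;         /* pretend the open succeeded */
            st.opens++;
        }

        /* The change goes to whatever file is currently open. */
        if (!in_seg(lsns[i], cur_seg))
            st.misplaced++;
    }
    return st;
}
```

With the unfixed check, every change that is in the current segment triggers a reopen, and a change in a later segment keeps going to the old file. Under the assumptions above that is wasteful rather than incorrect, since all spilled files are read to the end anyway.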

Greetings,

Andres Freund


