Re: nbtree: assertion failure in _bt_killitems() for posting tuple - Mailing list pgsql-hackers

From Peter Geoghegan
Subject Re: nbtree: assertion failure in _bt_killitems() for posting tuple
Date
Msg-id CAH2-Wz=BkEYkfYd2RuECmjgVnR8KVouEhfGV-30s7PGUtDU6mQ@mail.gmail.com
Whole thread Raw
In response to Re: nbtree: assertion failure in _bt_killitems() for posting tuple  (Anastasia Lubennikova <a.lubennikova@postgrespro.ru>)
Responses Re: nbtree: assertion failure in _bt_killitems() for posting tuple  (Peter Geoghegan <pg@bowt.ie>)
List pgsql-hackers
On Tue, Mar 24, 2020 at 1:00 AM Anastasia Lubennikova
<a.lubennikova@postgrespro.ru> wrote:
> > I think you're right. However, it still seems like we should check
> > that "kitem->indexOffset" is consistent among all of the BTScanPosItem
> > entries that we have for each TID that we believe to be from the same
> > posting list tuple.

The assertion failure happens in the logical replication worker
because it uses a dirty snapshot, which cannot release the pin per
commit 2ed5b87f. This means that the leaf page can change between the
time that we observe an item is dead, and the time we reach
_bt_killitems(), even though _bt_killitems() does get to kill items.

I am thinking about pushing a fix along the lines of the attached
patch. This preserves the assertion, while avoiding the check in cases
where it doesn't apply, such as when a dirty snapshot is in use.

-- 
Peter Geoghegan

Attachment

pgsql-hackers by date:

Previous
From: Andres Freund
Date:
Subject: Re: CLOBBER_CACHE_ALWAYS regression instability
Next
From: Kyotaro Horiguchi
Date:
Subject: Re: [HACKERS] WAL logging problem in 9.4.3?