Thread: visibilitymap_set and checksums
Hi, while thinking about vacuum freeze I noticed that since the checksums patch visibilitymap_set() does:/* * If data checksums are enabled, we need to protect the heap * page from being torn. */if(DataChecksumsEnabled()){ Page heapPage = BufferGetPage(heapBuf); /* caller is expected to set PD_ALL_VISIBLE first */ Assert(PageIsAllVisible(heapPage)); PageSetLSN(heapPage, recptr);} That pattern looks dangerous. Setting the lsn of the heap page will prevent the next action from doing a FPI even if it would be required. Its e.g. called like this from lazy_scan_heap: if (all_visible && !all_visible_according_to_vm){ /* * It should never be the case that the visibility map page isset * while the page-level bit is clear, but the reverse is allowed * (if checksums are not enabled). Regardless,set the both bits * so that we get back in sync. * * NB: If the heap page is all-visible but the VMbit is not set, * we don't need to dirty the heap page. However, if checksums are * enabled, we do need to makesure that the heap page is dirtied * before passing it to visibilitymap_set(), because it may be * logged. Giventhat this situation should only happen in rare * cases after a crash, it is not worth optimizing. */ PageSetAllVisible(page); MarkBufferDirty(buf); visibilitymap_set(onerel, blkno, buf, InvalidXLogRecPtr, vmbuffer, visibility_cutoff_xid);} other callers look similarly dangerous. Greetings, Andres Freund -- Andres Freund http://www.2ndQuadrant.com/PostgreSQL Development, 24x7 Support, Training & Services
On 24 May 2013 18:40, Andres Freund <andres@2ndquadrant.com> wrote: > That pattern looks dangerous. Setting the lsn of the heap page will > prevent the next action from doing a FPI even if it would be required. Can you be more specific about the danger you see? --Simon Riggs http://www.2ndQuadrant.com/PostgreSQL Development, 24x7 Support, Training & Services
On 2013-05-24 19:09:57 +0100, Simon Riggs wrote: > On 24 May 2013 18:40, Andres Freund <andres@2ndquadrant.com> wrote: > > > That pattern looks dangerous. Setting the lsn of the heap page will > > prevent the next action from doing a FPI even if it would be required. > > Can you be more specific about the danger you see? CHECKPOINT at lsn 0/10; vacuum starts vacuum finds page which is all visible vacuum sets all_visible PageSetAllVisible(page); MarkBufferDirty(buf); visibilitymap_set(onerel, blkno, buf, InvalidXLogRecPtr, vmbuffer, visibility_cutoff_xid); recptr = log_heap_visible(rel->rd_node, heapBuf,vmBuf, cutoff_xid); if (DataChecksumsEnabled()) PageSetLSN(heapPage,recptr); So at this point the *heap* page will have the lsn of the xl_heap_visible record. Which I thought to be rather dangerous because I somewow missed the fact that log_heap_visible does:if (DataChecksumsEnabled()){ rdata[1].next = &(rdata[2]); rdata[2].data = NULL; rdata[2].len = 0; rdata[2].buffer = heap_buffer; rdata[2].buffer_std = true; rdata[2].next= NULL;} So. Forget what I said, I just was confused. Greetings, Andres Freund -- Andres Freund http://www.2ndQuadrant.com/PostgreSQL Development, 24x7 Support, Training & Services
On 24 May 2013 20:26, Andres Freund <andres@2ndquadrant.com> wrote: > On 2013-05-24 19:09:57 +0100, Simon Riggs wrote: >> On 24 May 2013 18:40, Andres Freund <andres@2ndquadrant.com> wrote: >> >> > That pattern looks dangerous. Setting the lsn of the heap page will >> > prevent the next action from doing a FPI even if it would be required. >> >> Can you be more specific about the danger you see? > > CHECKPOINT at lsn 0/10; > vacuum starts > vacuum finds page which is all visible > vacuum sets all_visible > PageSetAllVisible(page); > MarkBufferDirty(buf); > visibilitymap_set(onerel, blkno, buf, InvalidXLogRecPtr, > vmbuffer, visibility_cutoff_xid); > recptr = log_heap_visible(rel->rd_node, heapBuf, vmBuf, > cutoff_xid); > if (DataChecksumsEnabled()) > PageSetLSN(heapPage, recptr); > > So at this point the *heap* page will have the lsn of the > xl_heap_visible record. Which I thought to be rather dangerous because I > somewow missed the fact that log_heap_visible does: > if (DataChecksumsEnabled()) > { > rdata[1].next = &(rdata[2]); > > rdata[2].data = NULL; > rdata[2].len = 0; > rdata[2].buffer = heap_buffer; > rdata[2].buffer_std = true; > rdata[2].next = NULL; > } > > So. Forget what I said, I just was confused. I think its perfectly understandable. Robert, Jeff and I discussed that for a while before we passed it. I'm still not happy with it, and think its a pretty confusing section of code with multiple paths through it, but I just can't see a better way. --Simon Riggs http://www.2ndQuadrant.com/PostgreSQL Development, 24x7 Support, Training & Services
On Fri, 2013-05-24 at 22:16 +0100, Simon Riggs wrote: > I think its perfectly understandable. Robert, Jeff and I discussed > that for a while before we passed it. I'm still not happy with it, and > think its a pretty confusing section of code with multiple paths > through it, but I just can't see a better way. Agreed on all counts. Comment patches are welcome, of course. I'll add that if we remove PD_ALL_VISIBLE, this complexity disappears. Regards,Jeff Davis