Re: Incorrect result of bitmap heap scan. - Mailing list pgsql-hackers

From Andres Freund
Subject Re: Incorrect result of bitmap heap scan.
Date
Msg-id 7fpu7ykfmqhkbak44ul4mruavivuvekvzn5nqhqt2hjtkhimba@i5qeftkbap72
Whole thread Raw
In response to Re: Incorrect result of bitmap heap scan.  (Matthias van de Meent <boekewurm+postgres@gmail.com>)
Responses Re: Incorrect result of bitmap heap scan.
Re: Incorrect result of bitmap heap scan.
List pgsql-hackers
Hi,

On 2025-04-02 18:58:11 +0200, Matthias van de Meent wrote:
> And here it is, v6 of the patchset, rebased up to master @ 764d501d.

Thanks!

Does anybody have an opinion about how non-invasive to be in the
back-branches? The minimal version is something like this diff:

diff --git i/src/backend/executor/nodeBitmapHeapscan.c w/src/backend/executor/nodeBitmapHeapscan.c
index 6b48a6d8350..0bd8b3404c1 100644
--- i/src/backend/executor/nodeBitmapHeapscan.c
+++ w/src/backend/executor/nodeBitmapHeapscan.c
@@ -187,6 +187,19 @@ BitmapHeapNext(BitmapHeapScanState *node)
         {
             bool        need_tuples = false;

+            /*
+             * Unfortunately it turns out that the below optimization does not
+             * take the removal of TIDs by a concurrent vacuum into
+             * account. The concurrent vacuum can remove dead TIDs and make
+             * pages ALL_VISIBLE while those dead TIDs are referenced in the
+             * bitmap. This would lead to a !need_tuples scan returning too
+             * many tuples.
+             *
+             * In the back-branches, we therefore simply disable the
+             * optimization. Removing all the relevant code would be too
+             * invasive (and a major backpatching pain).
+             */
+#ifdef NOT_ANYMORE
             /*
              * We can potentially skip fetching heap pages if we do not need
              * any columns of the table, either for checking non-indexable
@@ -197,6 +210,7 @@ BitmapHeapNext(BitmapHeapScanState *node)
              */
             need_tuples = (node->ss.ps.plan->qual != NIL ||
                            node->ss.ps.plan->targetlist != NIL);
+#endif


But it seems a bit weird to continue checking SO_NEED_TUPLES (which is what
need_tuples ends up as) in other parts of the code. But it's less work to
backpatch that way.  Obviously we can't remove the relevant struct fields in
the backbranches.



Other notes:

- Should we commit the test showing that the naive implementation of
  index-only-bitmap-heapscan is broken, in case somebody wants to re-introduce
  it?

  If so, I think we should not backpatch the test. If it turns out to not be
  stable, it's a pain to fix test stability issues over multiple branches.

  And the patch would need a bit of comment editing to explain that, easy
  enough.


- We have some superfluous includes in nodeBitmapHeapscan.c - but I think
  that's not actually the fault of this patch. Seems the read-stream patch
  should have removed the at least the includes of visibilitymap.h, bufmgr.h
  and spccache.h?  And b09ff53667f math.h...


Greetings,

Andres Freund



pgsql-hackers by date:

Previous
From: Jeff Davis
Date:
Subject: Re: statistics import and export: another difference in dump/restore
Next
From: Tom Lane
Date:
Subject: Re: SQLFunctionCache and generic plans