Thread: Amcheck verification of GiST and GIN

Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

30 May 2022, 09:40:06

Hello world!

Few years ago we had a thread with $subj [0]. A year ago Heikki put a lot of effort in improving GIN checks [1] while
huntinga GIN bug. 
And in view of some releases with a recommendation to reindex anything that fails or lacks amcheck verification, I
decidedthat I want to review the thread. 

PFA $subj incorporating all Heikki's improvements and restored GiST checks. Also I've added heapallindexed verification
forGiST. I'm sure that we must add it for GIN too. Yet I do not know how to implement it. Maybe just check that every
entrygenerated from heap present in entry tree? Or that every tids is present in the index? 

GiST verification does parent check despite taking only AccessShareLock. It's possible because when the key discrepancy
isfound we acquire parent tuple with lock coupling. I'm sure that this is correct to check keys this way. And I'm
almostsure it will not deadlock, because split is doing the same locking. 

What do you think?

Best regards, Andrey Borodin.

[0] https://www.postgresql.org/message-id/flat/CAF3eApa07-BajjG8%2BRYx-Dr_cq28ZA0GsZmUQrGu5b2ayRhB5A%40mail.gmail.com
[1]
https://www.postgresql.org/message-id/flat/9fdbb584-1e10-6a55-ecc2-9ba8b5dca1cf%40iki.fi#fec2751faf1ca52495b0a61acc0f5532

Attachment

v10-0001-Amcheck-for-GIN-and-GiST.patch

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

22 June 2022, 17:40:56

> On 30 May 2022, at 12:40, Andrey Borodin <x4mmm@yandex-team.ru> wrote:
>
> What do you think?

Hi Andrey!

Here's a version with better tests. I've made sure that GiST tests actually trigger page reuse after deletion. And
enhancedcomments in both GiST and GIN test scripts. I hope you'll like it. 

GIN heapallindexed is still a no-op check. Looking forward to hear any ideas on what it could be.

Best regards, Andrey Borodin.

Attachment

v11-0001-Amcheck-for-GIN-and-GiST.patch

Re: Amcheck verification of GiST and GIN

From

Nikolay Samokhvalov

Date:

22 June 2022, 19:27:25

On Wed, Jun 22, 2022 at 11:35 AM Andrey Borodin <x4mmm@yandex-team.ru> wrote:

> On 30 May 2022, at 12:40, Andrey Borodin <x4mmm@yandex-team.ru> wrote:
>
> What do you think?

Hi Andrey!

Hi Andrey!

Since you're talking to yourself, just wanted to support you – this is an important thing, definitely should be very useful for many projects; I hope to find time to test it in the next few days.

Thanks for working on it.

Re: Amcheck verification of GiST and GIN

From

Andres Freund

Date:

22 June 2022, 23:29:12

Hi,

I think having amcheck for more indexes is great.

On 2022-06-22 20:40:56 +0300, Andrey Borodin wrote:
>> diff --git a/contrib/amcheck/amcheck.c b/contrib/amcheck/amcheck.c
> new file mode 100644
> index 0000000000..7a222719dd
> --- /dev/null
> +++ b/contrib/amcheck/amcheck.c
> @@ -0,0 +1,187 @@
> +/*-------------------------------------------------------------------------
> + *
> + * amcheck.c
> + *        Utility functions common to all access methods.

This'd likely be easier to read if the reorganization were split into its own
commit.

I'd also split gin / gist support. It's a large enough patch that that imo
makes reviewing easier.


> +void amcheck_lock_relation_and_check(Oid indrelid, IndexCheckableCallback checkable,
> +                                                IndexDoCheckCallback check, LOCKMODE lockmode, void *state)

Might be worth pgindenting - the void for function definitions (but not for
declarations) is typically on its own line in PG code.


> +static GistCheckState
> +gist_init_heapallindexed(Relation rel)
> +{
> +    int64        total_pages;
> +    int64        total_elems;
> +    uint64        seed;
> +    GistCheckState result;
> +
> +    /*
> +    * Size Bloom filter based on estimated number of tuples in index
> +    */
> +    total_pages = RelationGetNumberOfBlocks(rel);
> +    total_elems = Max(total_pages * (MaxOffsetNumber / 5),
> +                        (int64) rel->rd_rel->reltuples);
> +    /* Generate a random seed to avoid repetition */
> +    seed = pg_prng_uint64(&pg_global_prng_state);
> +    /* Create Bloom filter to fingerprint index */
> +    result.filter = bloom_create(total_elems, maintenance_work_mem, seed);
> +
> +    /*
> +     * Register our own snapshot
> +     */
> +    result.snapshot = RegisterSnapshot(GetTransactionSnapshot());

FWIW, comments like this, that just restate exactly what the code does, are
imo not helpful.  Also, there's a trailing space :)


Greetings,

Andres Freund

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

25 June 2022, 19:10:11


> On 23 Jun 2022, at 00:27, Nikolay Samokhvalov <samokhvalov@gmail.com> wrote:
>
> Since you're talking to yourself, just wanted to support you – this is an important thing, definitely should be very
usefulfor many projects; I hope to find time to test it in the next few days.  

Thanks Nikolay!


> On 23 Jun 2022, at 04:29, Andres Freund <andres@anarazel.de> wrote:
Thanks for looking into the patch, Andres!

> On 2022-06-22 20:40:56 +0300, Andrey Borodin wrote:
>>> diff --git a/contrib/amcheck/amcheck.c b/contrib/amcheck/amcheck.c
>> new file mode 100644
>> index 0000000000..7a222719dd
>> --- /dev/null
>> +++ b/contrib/amcheck/amcheck.c
>> @@ -0,0 +1,187 @@
>> +/*-------------------------------------------------------------------------
>> + *
>> + * amcheck.c
>> + *        Utility functions common to all access methods.
>
> This'd likely be easier to read if the reorganization were split into its own
> commit.
>
> I'd also split gin / gist support. It's a large enough patch that that imo
> makes reviewing easier.
I will split the patch in 3 steps:
1. extract generic functions to amcheck.c
2. add gist functions
3. add gin functions
But each this step is just adding few independent files + some lines to Makefile.

I'll fix other notes too in the next version.

Thanks!

Best regards, Andrey Borodin.

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

23 July 2022, 09:40:44


> On 26 Jun 2022, at 00:10, Andrey Borodin <x4mmm@yandex-team.ru> wrote:
> 
> I will split the patch in 3 steps:
> 1. extract generic functions to amcheck.c
> 2. add gist functions
> 3. add gin functions
> 
> I'll fix other notes too in the next version.


Done. PFA attached patchset.

Thanks!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

17 August 2022, 12:28:02

> On 23 Jul 2022, at 14:40, Andrey Borodin <x4mmm@yandex-team.ru> wrote:
>
> Done. PFA attached patchset.
>
> Best regards, Andrey Borodin.
>
<v12-0001-Refactor-amcheck-to-extract-common-locking-routi.patch><v12-0002-Add-gist_index_parent_check-function-to-verify-G.patch><v12-0003-Add-gin_index_parent_check-to-verify-GIN-index.patch>

Here's v13. Changes:
1. Fixed passing through downlink in GIN index
2. Fixed GIN tests (one test case was not working)

Thanks to Vitaliy Kukharik for trying this patches.

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andres Freund

Date:

22 September 2022, 15:19:09

Hi,

On 2022-08-17 17:28:02 +0500, Andrey Borodin wrote:
> Here's v13. Changes:
> 1. Fixed passing through downlink in GIN index
> 2. Fixed GIN tests (one test case was not working)
> 
> Thanks to Vitaliy Kukharik for trying this patches.

Due to the merge of the meson based build, this patch needs to be
adjusted. See
https://cirrus-ci.com/build/6637154947301376

The changes should be fairly simple, just mirroring the Makefile ones.

Greetings,

Andres Freund

Re: Amcheck verification of GiST and GIN

From

Andres Freund

Date:

02 October 2022, 07:12:30

Hi,

On 2022-09-22 08:19:09 -0700, Andres Freund wrote:
> Hi,
> 
> On 2022-08-17 17:28:02 +0500, Andrey Borodin wrote:
> > Here's v13. Changes:
> > 1. Fixed passing through downlink in GIN index
> > 2. Fixed GIN tests (one test case was not working)
> > 
> > Thanks to Vitaliy Kukharik for trying this patches.
> 
> Due to the merge of the meson based build, this patch needs to be
> adjusted. See
> https://cirrus-ci.com/build/6637154947301376
> 
> The changes should be fairly simple, just mirroring the Makefile ones.

Here's an updated patch adding meson compat.

I didn't fix the following warnings:

[25/28 3  89%] Compiling C object contrib/amcheck/amcheck.dll.p/amcheck.c.obj
../../home/andres/src/postgresql/contrib/amcheck/amcheck.c: In function ‘amcheck_lock_relation_and_check’:
../../home/andres/src/postgresql/contrib/amcheck/amcheck.c:81:20: warning: implicit declaration of function
‘NewGUCNestLevel’[-Wimplicit-function-declaration]
 
   81 |   save_nestlevel = NewGUCNestLevel();
      |                    ^~~~~~~~~~~~~~~
../../home/andres/src/postgresql/contrib/amcheck/amcheck.c:124:2: warning: implicit declaration of function
‘AtEOXact_GUC’;did you mean ‘AtEOXact_SMgr’? [-Wimplicit-function-declaration]
 
  124 |  AtEOXact_GUC(false, save_nestlevel);
      |  ^~~~~~~~~~~~
      |  AtEOXact_SMgr
[26/28 2  92%] Compiling C object contrib/amcheck/amcheck.dll.p/verify_gin.c.obj
../../home/andres/src/postgresql/contrib/amcheck/verify_gin.c: In function ‘gin_check_parent_keys_consistency’:
../../home/andres/src/postgresql/contrib/amcheck/verify_gin.c:423:8: warning: unused variable ‘heapallindexed’
[-Wunused-variable]
  423 |  bool  heapallindexed = *((bool*)callback_state);
      |        ^~~~~~~~~~~~~~
[28/28 1 100%] Linking target contrib/amcheck/amcheck.dll


Greetings,

Andres Freund

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrew Borodin

Date:

08 October 2022, 22:36:52

On Sun, Oct 2, 2022 at 12:12 AM Andres Freund <andres@anarazel.de> wrote:
>
> Here's an updated patch adding meson compat.

Thank you, Andres! Here's one more rebase (something was adjusted in
amcheck build).
Also I've fixed new warnings except warning about absent
heapallindexed for GIN. It's a TODO.

Thanks!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Jose Arthur Benetasso Villanova

Date:

25 November 2022, 02:04:37

Hello.

I reviewed this patch and I would like to share some comments.

It compiled with those 2 warnings:

verify_gin.c: In function 'gin_check_parent_keys_consistency':
verify_gin.c:481:38: warning: declaration of 'maxoff' shadows a previous 
local [-Wshadow=compatible-local]
   481 |                         OffsetNumber maxoff = 
PageGetMaxOffsetNumber(page);
       |                                      ^~~~~~
verify_gin.c:453:41: note: shadowed declaration is here
   453 |                                         maxoff;
       |                                         ^~~~~~
verify_gin.c:423:25: warning: unused variable 'heapallindexed' 
[-Wunused-variable]
   423 |         bool            heapallindexed = *((bool*)callback_state);
       |                         ^~~~~~~~~~~~~~


Also, I'm not sure about postgres' headers conventions, inside amcheck.h, 
there is "miscadmin.h" included, and inside verify_gin.c, verify_gist.h 
and verify_nbtree.c both amcheck.h and miscadmin.h are included.

About the documentation, the bt_index_parent_check has comments about the 
ShareLock and "SET client_min_messages = DEBUG1;", and both 
gist_index_parent_check and gin_index_parent_check lack it. verify_gin 
uses DEBUG3, I'm not sure if it is on purpose, but it would be nice to 
document it or put DEBUG1 to be consistent.

I lack enough context to do a deep review on the code, so in this area 
this patch needs more eyes.

I did the following test:

postgres=# create table teste (t text, tv tsvector);
CREATE TABLE
postgres=# insert into teste values ('hello', 'hello'::tsvector);
INSERT 0 1
postgres=# create index teste_tv on teste using gist(tv);
CREATE INDEX
postgres=# select pg_relation_filepath('teste_tv');
  pg_relation_filepath
----------------------
  base/5/16441
(1 row)

postgres=#
\q
$ bin/pg_ctl -D data -l log
waiting for server to shut down.... done
server stopped
$ okteta base/5/16441 # I couldn't figure out the dd syntax to change the 
1FE9 to '0'
$ bin/pg_ctl -D data -l log
waiting for server to start.... done
server started
$ bin/psql -U ze postgres
psql (16devel)
Type "help" for help.

postgres=# SET client_min_messages = DEBUG3;
SET
postgres=# select gist_index_parent_check('teste_tv'::regclass, true);
DEBUG:  verifying that tuples from index "teste_tv" are present in "teste"
ERROR:  heap tuple (0,1) from table "teste" lacks matching index tuple 
within index "teste_tv"
postgres=#

A simple index corruption in gin:

postgres=# CREATE TABLE "gin_check"("Column1" int[]);
CREATE TABLE
postgres=# insert into gin_check values (ARRAY[1]),(ARRAY[2]);
INSERT 0 2
postgres=# CREATE INDEX gin_check_idx on "gin_check" USING GIN("Column1");
CREATE INDEX
postgres=# select pg_relation_filepath('gin_check_idx');
  pg_relation_filepath
----------------------
  base/5/16453
(1 row)

postgres=#
\q
$ bin/pg_ctl -D data -l logfile stop
waiting for server to shut down.... done
server stopped
$ okteta data/base/5/16453 # edited some bits near 3FCC
$ bin/pg_ctl -D data -l logfile start
waiting for server to start.... done
server started
$ bin/psql -U ze postgres
psql (16devel)
Type "help" for help.

postgres=# SET client_min_messages = DEBUG3;
SET
postgres=# SELECT gin_index_parent_check('gin_check_idx', true);
ERROR:  number of items mismatch in GIN entry tuple, 49 in tuple header, 1 
decoded
postgres=#

There are more code paths to follow to check the entire code, and I had a 
hard time to corrupt the indices. Is there any automated code to corrupt 
index to test such code?


--
Jose Arthur Benetasso Villanova

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

27 November 2022, 21:29:18

Hello!

Thank you for the review!

On Thu, Nov 24, 2022 at 6:04 PM Jose Arthur Benetasso Villanova
<jose.arthur@gmail.com> wrote:
>
> It compiled with those 2 warnings:
>
> verify_gin.c: In function 'gin_check_parent_keys_consistency':
> verify_gin.c:481:38: warning: declaration of 'maxoff' shadows a previous
> local [-Wshadow=compatible-local]
>    481 |                         OffsetNumber maxoff =
> PageGetMaxOffsetNumber(page);
>        |                                      ^~~~~~
> verify_gin.c:453:41: note: shadowed declaration is here
>    453 |                                         maxoff;
>        |                                         ^~~~~~
> verify_gin.c:423:25: warning: unused variable 'heapallindexed'
> [-Wunused-variable]

Fixed.

>    423 |         bool            heapallindexed = *((bool*)callback_state);
>        |                         ^~~~~~~~~~~~~~
>

This one is in progress yet, heapallindexed check is not implemented yet...

>
> Also, I'm not sure about postgres' headers conventions, inside amcheck.h,
> there is "miscadmin.h" included, and inside verify_gin.c, verify_gist.h
> and verify_nbtree.c both amcheck.h and miscadmin.h are included.
Fixed.

>
> About the documentation, the bt_index_parent_check has comments about the
> ShareLock and "SET client_min_messages = DEBUG1;", and both
> gist_index_parent_check and gin_index_parent_check lack it. verify_gin
> uses DEBUG3, I'm not sure if it is on purpose, but it would be nice to
> document it or put DEBUG1 to be consistent.
GiST and GIN verifications do not take ShareLock for parent checks.
B-tree check cannot verify cross-level invariants between levels when
the index is changing.

GiST verification checks only one invariant that can be verified if
page locks acquired the same way as page split does.
GIN does not require ShareLock because it does not check cross-level invariants.

Reporting progress with DEBUG1 is a good idea, I did not know that
this feature exists. I'll implement something similar in following
versions.

> I did the following test:

Cool! Thank you!

>
> There are more code paths to follow to check the entire code, and I had a
> hard time to corrupt the indices. Is there any automated code to corrupt
> index to test such code?
>

Heapam tests do this in an automated way, look into this file
t/001_verify_heapam.pl.
Surely we can write these tests. At least automate what you have just
done in the review. However, committing similar checks is a very
tedious work: something will inevitably turn buildfarm red as a
watermelon.

I hope I'll post a version with DEBUG1 reporting and heapallindexed soon.
PFA current state.
Thank you for looking into this!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

28 November 2022, 01:07:40

On Sun, Nov 27, 2022 at 1:29 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>
> GiST verification checks only one invariant that can be verified if
> page locks acquired the same way as page split does.
> GIN does not require ShareLock because it does not check cross-level invariants.
>

I was wrong. GIN check does similar gin_refind_parent() to lock pages
in bottom-up manner and truly verify downlink-child_page invariant.

Here's v17. The only difference is that I added progress reporting to
GiST verification.
I still did not implement heapallindexed for GIN. Existence of pending
lists makes this just too difficult for a weekend coding project :(

Thank you!

Best regards, Andrey Borodin.

Hi Jose, thank you for review and sorry for so long delay to answer.

On Wed, Dec 14, 2022 at 4:19 AM Jose Arthur Benetasso Villanova
<jose.arthur@gmail.com> wrote:
>
>
> On Sun, 27 Nov 2022, Andrey Borodin wrote:
>
> > On Sun, Nov 27, 2022 at 1:29 PM Andrey Borodin <amborodin86@gmail.com> wrote:
> >>
> > I was wrong. GIN check does similar gin_refind_parent() to lock pages
> > in bottom-up manner and truly verify downlink-child_page invariant.
>
> Does this mean that we need the adjustment in docs?
It seems to me that gin_index_parent_check() docs are correct.

>
> > Here's v17. The only difference is that I added progress reporting to
> > GiST verification.
> > I still did not implement heapallindexed for GIN. Existence of pending
> > lists makes this just too difficult for a weekend coding project :(
> >
> > Thank you!
> >
> > Best regards, Andrey Borodin.
> >
>
> I'm a bit lost here. I tried your patch again and indeed the
> heapallindexed inside gin_check_parent_keys_consistency has a TODO
> comment, but it's unclear to me if you are going to implement it or if the
> patch "needs review". Right now it's "Waiting on Author".
>

Please find the attached new version. In this patchset heapallindexed
flag is removed from GIN checks.

Thank you!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

09 January 2023, 04:08:05

On Sun, Jan 8, 2023 at 8:05 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>
> Please find the attached new version. In this patchset heapallindexed
> flag is removed from GIN checks.
>
Uh... sorry, git-formatted wrong branch.
Here's the correct version. Double checked.

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Jose Arthur Benetasso Villanova

Date:

13 January 2023, 11:46:45

On Sun, 8 Jan 2023, Andrey Borodin wrote:

> On Sun, Jan 8, 2023 at 8:05 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>>
>> Please find the attached new version. In this patchset heapallindexed
>> flag is removed from GIN checks.
>>
> Uh... sorry, git-formatted wrong branch.
> Here's the correct version. Double checked.
>

Hello again.

I applied the patch without errors / warnings and did the same tests. All 
working as expected.

The only thing that I found is the gin_index_parent_check function in docs 
still references the "gin_index_parent_check(index regclass, 
heapallindexed boolean) returns void"

--
Jose Arthur Benetasso Villanova

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

14 January 2023, 00:18:23

On Fri, Jan 13, 2023 at 3:46 AM Jose Arthur Benetasso Villanova
<jose.arthur@gmail.com> wrote:
>
> The only thing that I found is the gin_index_parent_check function in docs
> still references the "gin_index_parent_check(index regclass,
> heapallindexed boolean) returns void"
>

Correct! Please find the attached fixed version.

Thank you!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Jose Arthur Benetasso Villanova

Date:

14 January 2023, 03:34:38

On Fri, 13 Jan 2023, Andrey Borodin wrote:

> On Fri, Jan 13, 2023 at 3:46 AM Jose Arthur Benetasso Villanova
> <jose.arthur@gmail.com> wrote:
>>
>> The only thing that I found is the gin_index_parent_check function in docs
>> still references the "gin_index_parent_check(index regclass,
>> heapallindexed boolean) returns void"
>>
>
> Correct! Please find the attached fixed version.
>
> Thank you!
>
> Best regards, Andrey Borodin.
>

Hello again. I see the change. Thanks

--
Jose Arthur Benetasso Villanova

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

14 January 2023, 04:14:46

On Fri, Jan 13, 2023 at 7:35 PM Jose Arthur Benetasso Villanova
<jose.arthur@gmail.com> wrote:
>
> Hello again. I see the change. Thanks
>

Thanks! I also found out that there was a CI complaint about amcheck.h
not including some necessary stuff. Here's a version with a fix for
that.

Best regards, Andrey Borodin.

On Thu, Feb 2, 2023 at 12:15 PM Peter Geoghegan <pg@bowt.ie> wrote:

On Thu, Feb 2, 2023 at 11:51 AM Peter Geoghegan <pg@bowt.ie> wrote:

...

Admittedly there is some value in seeing multiple WARNINGs to true
experts that are performing some kind of forensic analysis, but that
doesn't seem worth it to me -- I'm an expert, and I don't think that
I'd do it this way for any reason other than it being more convenient
as a way to get information about a system that I don't have access
to. Even then, I think that I'd probably have serious doubts about
most of the extra information that I'd get, since it might very well
be a downstream consequence of the same basic problem.

...

I understand your thoughts (I think) and agree with them, but at least one

scenario where I do want to see *all* errors is corruption prevention – running

amcheck in lower environments, not in production, to predict and prevent issues.

For example, not long ago, Ubuntu 16.04 became EOL (in phases), and people

needed to upgrade, with glibc version change. It was quite good to use amcheck

on production clones (running on a new OS/glibc) to identify all indexes that

need to be rebuilt. Being able to see only one of them would be very

inconvenient. Rebuilding all indexes didn't seem a good idea in the case of

large databases.

Re: Amcheck verification of GiST and GIN

From

Peter Geoghegan

Date:

02 February 2023, 20:42:52

On Thu, Feb 2, 2023 at 12:31 PM Nikolay Samokhvalov
<samokhvalov@gmail.com> wrote:
> I understand your thoughts (I think) and agree with them, but at least one
> scenario where I do want to see *all* errors is corruption prevention – running
> amcheck in lower environments, not in production, to predict and prevent issues.
> For example, not long ago, Ubuntu 16.04 became EOL (in phases), and people
> needed to upgrade, with glibc version change. It was quite good to use amcheck
> on production clones (running on a new OS/glibc) to identify all indexes that
> need to be rebuilt. Being able to see only one of them would be very
> inconvenient. Rebuilding all indexes didn't seem a good idea in the case of
> large databases.

I agree that this matters at the level of whole indexes. That is, if
you want to check every index in the database, it is unhelpful if the
whole process stops just because one individual index has corruption.
Any extra information about the index that is corrupt may not be all
that valuable, but information about other indexes remains almost as
valuable.

I think that that problem should be solved at a higher level, in the
program that runs amcheck. Note that pg_amcheck will already do this
for B-Tree indexes. While verify_nbtree.c won't try to limp on with an
index that is known to be corrupt, pg_amcheck will continue with other
indexes.

We should add a "Tip" to the amcheck documentation on 14+ about this.
We should clearly advise users that they should probably just use
pg_amcheck. Using the SQL interface directly should now mostly be
something that only a tiny minority of experts need to do -- and even
the experts won't do it that way unless they have a good reason to.

--
Peter Geoghegan

Re: Amcheck verification of GiST and GIN

From

Nikolay Samokhvalov

Date:

02 February 2023, 20:56:47

On Thu, Feb 2, 2023 at 12:43 PM Peter Geoghegan <pg@bowt.ie> wrote:

I agree that this matters at the level of whole indexes.

I already realized my mistake – indeed, having multiple errors for 1 index

doesn't seem to be super practically helpful.

I think that that problem should be solved at a higher level, in the
program that runs amcheck. Note that pg_amcheck will already do this
for B-Tree indexes.

That's a great tool, and it's great it supports parallelization, very useful

on large machines.

We should add a "Tip" to the amcheck documentation on 14+ about this.
We should clearly advise users that they should probably just use
pg_amcheck.

and with -j$N, with high $N (unless it's production)

Re: Amcheck verification of GiST and GIN

From

Peter Geoghegan

Date:

02 February 2023, 23:16:32

On Thu, Feb 2, 2023 at 12:56 PM Nikolay Samokhvalov
<samokhvalov@gmail.com> wrote:
> I already realized my mistake – indeed, having multiple errors for 1 index
> doesn't seem to be super practically helpful.

I wouldn't mind supporting it if the cost wasn't too high. But I
believe that it's not a good trade-off.

>> I think that that problem should be solved at a higher level, in the
>> program that runs amcheck. Note that pg_amcheck will already do this
>> for B-Tree indexes.
>
>
> That's a great tool, and it's great it supports parallelization, very useful
> on large machines.

Another big advantage of just using pg_amcheck is that running each
index verification in a standalone query avoids needlessly holding the
same MVCC snapshot across all indexes verified (compared to running
one big SQL query that verifies multiple indexes). As simple as
pg_amcheck's approach is (it's doing nothing that you couldn't
replicate in a shell script), in practice that its standardized
approach probably makes things a lot smoother, especially in terms of
how VACUUM is impacted.

--
Peter Geoghegan

Re: Amcheck verification of GiST and GIN

From

Peter Geoghegan

Date:

04 February 2023, 02:49:50

On Thu, Feb 2, 2023 at 12:15 PM Peter Geoghegan <pg@bowt.ie> wrote:
> * Why are there only WARNINGs, never ERRORs here?

Attached revision v22 switches all of the WARNINGs over to ERRORs. It
has also been re-indented, and now uses a non-generic version of
PageGetItemIdCareful() in both verify_gin.c and verify_gist.c.
Obviously this isn't a big set of revisions, but I thought that Andrey
would appreciate it if I posted this much now. I haven't thought much
more about the locking stuff, which is my main concern for now.

Who are the authors of the patch, in full? At some point we'll need to
get the attribution right if this is going to be committed.

I think that it would be good to add some comments explaining the high
level control flow. Is the verification process driven by a
breadth-first search, or a depth-first search, or something else?

I think that we should focus on getting the GiST patch into shape for
commit first, since that seems easier.

-- 
Peter Geoghegan

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

04 February 2023, 21:37:29

Thank for working on this, Peter!

On Fri, Feb 3, 2023 at 6:50 PM Peter Geoghegan <pg@bowt.ie> wrote:
>
> I think that we should focus on getting the GiST patch into shape for
> commit first, since that seems easier.
>

Here's the next version. I've focused on GiST part in this revision.
Changes:
1. Refactored index_chackable so that is shared between all AMs.
2. Renamed gist_index_parent_check -> gist_index_check
3. Gathered reviewers (in no particular order). I hope I didn't forget
anyone. GIN patch is based on work by Grigory Kryachko, but
essentially rewritten by Heikki. Somewhat cosmetically whacked by me.
4. Extended comments for GistScanItem,
gist_check_parent_keys_consistency() and gist_refind_parent().

I tried adding support of GiST in pg_amcheck, but it is largely
assuming the relation is either heap or B-tree. I hope to do that part
tomorrow or in nearest future.

Here's the current version. Thank you!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

06 February 2023, 00:44:53

On Sat, Feb 4, 2023 at 1:37 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>
> I tried adding support of GiST in pg_amcheck, but it is largely
> assuming the relation is either heap or B-tree. I hope to do that part
> tomorrow or in nearest future.
>

Here's v24 == (v23 + a step for pg_amcheck). There's a lot of
shotgun-style changes, but I hope next index types will be easy to add
now.

Adding Mark to cc, just in case.

Thanks!

Best regards, Andrey Borodin.

On Fri, Mar 17, 2023 at 8:40 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>
> On Thu, Mar 16, 2023 at 6:23 PM Peter Geoghegan <pg@bowt.ie> wrote:
> >
> > existence of a "same" routine hints at some confusion about "equality
> > versus equivalence" issues.
>
> Hmm...yes, actually, GiST deals with floats routinely. And there might
> be some sorts of NaNs and Infs that are equal, but not binary
> equivalent.
> I'll think more about it.
>
> gist_get_adjusted() calls "same" routine, which for type point will
> use FPeq(double A, double B). And this might be kind of a corruption
> out of the box. Because it's an epsilon-comparison, ε=1.0E-06.
> GiST might miss newly inserted data, because the "adjusted" tuple was
> "same" if data is in proximity of 0.000001 of any previously indexed
> point, but out of known MBRs.
> I'll try to reproduce this tomorrow, so far no luck.
>
After several attempts to corrupt GiST with this 0.000001 epsilon
adjustment tolerance I think GiST indexing of points is valid.
Because intersection for search purposes is determined with the same epsilon!
So it's kind of odd
postgres=# select point(0.0000001,0)~=point(0,0);
?column?
----------
 t
(1 row)
, yet the index works correctly.



On Thu, Mar 16, 2023 at 4:48 PM Peter Geoghegan <pg@bowt.ie> wrote:
>
> On Sun, Feb 5, 2023 at 4:45 PM Andrey Borodin <amborodin86@gmail.com> wrote:
> > Here's v24 == (v23 + a step for pg_amcheck). There's a lot of
> > shotgun-style changes, but I hope next index types will be easy to add
> > now.
>
> Some feedback on the GiST patch:
>
> * You forgot to initialize GistCheckState.heaptuplespresent to 0.
>
> It might be better to allocate GistCheckState dynamically, using
> palloc0(). That's future proof. "Simple and obvious" is usually the
> most important goal for managing memory in amcheck code. It can be a
> little inefficient if that makes it simpler.
Done.

> * ISTM that gist_index_check() should allow the caller to omit a
> "heapallindexed" argument by specifying "DEFAULT FALSE", for
> consistency with bt_index_check().
Done.

> * What's the point in having a custom memory context that is never reset?
The problem is we traverse index with depth-first scan and must retain
internal tuples for a whole time of the scan.
And gistgetadjusted() will allocate memory only in case of suspicion
of corruption. So, it's kind of an infrequent case.

The context is there only as an overall leak protection mechanism.
Actual memory management is done via pfree() calls.

> Again, "simple and obvious" is good for memory management in amcheck.
Yes, that would be great to come up with some "unit of work" contexts.
Yet, now palloced tuples and scan items have very different lifespans.


> * ISTM that it would be clearer if the per-page code within
> gist_check_parent_keys_consistency() was broken out into its own
> function -- a little like bt_target_page_check()..

I've refactored page logic into gist_check_page().

> * ISTM that gist_refind_parent() should throw an error about
> corruption in the event of a parent page somehow becoming a leaf page.
Done.

> * I suggest using c99 style variable declarations in loops.
Done.


On Thu, Mar 16, 2023 at 6:23 PM Peter Geoghegan <pg@bowt.ie> wrote:
>
> On Thu, Mar 16, 2023 at 4:48 PM Peter Geoghegan <pg@bowt.ie> wrote:
> > Some feedback on the GiST patch:
>
> I see that the Bloom filter that's used to implement heapallindexed
> verification fingerprints index tuples that are formed via calls to
> gistFormTuple(), without any attempt to normalize-away differences in
> TOAST input state. In other words, there is nothing like
> verify_nbtree.c's bt_normalize_tuple() function involved in the
> fingerprinting process. Why is that safe, though? See the "toast_bug"
> test case within contrib/amcheck/sql/check_btree.sql for an example of
> how inconsistent TOAST input state confused verify_nbtree.c's
> heapallindexed verification (before bugfix commit eba775345d). I'm
> concerned about GiST heapallindexed verification being buggy in
> exactly the same way, or in some way that is roughly analogous.
FWIW contrib opclasses, AFAIK, always detoast possibly long datums,
see gbt_var_compress()
https://github.com/postgres/postgres/blob/master/contrib/btree_gist/btree_utils_var.c#L281
But there might be opclasses that do not do so...
Also, there are INCLUDEd attributes. Right now we just put them as-is
to the bloom filter. Does this constitute a TOAST bug as in B-tree?
If so, I think we should use a version of tuple formatting that omits
included attributes...
What do you think?

>
> I do have some concerns about there being analogous problems that are
> unique to GiST, since GiST is an AM that gives opclass authors many
> more choices than B-Tree opclass authors have. In particular, I wonder
> if heapallindexed verification needs to account for how GiST
> compression might end up breaking heapallindexed. I refer to the
> "compression" implemented by GiST support routine 3 of GiST opclasses.
> The existence of GiST support routine 7, the "same" routine, also
> makes me feel a bit squeamish about heapallindexed verification -- the
> existence of a "same" routine hints at some confusion about "equality
> versus equivalence" issues.
>
> In more general terms: heapallindexed verification works by
> fingerprinting index tuples during the index verification stage, and
> then performing Bloom filter probes in a separate CREATE INDEX style
> heap-matches-index stage (obviously). There must be some justification
> for our assumption that there can be no false positive corruption
> reports due only to a GiST opclass (either extant or theoretical) that
> follows the GiST contract, and yet allows an inconsistency to arise
> that isn't really index corruption. This justification won't be easy
> to come up with, since the GiST contract was not really designed with
> these requirements in mind. But...we should try to come up with
> something.
>
> What are the assumptions underlying heapallindexed verification for
> GiST? It doesn't have to be provably correct or anything, but it
> should at least be empirically falsifiable. Basically, something that
> says: "Here are our assumptions, if we were wrong in making these
> assumptions then you could tell that we made a mistake because of X,
> Y, Z". It's not always clear when something is corrupt. Admittedly I
> have much less experience with GiST than other people, which likely
> includes you (Andrey). I am likely missing some context around the
> evolution of GiST. Possibly I'm making a big deal out of something
> without it being unhelpful. Unsure.
>
> Here is an example of the basic definition of correctness being
> unclear, in a bad way: Is a HOT chain corrupt when its root
> LP_REDIRECT points to an LP_DEAD item, or does that not count as
> corruption? I'm pretty sure that the answer is ambiguous even today,
> or was ambiguous until recently, at least. Hopefully the
> verify_heapam.c HOT chain verification patch will be committed,
> providing us with a clear *definition* of HOT chain corruption -- the
> definition itself may not be the easy part.

Rules for compression methods are not described anyware. And I suspect
that it's intentional, to provide more flexibility.
To make heapallindexed check work we need that opclass always returns
the same compression result for the same input datum.
All known to me opclasses (built-in and PostGIS) comply with this requirement.

Yet another behavior might be reasonable. Consider we have a
compression which learns on data. It will observe that some datums are
more frequent and start using shorter version of them.

Compression function actually is not about compression, but kind of a
conversion from heap format to indexable. Many opclasses do not have a
compression function at all.
We can require that the checked opclass would not have a compression
function at all. But GiST is mainly used for PostGIS, and in PostGIS
they use compression to convert complex geometry into a bounding box.

Method "same" is used only for a business of internal tuples, but not
for leaf tuples that we fingerprint in the bloom filter.

We can put requirements for heapallindexed in another way: "the
opclass compression method must be a pure function". It's also a very
strict requirement, disallowing all kinds of detoasting, dictionary
compression etc. And btree_gist opclasses does not comply :) But they
seem to me safe for heapallindexed.

> On a totally unrelated note: I wonder if we should be checking that
> internal page tuples have 0xffff as their offset number? Seems like
> it'd be a cheap enough cross-check.
>

Done.

Thank you!

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

26 March 2023, 22:17:02

On Sun, Mar 19, 2023 at 4:00 PM Andrey Borodin <amborodin86@gmail.com> wrote:
>
> Also, there are INCLUDEd attributes. Right now we just put them as-is
> to the bloom filter. Does this constitute a TOAST bug as in B-tree?
> If so, I think we should use a version of tuple formatting that omits
> included attributes...
> What do you think?
I've ported the B-tree TOAST test to GiST, and, as expected, it fails.
Finds non-indexed tuple for a fresh valid index.
I've implemented normalization, plz see gistFormNormalizedTuple().
But there are two problems:
1. I could not come up with a proper way to pfree() compressed value
after decompressing. See TODO in gistFormNormalizedTuple().
2. In the index tuples seem to be normalized somewhere. They do not
have to be deformed and normalized. It's not clear to me how this
happened.

Thanks!

Best regards, Andrey Borodin.

> On 6 Apr 2023, at 09:00, Alexander Lakhin <exclusion@gmail.com> wrote:
>
> I've tried to use this feature with the latest patch set and discovered that
> modified pg_amcheck doesn't find any gist indexes when running without a
> schema specification.

Thanks, Alexander! I’ve fixed this problem and rebased on current HEAD.
There’s one more problem in pg_amcheck’s GiST verification. We must check that amcheck is 1.5+ and use GiST
verificationonly in that case… 

Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

"Andrey M. Borodin"

Date:

09 July 2024, 06:36:50


> On 5 Jul 2024, at 17:27, Andrey M. Borodin <x4mmm@yandex-team.ru> wrote:
>
> There’s one more problem in pg_amcheck’s GiST verification. We must check that amcheck is 1.5+ and use GiST
verificationonly in that case… 

Done. I’ll set the status to “Needs review”.


Best regards, Andrey Borodin.

Attachment

Re: Amcheck verification of GiST and GIN

From

Tomas Vondra

Date:

10 July 2024, 16:01:40

Hi,

On 7/9/24 08:36, Andrey M. Borodin wrote:
> 
> 
>> On 5 Jul 2024, at 17:27, Andrey M. Borodin <x4mmm@yandex-team.ru> wrote:
>>
>> There’s one more problem in pg_amcheck’s GiST verification. We must
>> check that amcheck is 1.5+ and use GiST verification only in that
>> case …
> 
> Done. I’ll set the status to “Needs review”.
> 

I realized amcheck GIN/GiST support would be useful for testing my
patches adding parallel builds for these index types, so I decided to
take a look at this and do an initial review today.

Attached is a patch series with a extra commits to keep the review
comments and patches adjusting the formatting by pgindent (the patch
seems far enough for this).

Let me quickly go through the review comments:

1) Not sure I like 'amcheck.c' very much, I'd probably go with something
like 'verify_common.c' to match naming of the other files. But it's just
nitpicking and I can live with it.

2) amcheck_lock_relation_and_check seems to be the most important
function, yet there's no comment explaining what it does :-(

3) amcheck_lock_relation_and_check still has a TODO to add the correct
name of the AM

4) Do we actually need amcheck_index_mainfork_expected as a separate
function, or could it be a part of index_checkable?

5) The comment for heaptuplespresent says "debug counter" but that does
not really explain what it's for. (I see verify_nbtree has the same
comment, but maybe let's improve that.)

6) I'd suggest moving the GISTSTATE + blocknum fields to the beginning
of GistCheckState, it seems more natural to start with "generic" fields.

7) I'd adjust the gist_check_parent_keys_consistency comment a bit, to
explain what the function does first, and only then explain how.

8) We seem to be copying PageGetItemIdCareful() around, right? And the
copy in _gist.c still references nbtree - I guess that's not right.

9) Why is the GIN function called gin_index_parent_check() and not
simply gin_index_check() as for the other AMs?

10) The debug in gin_check_posting_tree_parent_keys_consistency triggers
assert when running with client_min_messages='debug5', it seems to be
accessing bogus item pointers.

11) Why does it add pg_amcheck support only for GiST and not GIN?

That's all for now. I'll add this to the stress-testing tests of my
index build patches, and if that triggers more issues I'll report those.

regards

-- 
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

> On Feb 21, 2025, at 9:07 AM, Mark Dilger <mark.dilger@enterprisedb.com> wrote:
>
> I infer that you intend to make v34-0004, v34-0006, and v35-0001 apply cleanly without the other patches and commit
itthat way.  If that is correct, be advised that I'm doing a review and will respond back shortly, maybe in a few
hours.

Ok, here is my review:

v34-0001 looks fine
v34-0002 refactoring is needed by the gin patches, so I kept it in the patchset for review purposes
v34-0004 can mostly be applied without v34-0003, but a few changes are needed to make it apply cleanly.
v34-0006 looks fine
v35-0001 applies cleanly

I find the token quoting and capitalization patterns in sql/check_gin.sql somewhat confusing, but I tried to follow
whatis already there in extending that test to also check gin indexes over jsonb data using jsonb_path_ops.  I think
thisis a common enough usage of gin that we should have test coverage for it. 

After extending the test a bit, I ran the tests and checked lcov:

    verify_common.c    86.3%
    verify_gin.c        38.4%
    verify_heapam.c    57.2%
    verify_nbtree.c    72.4%

Showing that verify_gin has the least coverage of all.  The main areas lacking coverage have to do with posting list
treesand concurrent page splits never being exercised.  My first attempt cover that with a TAP test using pgbench got
thenumber up to 56.8%, but while trying to get that higher, I started getting error reports from verify_gin(),
apparentlyout of function gin_check_parent_keys_consistency(): 

#   at t/006_gin_concurrency.pl line 137.
#                   'pgbench: error: client 14 script 1 aborted in command 0 query 0: ERROR:  index "ginidx" has wrong
tupleorder on entry tree page, block 153, offset 8 
# pgbench: error: client 0 script 1 aborted in command 0 query 0: ERROR:  index "ginidx" has wrong tuple order on entry
treepage, block 153, offset 8 
# pgbench: error: client 12 script 1 aborted in command 0 query 0: ERROR:  index "ginidx" has wrong tuple order on
entrytree page, block 153, offset 8 
# pgbench: error: client 7 script 1 aborted in command 0 query 0: ERROR:  index "ginidx" has wrong tuple order on entry
treepage, block 153, offset 8 
# pgbench: error: client 1 script 1 aborted in command 0 query 0: ERROR:  index "ginidx" has wrong tuple order on entry
treepage, block 153, offset 8 

<MORE LINES LIKE THE ABOVE SNIPPED>

The pgbench script is not corrupting anything overtly, so this looks to either be a bug in gin or a bug in the check.
Iam including a patchset with the original patches reworked plus the extra test cases.  For completeness, I also added
ginindexes to t/002_cic.pl and t/003_cic_2pc.pl. 

—
Mark Dilger
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Attachment

Re: Amcheck verification of GiST and GIN

From

Mark Dilger

Date:

21 February, 23:50:21

> On Feb 21, 2025, at 12:16 PM, Mark Dilger <mark.dilger@enterprisedb.com> wrote:
>
> The pgbench script is not corrupting anything overtly, so this looks to either be a bug in gin or a bug in the check.

I suspected the AccessShareLock taken by verify_gin() might be too weak, and upgraded that to ShareRowExclusiveLock so
asto prevent the concurrent table modifications (and incidentally other concurrent verify_gin() calls), but to my
surprisethat didn't fix anything.  Even AccessExclusiveLock doesn't fix it.  So this seems to either be a bug in the
checkingcode complaining about perfectly valid tuple order, or a bug in Gin corrupting its own entry tree page. 

On successive runs, (instrumented to print out a bit more info), there doesn't seem to be any obvious pattern in where
thecorruption occurs.  The offset in the page changes, neither always being at the beginning, nor always at the maxoff;
likewisethe block where corruption is detected changes from run to run.  I've noticed that the rightlink for the page
isalways the page's block number plus one, but that might just be that I haven't run enough iterations yet to see
counter-examples.

Could one of the patch authors take a look?  I don't have the time to chase this to conclusion just now.  Thanks.

—
Mark Dilger
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Re: Amcheck verification of GiST and GIN

From

Mark Dilger

Date:

22 February, 01:51:01

> On Feb 21, 2025, at 12:50 PM, Mark Dilger <mark.dilger@enterprisedb.com> wrote:
>
> Could one of the patch authors take a look?

I turned the TAP test which triggers the error into a regression test that does likewise, for ease of stepping through
thetest, if anybody should want to do that.  I'm attaching that patch here, but please note that I'm not expecting this
tobe committed. 

—
Mark Dilger
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Attachment

v0-0001-Add-a-reproducible-test-case-for-verify_gin-error.patch.no_apply

Re: Amcheck verification of GiST and GIN

From

Kirill Reshke

Date:

28 February, 09:26:35

On Sat, 22 Feb 2025 at 03:51, Mark Dilger <mark.dilger@enterprisedb.com> wrote:
>
>
>
> > On Feb 21, 2025, at 12:50 PM, Mark Dilger <mark.dilger@enterprisedb.com> wrote:
> >
> > Could one of the patch authors take a look?
>
> I turned the TAP test which triggers the error into a regression test that does likewise, for ease of stepping
throughthe test, if anybody should want to do that.  I'm attaching that patch here, but please note that I'm not
expectingthis to be committed. 

Hi!
Your efforts are much appreciated!
I used this patch to derive a smaller repro.

> this seems to either be a bug in the checking code complaining about perfectly valid tuple order,

I'm doubtful this is the case. I have added some more logging to
gin_index_check, and here is output after running attached:
```
DEBUG:  processing entry tree page at blk 2, maxoff: 125
....
DEBUG:  comparing for offset 78 category 0
DEBUG:  comparing for offset 79 category 2
DEBUG:  comparing for offset 80 category 3
DEBUG:  comparing for offset 81 category 0
LOG:  index "ginidx" has wrong tuple order on entry tree page, block
2, offset 81, rightlink 4294967295
DEBUG:  comparing for offset 82 category 0
....
DEBUG:  comparing for offset 100 category 0
DEBUG:  comparing for offset 101 category 2
DEBUG:  comparing for offset 102 category 3
DEBUG:  comparing for offset 103 category 0
LOG:  index "ginidx" has wrong tuple order on entry tree page, block
2, offset 103, rightlink 4294967295
DEBUG:  comparing for offset 104 category 0
DEBUG:  comparing for offset 105 category 0
```
So, we have an entry tree page, where there is tuple on offset 80,
with gin tuple category = 3, and then it goes category 0 again. And
one more such pattern on the same page.
The ginCompareEntries function compares the gin tuples category first.
I do not understand how this would be a valid order on the page, given
that
ginCompareEntries used in `ginget.c` logic. . Maybe I'm missing
something vital about GIN.

--
Best regards,
Kirill Reshke

Attachment

0001-Much-smaller-repro.patch

Re: Amcheck verification of GiST and GIN

From

Mark Dilger

Date:

28 February, 21:31:29

So, we have an entry tree page, where there is tuple on offset 80,
with gin tuple category = 3, and then it goes category 0 again. And
one more such pattern on the same page.
The ginCompareEntries function compares the gin tuples category first.
I do not understand how this would be a valid order on the page, given
that
ginCompareEntries used in `ginget.c` logic. . Maybe I'm missing
something vital about GIN.

The only obvious definition of "wrong" for this is that gin index scans return different result sets than table scans over the same data. Using your much smaller reproducible test case, and adding rows like:

SELECT COUNT(*) FROM tbl WHERE j @> '"1129BBCABFFAACA9VGVKipnwohaccc9TSIMTOQKHmcGYVeFE_PWKLHmpyj60137672qugtsstugg"'::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> '{"": "r", "hji4124": "", "HTJP_DAptxn6": 9}'::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> '[]'::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> NULL::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> '{"": -6, "__": [""], "YMb": -22}'::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> '{"853": -60, "pjx": "", "TGLUG_jxmrggv": null}'::jsonb;
SELECT COUNT(*) FROM tbl WHERE j @> '"D3BDA069074174vx48A37IVHWVXLUP9382542ypsl1465pixtryzCBgrkkhrvCC_BDDFatkyXHLIe"'::jsonb;

SELECT COUNT(*) FROM tbl WHERE j @> '{"F18s": {"": -84194}, "ececab2": [""]}'::jsonb;

The results are the same with or without the index. Can you find any examples where they differ?

—
Mark Dilger
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Re: Amcheck verification of GiST and GIN

From

Kirill Reshke

Date:

01 March, 17:55:31

On Fri, 28 Feb 2025 at 23:31, Mark Dilger <mark.dilger@enterprisedb.com> wrote:

> The only obvious definition of "wrong" for this is that gin index scans return different result sets than table scans
overthe same data. Using your much smaller reproducible test case, and adding rows like:

Yeach, you are 100% right. Actually, along this thread, we have not
spotted any GIN bugs yet, only GIN amcheck bugs.

This turns out to be also an GIN amcheck bug:

```
DEBUG: comparing for offset 79 category 2 key attnum 1
DEBUG: comparing for offset 80 category 3 key attnum 1
DEBUG: comparing for offset 81 category 0 key attnum 2
LOG: index "ginidx" has wrong tuple order on entry tree page, block
2, offset 81, rightlink 4294967295
DEBUG: comparing for offset 82 category 0 key attnum 2
....
DEBUG: comparing for offset 100 category 0 key attnum 2
DEBUG: comparing for offset 101 category 2 key attnum 2
DEBUG: comparing for offset 102 category 3 key attnum 2
DEBUG: comparing for offset 103 category 0 key attnum 3
LOG: index "ginidx" has wrong tuple order on entry tree page, block
2, offset 103, rightlink 4294967295
DEBUG: comparing for offset 104 category 0 key attnum 3
DEBUG: comparing for offset 105 category 0 key attnum 3
```
Turns out we compare page entries for different attributes in
gin_check_parent_keys_consistency.

Trivial fix attached (see v37-0004). I now simply compare current and
prev attribute numbers. This revolves issue discovered by
`v0-0001-Add-a-reproducible-test-case-for-verify_gin-error.patch.no_apply`.
However, the stress test seems to still not pass. On my pc, it never
ens, all processes are in
DELETE waiting/UPDATE waiting state. I will take another look tomorrow.

p.s. I am just about to send this message, while i discovered we now
miss v34-0003-Add-gist_index_check-function-to-verify-GiST-ind.patch &
v34-0005-Add-GiST-support-to-pg_amcheck.patch from this patch series
;(

--
Best regards,
Kirill Reshke

Attachment

Re: Amcheck verification of GiST and GIN

From

Mark Dilger

Date:

27 March, 18:30:35

On Fri, Feb 21, 2025 at 6:29 AM Tomas Vondra <tomas@vondra.me> wrote:

Hi,

I see this patch didn't move since December :-( I still think these
improvements would be useful, it certainly was very helpful when I was
working on the GIN/GiST parallel builds (the GiST builds stalled, but I
hope to push the GIN patches soon).

So I'd like to get some of this in too. I'm not sure about the GiST
bits, because I know very little about that AM (the parallel builds made
me acutely aware of that).

But I'd like to get the GIN parts in. We're at v34 already, and the
recent changes were mostly cosmetic. Does anyone object to me polishing
and pushing those parts?

Kirill may have addressed my concerns in the latest version. I have not had time for another review. Tomas, would you still like to review and push this patch? I have no objection.

—
Mark Dilger
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Re: Amcheck verification of GiST and GIN

From

Tomas Vondra

Date:

27 March, 19:14:00


On 3/27/25 16:30, Mark Dilger wrote:
> 
> 
> On Fri, Feb 21, 2025 at 6:29 AM Tomas Vondra <tomas@vondra.me
> <mailto:tomas@vondra.me>> wrote:
> 
>     Hi,
> 
>     I see this patch didn't move since December :-( I still think these
>     improvements would be useful, it certainly was very helpful when I was
>     working on the GIN/GiST parallel builds (the GiST builds stalled, but I
>     hope to push the GIN patches soon).
> 
>     So I'd like to get some of this in too. I'm not sure about the GiST
>     bits, because I know very little about that AM (the parallel builds made
>     me acutely aware of that).
> 
>     But I'd like to get the GIN parts in. We're at v34 already, and the
>     recent changes were mostly cosmetic. Does anyone object to me polishing
>     and pushing those parts?
> 
> 
> Kirill may have addressed my concerns in the latest version.  I have not
> had time for another review.  Tomas, would you still like to review and
> push this patch?  I have no objection.
> 

Thanks for reminding me. I think the patches are in good share, but I'll
take a look once more, and I hope to get it committed.


regards

-- 
Tomas Vondra

Re: Amcheck verification of GiST and GIN

From

Tomas Vondra

Date:

28 March, 19:26:34

Here's a polished version of the patches. If you have any
comments/objections, please speak now. I don't plan to push 0006 (the
stress test), of course.

Changes I did:

1) update / write proper commit messages, hopefully explaining the
purpose of each patch well enough

2) update the lists of reviewers/authors (would appreciate someone
checking - it's hard to keep track for a thread that runs for years, and
it may not be quite clear what qualifies as a review)

3) squash the fix patch into the right patch, moved the README fix to be
the first patch (doesn't really matter)

4) minor cleanups in the main patches (0002 and 0003), mostly adding the
structs to typedefs.list and tweaking a couple comments

5) I've adjusted names of the memory contexts, because having both with
"amcheck context" seemed confusing, especially as it's in caller-callee
functions. So now it's

- amcheck consistency check context
- posting tree check context


regards

-- 
Tomas Vondra

Hello Arseniy,

I finally got time to look at this more closely, and do some testing.

Are there any cases when the current code incorrectly reports corruption
for a valid index? So far I've been unable to find such case. Or am I wrong?

It seems to me all the proposed changes are "tightening" the checks, in
the sense that we might have missed certain types of issues before. This
is supported by the fact that the new TAP test fails on master, i.e.
master does not report the corruption the TAP introduces.

(The TAP test is great, it would have been great to add something like
this in the original commit.)

Also, I've noticed that the TAP test passes even with some (most) of the
verify_gin.c changes reverted. See the 0002 patch - this does not break
the TAP test. Of course, that does not prove the changes are wrong and
I'm not claiming that. But can we improve the TAP test to trigger this
too? To show the current code (in master) misses this?

Grigory, Andrey, Heikki, any opinions on the tweaks?


regards

-- 
Tomas Vondra

Attachment

Re: Amcheck verification of GiST and GIN

From

Arseniy Mukhin

Date:

26 May, 19:28:47

Hello Tomas,

On Mon, May 26, 2025 at 1:27 PM Tomas Vondra <tomas@vondra.me> wrote:

>
> Hello Arseniy,
>
> I finally got time to look at this more closely, and do some testing.

Thank you for looking into this.

> Are there any cases when the current code incorrectly reports corruption
> for a valid index? So far I've been unable to find such case. Or am I wrong?

I think you are right, I'm not aware of such cases either.

> It seems to me all the proposed changes are "tightening" the checks, in
> the sense that we might have missed certain types of issues before. This
> is supported by the fact that the new TAP test fails on master, i.e.
> master does not report the corruption the TAP introduces.

I would say points 4, 5, 7 - yes, they are about tightening checks.

I think point 1 is more about fixing the existing code. In the current code,
parent_key is always NULL for the entry tree, so a bunch of code
(related to checking consistency between parents and children) is unreachable.

Then if you apply changes of the 1st point and parent_key comparison code starts
working, you will need changes of the 2nd point. The current code
ignores attribute
numbers in parent_key check, which can lead to comparing keys of
different columns.
I see one scenario where it can happen: let's say we have a 2 column
index. The first
attribute type is "int", the second attribute type is "text". In the
multicolumn gin
index tuple has two parts: attno and key value. Let's write it as
(attno, key). While
traversing the entry tree the current code caches parent keys with child blkno.
Let's say it cached (2, "a") parent key. It means that there was a
time when the child
page's high key was (2, "a"). But when the child page check actually
starts, it's possible
that as a result of parallel splits, the child page now contains keys
of the first
attribute only, for example (1, 1), (1, 5), (1, 10). So if we ignore
the attribute
number here, we will end up comparing 10 with "a". Hope the example is
not too confusing.

The 3rd point is about the code that never runs. As I understood it is
supposed that the check detects
splits so we can check more index pages, but If I'm not wrong it
doesn't work now.

The 6th point is about comparison with invalid pointer. I thought that
it's probably
not right to compare it with invalid pointer, but now I'm not sure.

> (The TAP test is great, it would have been great to add something like
> this in the original commit.)

Great, thank you for the feedback.

> Also, I've noticed that the TAP test passes even with some (most) of the
> verify_gin.c changes reverted. See the 0002 patch - this does not break
> the TAP test. Of course, that does not prove the changes are wrong and
> I'm not claiming that. But can we improve the TAP test to trigger this
> too? To show the current code (in master) misses this?

Yes, changes in the undo patch is about posting tree check part (6, 7 points)
and I haven't written tests for it, because to break posting tree you need to
manipulate with tids which is not as easy as replace "aaaa" with "cccc" as tests
for entry tree do. Probably it would be much easier to use page api to
corrupt some
posting tree pages, but I don't know, is it impossible in TAP tests?

Re: Amcheck verification of GiST and GIN

From

Tomas Vondra

Date:

09 June, 01:14:58

On 5/29/25 13:53, Arseniy Mukhin wrote:
> On Mon, May 26, 2025 at 7:28 PM Arseniy Mukhin
> <arseniy.mukhin.dev@gmail.com> wrote:
>> On Mon, May 26, 2025 at 1:27 PM Tomas Vondra <tomas@vondra.me> wrote:
>>> Also, I've noticed that the TAP test passes even with some (most) of the
>>> verify_gin.c changes reverted. See the 0002 patch - this does not break
>>> the TAP test. Of course, that does not prove the changes are wrong and
>>> I'm not claiming that. But can we improve the TAP test to trigger this
>>> too? To show the current code (in master) misses this?
>>
>> Yes, changes in the undo patch is about posting tree check part (6, 7 points)
>> and I haven't written tests for it, because to break posting tree you need to
>> manipulate with tids which is not as easy as replace "aaaa" with "cccc" as tests
>> for entry tree do. Probably it would be much easier to use page api to
>> corrupt some
>> posting tree pages, but I don't know, is it impossible in TAP tests?
> 
> I added the test for the posting tree parent_key check. Now applying
> 'undo patch' results in a test failure.

Great, thank you.

I noticed git-am complaining about a couple whitespace issues in the
test, mostly about mixing spaces/tabs. The v4 fixes them (in a separate
part, but should be merged into 0001). It's a detail, but might be good
to try git-am on patches ;-)

> Also I realized that the test 'invalid_entry_columns_order_test' will
> fail on big endian machines,
> because varlena len encoding is different for little endian and big
> endian, so I changed the test a little bit.
> Now the test doesn't use varlena len byte in regex.

I think it'd make sense to split this into smaller patches, each fixing
a different issue. Not one patch for each of the 11 items in your
original message, that would be an overkill ...

I propose to split it like this, into three parts, each addressing a
particular type of mistake:

1) gin_check_posting_tree_parent_keys_consistency

2) gin_check_parent_keys_consistency / att comparisons

3) gin_check_parent_keys_consistency / setting ptr->parenttup (at the end)

Does this make sense to you? If yes, can you split the patch series like
this, including a commit message for each part, explaining the fix? We'd
need the commit message even with a single patch, ofc.

> I also remove the blksize hardcode and start getting it from the
> cluster configuration. But anyway some tests
> will fail with not standard block size (probably all tests where tree
> growth is expected).
> 

I think that's fine. AFAIK we don't expect tests to be 100% stable with
other block sizes. It shouldn't crash / segfault, ofc, but some tests
may be sensitive to this.

BTW I hoped to get this fix pushed this week, but that didn't happen and
I'll be away most of next week :-( Let's try to get this sorted so that
I can push it on June 16 or so.

regards

-- 
Tomas Vondra

Attachment

Re: Amcheck verification of GiST and GIN

From

Tomas Vondra

Date:

09 June, 18:34:14

On 6/9/25 00:14, Tomas Vondra wrote:
> ...
>
> I propose to split it like this, into three parts, each addressing a
> particular type of mistake:
> 
> 1) gin_check_posting_tree_parent_keys_consistency
> 
> 2) gin_check_parent_keys_consistency / att comparisons
> 
> 3) gin_check_parent_keys_consistency / setting ptr->parenttup (at the end)
> 
> Does this make sense to you? If yes, can you split the patch series like
> this, including a commit message for each part, explaining the fix? We'd
> need the commit message even with a single patch, ofc.
> 
The attached v5 patch splits it along these lines, except that the extra
0001 part merely adds a multicolumn index into the regression test. The
0002-0004 parts are ordered to match the TAP test, i.e. it adds tests.

I've copied the points from the report to the commit messages, but this
needs cleanup/rephrasing, to make it readable. Could you look into
that?Of course, if you think the patches should be split differently,
feel free to move stuff.

And as I said before - if you feel the issues are too intertwined and
can't be split like this (or it just doesn't make sense), please speak
up. We can commit that as a single patch. It still needs the commit
message, though.

regards

-- 
Tomas Vondra

Attachment

Re: Amcheck verification of GiST and GIN

From

Arseniy Mukhin

Date:

09 June, 19:37:39

On Mon, Jun 9, 2025 at 6:34 PM Tomas Vondra <tomas@vondra.me> wrote:
>
> On 6/9/25 00:14, Tomas Vondra wrote:
> > ...
> >
> > I propose to split it like this, into three parts, each addressing a
> > particular type of mistake:
> >
> > 1) gin_check_posting_tree_parent_keys_consistency
> >
> > 2) gin_check_parent_keys_consistency / att comparisons
> >
> > 3) gin_check_parent_keys_consistency / setting ptr->parenttup (at the end)
> >
> > Does this make sense to you? If yes, can you split the patch series like
> > this, including a commit message for each part, explaining the fix? We'd
> > need the commit message even with a single patch, ofc.
> >
> The attached v5 patch splits it along these lines, except that the extra
> 0001 part merely adds a multicolumn index into the regression test. The
> 0002-0004 parts are ordered to match the TAP test, i.e. it adds tests.

Great, thank you.

> I've copied the points from the report to the commit messages, but this
> needs cleanup/rephrasing, to make it readable. Could you look into
> that?Of course, if you think the patches should be split differently,
> feel free to move stuff.

Yes, sure, I will do it ASAP.

> And as I said before - if you feel the issues are too intertwined and
> can't be split like this (or it just doesn't make sense), please speak
> up. We can commit that as a single patch. It still needs the commit
> message, though.

The way it splitted seems reasonable to me. Intertwined issues are
grouped together, and patches are more or less independent.

Also the test for 'posting tree parent_key check' that was added last
started failing locally. Don't know what changed, but I rewrote it
so now it relies on child blkno, which is stable (I hope), instead of
concrete TID. Will include it in the new patchset.


Best regards,
Arseniy Mukhin

Re: Amcheck verification of GiST and GIN

From

Arseniy Mukhin

Date:

10 June, 11:18:42

On Mon, Jun 9, 2025 at 7:37 PM Arseniy Mukhin
<arseniy.mukhin.dev@gmail.com> wrote:
>
> On Mon, Jun 9, 2025 at 6:34 PM Tomas Vondra <tomas@vondra.me> wrote:
> >
> > On 6/9/25 00:14, Tomas Vondra wrote:
> > > ...
> > >
> > > I propose to split it like this, into three parts, each addressing a
> > > particular type of mistake:
> > >
> > > 1) gin_check_posting_tree_parent_keys_consistency
> > >
> > > 2) gin_check_parent_keys_consistency / att comparisons
> > >
> > > 3) gin_check_parent_keys_consistency / setting ptr->parenttup (at the end)
> > >
> > > Does this make sense to you? If yes, can you split the patch series like
> > > this, including a commit message for each part, explaining the fix? We'd
> > > need the commit message even with a single patch, ofc.
> > >
> > The attached v5 patch splits it along these lines, except that the extra
> > 0001 part merely adds a multicolumn index into the regression test. The
> > 0002-0004 parts are ordered to match the TAP test, i.e. it adds tests.
>
> Great, thank you.
>
> > I've copied the points from the report to the commit messages, but this
> > needs cleanup/rephrasing, to make it readable. Could you look into
> > that?Of course, if you think the patches should be split differently,
> > feel free to move stuff.
>
> Yes, sure, I will do it ASAP.
>

Please find a new version in attachments. There are formatted commit
messages and some cosmetic changes in the tests. Please let me know if
anything needs to be changed. Also FWIW points 9th, 10th and 11th from
the report [1] were not addressed in the fixes. I'm not sure about
10th and 11th, but 9th seems like a no-brainer, so I added a patch
deleting an unused field 'parentlsn'. I tried git-am with patches and
it's ok with it. Thank you for the advice, added git-am step in my
patch preparation routine.

> ...
> Also the test for 'posting tree parent_key check' that was added last
> started failing locally. Don't know what changed, but I rewrote it
> so now it relies on child blkno, which is stable (I hope), instead of
> concrete TID. Will include it in the new patchset.
>

Also changed the regex pattern for this failing test, hope it is more
robust now.


[1] https://postgr.es/m/CAE7r3MJ611B9TE=YqBBncewp7-k64VWs+sjk7XF6fJUX77uFBA@mail.gmail.com


Best regards,
Arseniy Mukhin

Attachment

Re: Amcheck verification of GiST and GIN

From

Andrey Borodin

Date:

15 June, 16:24:04

Hi Arseniy!

Thanks for finding these problems.
I had several attempts to wrap my head around original patch with fixes, but when it was broken into several steps it
finallybecame easier for me. 
Here are some thought about patches.

> On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> <0001-amcheck-Add-gin_index_check-on-a-multicolumn-index.patch>

The test seems harmless and nice to have. I understand that this test is needed to extend coverage.
Perhaps, we could verify that some code is actually triggered. Personally, I would be happy if we could some add
injectionpoints with notices at tested branches. But, AFAIK, it's too much of a burden to have injection points in
contribextensions. We had very similar problem with sort patch in btree_gist and eventually gave up. elog(DEBUG) was
nota good solution too, because it was unstable. 
See 'gin-finish-incomplete-split' or 'hash-aggregate-enter-spill-mode' for reference.

> On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> <0002-patch-1-gin_check_parent_keys_consistency.patch>

Well, we inherited ginCompareEntries() from the very first patch version from 2020. I can't really say anything about
differenceshere, but your proposed change seems correct. 

Kirill excluded rightmost keys in v33 and that was kind of a fix. Kirill, do you remember if was particular problem of
internalpages? Is it safe to enable tuple order check for rightmost tuples on leaf pages? 

You wrote this comment:
+            /*
+             * First block is metadata, skip order check. Also, never check
+             * for high key on rightmost page, as this key is not really
+             * stored explicitly.
+             */

I agree that exclusion (stack->blkno != GIN_ROOT_BLKNO) make no sense. It was with us from the original version from
2020.As I understand some checks on root page will be used in test invalid_entry_columns_order_test. 

Having some TAP tests sounds like a very good idea.

I'm a bit surprised by excluding some letters from random_string(), but perhaps it's fine for this test.

Somewhere here:
+        INSERT INTO $relname (a) VALUES (('{' || 'pppppppppp' || random_string(1870) ||'}')::text[]);
I'd like to have a comment explaining number 1870. And, probably, you expect exactly 2 tuples on root page, right?

Are we 100% certain that 'rrrrrrrrr' will always be on root page?

I do not see much value in having variables $relname and $indexname. I'd just substitute its usages with literals. But
I'mnot sure, maybe this structure will be used in your tests later... 

In this function
+sub string_replace_block
I'd suggest a little bit of comments. Also, perhaps, fsync of files, but 001_verify_heapam.pl does not do fsync. So,
maybeit's OK here too. 

Also, I have a wild idea. Maybe add an assert that block size if 8192 and just exit otherwise?

And, maybe instead of gin_clean_pending_list() you can just create an index with fastupdate=off.

> On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> <0003-patch-2-gin_check_parent_keys_consistency.patch>

The patch seems correct to me.
Except this
+    my $blkno = 5;  # leaf
in test reads scary. Will it be stable on buildfarm?

> On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> <0004-patch-3-gin_check_posting_tree_parent_keys_consisten.patch>

I generally agree with direction of this patch.
But please also check the approach of PageGetItemIdCareful() in verify_nbtree.c. It goes extra mile to avoid coredump
incase of bogus ItemId. Should we do something like that here too? 

> On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
>
> <0005-patch-4-remove-unused-parentlsn.patch>

LGTM.

> On 9 May 2025, at 17:43, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
>
> 10) README says "Vacuum never deletes tuples or pages from the entry
> tree." But check assumes that it's possible to have
> deleted leaf page with 0 entries.
>
>    if (GinPageIsDeleted(page))
>    {
>       if (!GinPageIsLeaf(page))
>          ereport(ERROR,
>                (errcode(ERRCODE_INDEX_CORRUPTED),
>                 errmsg("index \"%s\" has deleted internal page %u",
>                      RelationGetRelationName(rel), blockNo)));
>       if (PageGetMaxOffsetNumber(page) > InvalidOffsetNumber)
>          ereport(ERROR,
>                (errcode(ERRCODE_INDEX_CORRUPTED),
>                 errmsg("index \"%s\" has deleted page %u with tuples",
>                      RelationGetRelationName(rel), blockNo)));
>    }

To enforce such an invariant we must be sure that GIN never deleted entry pages in older versions. I do not have enough
knowledgeof the history for this. 

> 11) When we compare entry tree max page key with parent key:
>
>             if (ginCompareAttEntries(&state, attnum, current_key,
>                              current_key_category, parent_key_attnum,
>                                      parent_key, parent_key_category) > 0)
>             {
>                /*
>                 * There was a discrepancy between parent and child
>                 * tuples. We need to verify it is not a result of
>                 * concurrent call of gistplacetopage(). So, lock parent
>                 * and try to find downlink for current page. It may be
>                 * missing due to concurrent page split, this is OK.
>                 */
>                pfree(stack->parenttup);
>                stack->parenttup = gin_refind_parent(rel, stack->parentblk,
>                                            stack->blkno, strategy);
>
> I think we can remove gin_refind_parent() and do ereport right away here.
> The same logic as with 3). AFAIK it's impossible to have a child item
> with a key that is higher than the cached parent key.
> Parent key bounds what keys we can insert into the child page, so it
> seems there is no way how they can appear there.
>

This logic was copied from GiST check. In GiST "Area of responsibility" of internal tuple can be extended in any
direction.That's why we need to lock parent page. 
If in GIN internal tuple keyspace is never extended - it's OK to avoid gin_refind_parent().
But reasoning about GIN concurrency is rather difficult. Unfortunately, we do not have such checks in B-tree
verificationwithout ShareLock. Either way we could peep some idea from there. 

Thank you!

Best regards, Andrey Borodin.

Re: Amcheck verification of GiST and GIN

From

Arseniy Mukhin

Date:

16 June, 01:25:40

On Sun, Jun 15, 2025 at 4:24 PM Andrey Borodin <x4mmm@yandex-team.ru> wrote:
>
>
> Hi Arseniy!
>
> Thanks for finding these problems.
> I had several attempts to wrap my head around original patch with fixes, but when it was broken into several steps it
finallybecame easier for me. 
> Here are some thought about patches.
>

Hi Andrey! Thank you for the review.

>
> > On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> > <0001-amcheck-Add-gin_index_check-on-a-multicolumn-index.patch>
>
> The test seems harmless and nice to have. I understand that this test is needed to extend coverage.
> Perhaps, we could verify that some code is actually triggered. Personally, I would be happy if we could some add
injectionpoints with notices at tested branches. But, AFAIK, it's too much of a burden to have injection points in
contribextensions. We had very similar problem with sort patch in btree_gist and eventually gave up. elog(DEBUG) was
nota good solution too, because it was unstable. 
> See 'gin-finish-incomplete-split' or 'hash-aggregate-enter-spill-mode' for reference.

I'm not familiar with injections points much, but I think I got the
idea, sounds interesting. Thank you for the references.

>
> Having some TAP tests sounds like a very good idea.
>
> I'm a bit surprised by excluding some letters from random_string(), but perhaps it's fine for this test.
>

Yeah, there is no reason why we can't use vowels here, so I will add
them so that it doesn't look like there is any point in their absence.

> Somewhere here:
> +               INSERT INTO $relname (a) VALUES (('{' || 'pppppppppp' || random_string(1870) ||'}')::text[]);
> I'd like to have a comment explaining number 1870. And, probably, you expect exactly 2 tuples on root page, right?
>

The idea behind "random_string(1870)" was to get split as fast as
possible, but tuples with size > 2kb are toasted, so we have to use
something about 2k here. I think I took 1870 from some other place
where it was necessary, but here we can round it to 1900. So I'll
replace 1870 with 1900 and add a comment about the size. Also gonna
add some comments about datasets in some tests to make it more clear.

> Are we 100% certain that 'rrrrrrrrr' will always be on root page?

I'm not 100% sure. AFAIK the split algorithm is deterministic and the
idea was that if we use very long tuples, then all other factors will
be too small to influence what key we will see on the root page.

> I do not see much value in having variables $relname and $indexname. I'd just substitute its usages with literals.
ButI'm not sure, maybe this structure will be used in your tests later... 

I added variables just because we use index name and table name
several times, but I don't mind getting rid of them.

>
> In this function
> +sub string_replace_block
> I'd suggest a little bit of comments. Also, perhaps, fsync of files, but 001_verify_heapam.pl does not do fsync. So,
maybeit's OK here too. 
>

Will add a comment here.

> Also, I have a wild idea. Maybe add an assert that block size if 8192 and just exit otherwise?

I like the idea. I thought maybe it would be great to have some
function that every TAP test can use if it needs a certain block size?

> And, maybe instead of gin_clean_pending_list() you can just create an index with fastupdate=off.

Yeah, I think we can do it even simpler if we move index creation to
the end as regression tests do.

>
> > On 10 Jun 2025, at 13:18, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> > <0003-patch-2-gin_check_parent_keys_consistency.patch>
>
> The patch seems correct to me.
> Except this
> +       my $blkno = 5;  # leaf
> in test reads scary. Will it be stable on buildfarm?
>

Not sure, but I thought that blkno should be more or less the same everywhere.

>
> > On 9 May 2025, at 17:43, Arseniy Mukhin <arseniy.mukhin.dev@gmail.com> wrote:
> >
> > 10) README says "Vacuum never deletes tuples or pages from the entry
> > tree." But check assumes that it's possible to have
> > deleted leaf page with 0 entries.
> >
> >    if (GinPageIsDeleted(page))
> >    {
> >       if (!GinPageIsLeaf(page))
> >          ereport(ERROR,
> >                (errcode(ERRCODE_INDEX_CORRUPTED),
> >                 errmsg("index \"%s\" has deleted internal page %u",
> >                      RelationGetRelationName(rel), blockNo)));
> >       if (PageGetMaxOffsetNumber(page) > InvalidOffsetNumber)
> >          ereport(ERROR,
> >                (errcode(ERRCODE_INDEX_CORRUPTED),
> >                 errmsg("index \"%s\" has deleted page %u with tuples",
> >                      RelationGetRelationName(rel), blockNo)));
> >    }
>
> To enforce such an invariant we must be sure that GIN never deleted entry pages in older versions. I do not have
enoughknowledge of the history for this. 

Agree, good point.

>
> > 11) When we compare entry tree max page key with parent key:
> >
> >             if (ginCompareAttEntries(&state, attnum, current_key,
> >                              current_key_category, parent_key_attnum,
> >                                      parent_key, parent_key_category) > 0)
> >             {
> >                /*
> >                 * There was a discrepancy between parent and child
> >                 * tuples. We need to verify it is not a result of
> >                 * concurrent call of gistplacetopage(). So, lock parent
> >                 * and try to find downlink for current page. It may be
> >                 * missing due to concurrent page split, this is OK.
> >                 */
> >                pfree(stack->parenttup);
> >                stack->parenttup = gin_refind_parent(rel, stack->parentblk,
> >                                            stack->blkno, strategy);
> >
> > I think we can remove gin_refind_parent() and do ereport right away here.
> > The same logic as with 3). AFAIK it's impossible to have a child item
> > with a key that is higher than the cached parent key.
> > Parent key bounds what keys we can insert into the child page, so it
> > seems there is no way how they can appear there.
> >
>
> This logic was copied from GiST check. In GiST "Area of responsibility" of internal tuple can be extended in any
direction.That's why we need to lock parent page. 
> If in GIN internal tuple keyspace is never extended - it's OK to avoid gin_refind_parent().
> But reasoning about GIN concurrency is rather difficult. Unfortunately, we do not have such checks in B-tree
verificationwithout ShareLock. Either way we could peep some idea from there. 
>

Got it.


Here is the new version. I fixed some points that Andrey mentioned.
All of them in the TAP test. Several comments were added, filler size
1870 changed to 1900. Also I added vowels to the replace function and
moved index creation after the data filling. Thank you!


Best regards,
Arseniy Mukhin

On 6/17/25 16:19, Thom Brown wrote:
> On Mon, 16 Jun 2025 at 21:00, Tomas Vondra <tomas@vondra.me> wrote:
>>
>> On 6/16/25 21:09, Arseniy Mukhin wrote:
>>> On Mon, Jun 16, 2025 at 6:58 PM Tomas Vondra <tomas@vondra.me> wrote:
>>>>
>>>> Thanks.
>>>>
>>>> I went through the patches, polished the commit messages and did some
>>>> minor tweaks in patch 0002 (to make the variable names a bit more
>>>> consistent, and reduce the scope a little bit). I left it as a separate
>>>> patch to make the changes clearer, but it should be merged into 0002.
>>>>
>>>> Please read through the commit messages, and let me know if I got some
>>>> of the details wrong (or not clear enough). Otherwise I plan to start
>>>> pushing this soon (~tomorrow).
>>>
>>> LGTM.
>>> Noticed a few typos in messages:
>>> in v8-0002-amcheck-Fix-checks-of-entry-order-for-GIN-indexes.patch
>>>    - parent key is creator
>>>    - as the core incorrectly expected
>>> and 'Arseniy Mikhin' in some patches.
>>>
>>
>> Thanks for noticing those typos, especially the one in the name.
> 
> Do today's commits clear this from the PostgreSQL 18 Open Items list?
> 

That's the intent, yes. There's one remaining commit.


-- 
Tomas Vondra