ltree_gist indexes broken after pg_upgrade from 12 to 13 - Mailing list pgsql-hackers

From Tomas Vondra
Subject ltree_gist indexes broken after pg_upgrade from 12 to 13
Date
Msg-id d80e0a55-6c3e-5b26-53e3-3c4f973f737c@enterprisedb.com
Whole thread Raw
Responses Re: ltree_gist indexes broken after pg_upgrade from 12 to 13  (Tom Lane <tgl@sss.pgh.pa.us>)
Re: ltree_gist indexes broken after pg_upgrade from 12 to 13  (Andres Freund <andres@anarazel.de>)
Re: ltree_gist indexes broken after pg_upgrade from 12 to 13  (Nikita Glukhov <n.gluhov@postgrespro.ru>)
List pgsql-hackers
Hi,

Victor Yegorov reported a crash related to a GiST index on ltree [1],
following a pg_upgrade from 12.x to 14.x, with a data set reproducing
this. I spent some time investigating this, and it seems this is a silly
bug in commit

  commit 911e70207703799605f5a0e8aad9f06cff067c63 (HEAD)
  Author: Alexander Korotkov <akorotkov(at)postgresql(dot)org>
  Date:   Mon Mar 30 19:17:11 2020 +0300

    Implement operator class parameters
    ...

in PG13, which modified ltree_gist so that siglen is opclass parameter
(and not hard-coded). But the procedures use LTREE_GET_ASIGLEN macro to
extract the value, which either extracts the value from the catalog (if
the index has opclass parameter) or uses a default value - but it always
uses LTREE_ASIGLEN_DEFAULT, which belongs to _ltree_gist opclass (i.e.
for array of ltree). And that's 28 instead of the 8, as it should be.

Attached is a simple patch tweaking the macros, which resolves the
issue. Maybe there should be a separate macro though, because "ASIGLEN"
probably refers to "array siglen".

But the bigger issue is this fixes indexes created before the upgrade,
but it breaks indexes created after it. Because those were created with
siglen 28B, so reverting to 8B breaks them. A bit of a symmetry :-/

I'm not sure what to do about this. There's no way to distinguish
indexes, and I think an index may actually be broken both with and
without the patch - imagine an index created before the upgrade (so
using siglen 8), but then receiving a couple new rows (siglen 28).

After thinking about this I only see two options:

1) Don't apply the patch and tell everyone using ltree_gist they need to
rebuild the index after pg_upgrade from 12 to 13+. The downside of this
is larger indexes (because some tuples are 20B larger).

2) Apply the patch + tell those who already upgraded from 12 to rebuild
ltree_gist indexes, because those might be broken due to new inserts.


My opinion is to do (2), because at least those who'll upgrade later
(which is a lot of people) will be fine without a rebuild. And it also
makes the indexes a bit smaller, thanks to saving 20B.


regards


[1]
https://www.postgresql.org/message-id/17406-71e02820ae79bb40%40postgresql.org


-- 
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
Attachment

pgsql-hackers by date:

Previous
From: Jacob Champion
Date:
Subject: Re: [PATCH] Expose port->authn_id to extensions and triggers
Next
From: Mark Wong
Date:
Subject: real/float example for testlibpq3