Re: BUG #4410: Indexes not seen right away - Mailing list pgsql-bugs

From Bruce Momjian
Subject Re: BUG #4410: Indexes not seen right away
Date
Msg-id 200810022107.m92L7nh25041@momjian.us
Whole thread Raw
In response to Re: BUG #4410: Indexes not seen right away  ("Greg Sabino Mullane" <greg@turnstep.com>)
Responses Re: BUG #4410: Indexes not seen right away  ("Greg Sabino Mullane" <greg@turnstep.com>)
List pgsql-bugs
Greg Sabino Mullane wrote:
> > Does it have pg_index.indcheckxmin = true?  If so, see README.HOT.
>
> Yes, that was probably it. Is this worth noting in the documentation somewhere
> (other than the technical bowels of HOT)? Perhaps in the CREATE INDEX
> docs?

I have done some research on this.  Postgres 8.3 didn't allow
in-progress transactions to see the CREATE INDEX if the index had broken
HOT chains.  However, the 8.4 code has more sophisticated tracking of
snapshots so this should be less of a problem.  The only way I could get
the CREATE INDEX to be invisible in 8.4 was to use a serialized
isolation level.

I have attached the scripts I used for testing.  I don't think it is
worth documenting this until we have 8.4 released and people start using
it.   I believe a mention in the manual would require quite complex
wording.  I have also updated README.HOT, patch attached.

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +
DROP TABLE test;
CREATE TABLE test (x int);
DROP SEQUENCE seq;
CREATE SEQUENCE seq;
INSERT INTO test SELECT nextval('seq') FROM generate_series(1, 1000);
BEGIN;
-- entable line below to create old snapshot that
-- makes index invisible in 8.4
--SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
EXPLAIN SELECT * FROM test WHERE x=3;
\echo run HOT.index in another session
UPDATE test SET x = x + 1 WHERE x >= 1000;
SELECT ctid, * FROM test WHERE x > 1000;
UPDATE test SET x = x + 1 WHERE x >= 1000;
SELECT ctid, * FROM test WHERE x > 1000;
UPDATE test SET x = x + 1 WHERE x >= 1000;
SELECT ctid, * FROM test WHERE x > 1000;
UPDATE test SET x = x + 1 WHERE x >= 1000;
SELECT ctid, * FROM test WHERE x > 1000;

CREATE INDEX i_test ON test(x);

EXPLAIN SELECT * FROM test WHERE x=3;
END;

Index: src/backend/access/heap/README.HOT
===================================================================
RCS file: /cvsroot/pgsql/src/backend/access/heap/README.HOT,v
retrieving revision 1.3
diff -c -c -r1.3 README.HOT
*** src/backend/access/heap/README.HOT    21 Mar 2008 13:23:27 -0000    1.3
--- src/backend/access/heap/README.HOT    2 Oct 2008 20:58:25 -0000
***************
*** 301,321 ****
  a "broken" chain that can't be indexed properly.

  To address this issue, regular (non-concurrent) CREATE INDEX makes the
! new index usable only by transactions newer than the CREATE INDEX
! command.  This prevents transactions that can see the inconsistent HOT
! chains from trying to use the new index and getting incorrect results.
! New transactions can only see the rows visible after the index was
! created, hence the HOT chains are consistent for them.

  Entries in the new index point to root tuples (tuples with current index
  pointers) so that our index uses the same index pointers as all other
  indexes on the table.  However the row we want to index is actually at
  the *end* of the chain, ie, the most recent live tuple on the HOT chain.
  That is the one we compute the index entry values for, but the TID
! we put into the index is that of the root tuple.  Since transactions that
  will be allowed to use the new index cannot see any of the older tuple
  versions in the chain, the fact that they might not match the index entry
! isn't a problem.  (Such transactions will check the tuple visibility
  information of the older versions and ignore them, without ever looking at
  their contents, so the content inconsistency is OK.)  Subsequent updates
  to the live tuple will be allowed to extend the HOT chain only if they are
--- 301,322 ----
  a "broken" chain that can't be indexed properly.

  To address this issue, regular (non-concurrent) CREATE INDEX makes the
! new index usable only by new transactions and transactions that don't
! have snapshots older than the the CREATE INDEX command.  This prevents
! queries that can see the inconsistent HOT chains from trying to use the
! new index and getting incorrect results.  Queries that can see the index
! can only see the rows that were visible after the index was created,
! hence the HOT chains are consistent for them.

  Entries in the new index point to root tuples (tuples with current index
  pointers) so that our index uses the same index pointers as all other
  indexes on the table.  However the row we want to index is actually at
  the *end* of the chain, ie, the most recent live tuple on the HOT chain.
  That is the one we compute the index entry values for, but the TID
! we put into the index is that of the root tuple.  Since queries that
  will be allowed to use the new index cannot see any of the older tuple
  versions in the chain, the fact that they might not match the index entry
! isn't a problem.  (Such queries will check the tuple visibility
  information of the older versions and ignore them, without ever looking at
  their contents, so the content inconsistency is OK.)  Subsequent updates
  to the live tuple will be allowed to extend the HOT chain only if they are
***************
*** 331,351 ****
  transaction to commit or roll back.  (We could do that for user tables
  too, but since the case is unexpected we prefer to throw an error.)

! Practically, we prevent old transactions from using the new index by
! setting pg_index.indcheckxmin to TRUE.  Queries are allowed to use such an
! index only after pg_index.xmin is below their TransactionXmin horizon,
! thereby ensuring that any incompatible rows in HOT chains are dead to them.
! (pg_index.xmin will be the XID of the CREATE INDEX transaction.  The reason
! for using xmin rather than a normal column is that the regular vacuum
! freezing mechanism will take care of converting xmin to FrozenTransactionId
! before it can wrap around.)

  This means in particular that the transaction creating the index will be
! unable to use the index.  We alleviate that problem somewhat by not setting
! indcheckxmin unless the table actually contains HOT chains with
! RECENTLY_DEAD members.  (In 8.4 we may be able to improve the situation,
! at least for non-serializable transactions, because we expect to be able to
! advance TransactionXmin intratransaction.)

  Another unpleasant consequence is that it is now risky to use SnapshotAny
  in an index scan: if the index was created more recently than the last
--- 332,350 ----
  transaction to commit or roll back.  (We could do that for user tables
  too, but since the case is unexpected we prefer to throw an error.)

! Practically, we prevent certain transactions from using the new index by
! setting pg_index.indcheckxmin to TRUE.  Transactions are allowed to use
! such an index only after pg_index.xmin is below their TransactionXmin
! horizon, thereby ensuring that any incompatible rows in HOT chains are
! dead to them. (pg_index.xmin will be the XID of the CREATE INDEX
! transaction.  The reason for using xmin rather than a normal column is
! that the regular vacuum freezing mechanism will take care of converting
! xmin to FrozenTransactionId before it can wrap around.)

  This means in particular that the transaction creating the index will be
! unable to use the index if the transaction has old snapshots.  We
! alleviate that problem somewhat by not setting indcheckxmin unless the
! table actually contains HOT chains with RECENTLY_DEAD members.

  Another unpleasant consequence is that it is now risky to use SnapshotAny
  in an index scan: if the index was created more recently than the last

pgsql-bugs by date:

Previous
From: "Jen McCann"
Date:
Subject: BUG #4447: install failed to start; libintl3.dll was not found
Next
From: "Donald Harter"
Date:
Subject: BUG #4448: postgres won't start with latest stable kernel