Re: Tricky bugs in concurrent index build - Mailing list pgsql-hackers

From Gregory Stark
Subject Re: Tricky bugs in concurrent index build
Date
Msg-id 878xletqr2.fsf@enterprisedb.com
In response to Re: Tricky bugs in concurrent index build  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Tricky bugs in concurrent index build
List pgsql-hackers
Tom Lane <tgl@sss.pgh.pa.us> writes:

> I wrote:
>> The problem case is that we take a tuple and try to insert it into the index.
>> Meanwhile someone else updates the tuple, and they're faster than us so
>> they get the new version into the index first.  Now our aminsert sees a
>> conflicting index entry, and as soon as it commits good aminsert will
>> raise a uniqueness error.  There's no backoff for "oh, the tuple I'm
>> inserting stopped being live while I was inserting it".
>
> It's possible that the problem could be solved by introducing such a
> backoff, ie, make aminsert recheck liveness of the tuple-to-be-inserted
> before declaring error.  Since we're about to fail anyway, performance
> of this code path probably isn't a huge issue.  But I haven't thought
> through whether it can be made to work with that addition.

Yesterday I considered whether I could just catch the error in validate_index
and retest HeapTupleSatisfiesVacuum after the insert, but found that didn't
work any better. I don't remember what the problem was, though, and it's
possible it would work if the recheck were inside aminsert.
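
For illustration only, here is a toy Python model of the backoff being
discussed: on a uniqueness conflict, recheck whether the tuple being inserted
is still live before raising the error. This is not PostgreSQL code. The names
ToyTuple, ToyUniqueIndex and simulate are hypothetical, the "live" flag is a
stand-in for a real visibility check, and in-progress transactions and waiting
for commit are ignored entirely, so it says nothing about whether the idea
holds up in the real aminsert.

```python
from dataclasses import dataclass, field


class UniqueViolation(Exception):
    """Raised when a live tuple with the same key is already indexed."""


@dataclass
class ToyTuple:
    tid: int
    key: str
    live: bool = True  # hypothetical stand-in for a visibility check


@dataclass
class ToyUniqueIndex:
    entries: dict = field(default_factory=dict)  # key -> list of ToyTuple

    def insert(self, tup, recheck_liveness=False):
        """Insert tup; return False if it was skipped because it is dead."""
        conflicts = [t for t in self.entries.get(tup.key, [])
                     if t.live and t.tid != tup.tid]
        if conflicts:
            # Backoff: we are about to fail anyway, so first recheck whether
            # the tuple we are inserting stopped being live in the meantime.
            if recheck_liveness and not tup.live:
                return False
            raise UniqueViolation(tup.key)
        self.entries.setdefault(tup.key, []).append(tup)
        return True


def simulate(recheck_liveness):
    """Replay the race: the updater indexes the new version first."""
    index = ToyUniqueIndex()
    old = ToyTuple(tid=1, key="a")
    new = ToyTuple(tid=2, key="a")
    old.live = False          # concurrent update supersedes the old version
    index.insert(new)         # the updater gets into the index first
    try:
        inserted = index.insert(old, recheck_liveness=recheck_liveness)
    except UniqueViolation:
        return "unique violation"
    return "inserted" if inserted else "skipped"
```

In this model, simulate(False) reproduces the spurious uniqueness error, while
simulate(True) skips the dead tuple instead of failing.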

> Unless someone's got a brilliant idea, my recommendation at this point
> is that we restrict the patch to building only non-unique indexes.
> Per discussion upthread, that's still a useful feature.  We can revisit
> the problem of doing uniqueness checks correctly in some future release,
> but time to work on it for 8.2 is running out fast.

I agree. There's other functionality in this area that would be nice too, such
as REINDEX CONCURRENTLY and deleting the invalid index in case of error. Once
one chunk gets into CVS, it becomes easier to extend it without building up a
bigger divergence to merge in one day.

I was also considering going ahead and implementing Hannu's ALTER INDEX SET
UNIQUE. We would have the option of making CREATE UNIQUE INDEX CONCURRENTLY
automatically invoke that code afterwards. It would require a second waiting
phase, though, and a full index scan, so it would be a much slower option than
handling it in the index build. On the plus side it would never have to lock
anything -- locking things inside a command explicitly billed as concurrent
strikes me as undesirable.
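
As a rough illustration of what the full index scan in such a command might
check, here is a toy Python sketch. It is not PostgreSQL code: ALTER INDEX SET
UNIQUE is only a proposal here, find_duplicate_key is a hypothetical helper,
and the index is modelled as (key, is_live) pairs already in key order, with
liveness assumed to be known up front.

```python
def find_duplicate_key(entries):
    """Scan (key, is_live) pairs in index order.

    Returns the first key that has more than one live entry, or None if
    the index could be marked unique under this toy model.
    """
    prev_key = object()   # sentinel that compares unequal to any real key
    live_count = 0
    for key, is_live in entries:
        if key != prev_key:
            # New run of equal keys starts here.
            prev_key, live_count = key, 0
        if is_live:
            live_count += 1
            if live_count > 1:
                return key
    return None
```

Dead entries with a duplicate key are tolerated; only two live entries with the
same key prevent the index from being treated as unique.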

--  Gregory Stark EnterpriseDB          http://www.enterprisedb.com


