Re: error: could not find pg_class tuple for index 2662 - Mailing list pgsql-hackers

From daveg
Subject Re: error: could not find pg_class tuple for index 2662
Date
Msg-id 20110804195206.GH14353@sonic.net
Whole thread Raw
In response to Re: error: could not find pg_class tuple for index 2662  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: error: could not find pg_class tuple for index 2662
List pgsql-hackers
On Thu, Aug 04, 2011 at 12:28:31PM -0400, Tom Lane wrote:
> daveg <daveg@sonic.net> writes:
> > Summary: the failing process reads 0 rows from 0 blocks from the OLD
> > relfilenode. 
> 
> Hmm.  This seems to mean that we're somehow missing a relation mapping
> invalidation message, or perhaps not processing it soon enough during
> some complex set of invalidations.  I did some testing with that in mind
> but couldn't reproduce the failure.  It'd be awfully nice to get a look
> at the call stack when this happens for you ... what OS are you running?

To recap, a few observations:

When it happens the victim has recently been waiting on a lock for a
several seconds.

We create a lot of temp tables, hundreds of thousands a day.

There are catalog vacuum fulls and reindexes running on 30 odd other databases
at the same time. The script estimates the amount of bloat on each table and
index and chooses either reindex on specific indexes or vacuum full as needed.

This is a 32 core (64 with hype threading) 512GB host with several hundred
connections

We are seeing "cannot read' and 'cannot open' errors too that would be
consistant with trying to use a vanished file.

-dg

-- 
David Gould       daveg@sonic.net      510 536 1443    510 282 0869
If simplicity worked, the world would be overrun with insects.


pgsql-hackers by date:

Previous
From: Bruce Momjian
Date:
Subject: Reduce WAL logging of INSERT SELECT
Next
From: Joe Conway
Date:
Subject: Re: possible new feature: asynchronous sql or something like oracles dbms_job.submit