Home > mailing lists

Re: Block-level CRC checks - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: Block-level CRC checks
Date	December 1, 2009 13:26:37
Msg-id	603c8f070912010626i1342f23fsa18bb696b0cfe2c0@mail.gmail.com Whole thread Raw
In response to	Re: Block-level CRC checks (Heikki Linnakangas <heikki.linnakangas@enterprisedb.com>)
Responses	Re: Block-level CRC checks (Heikki Linnakangas <heikki.linnakangas@enterprisedb.com>)
List	pgsql-hackers

Tree view

On Tue, Dec 1, 2009 at 8:30 AM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
> Bruce Momjian wrote:
>> What might be interesting is to report CRC mismatches if the database
>> was shut down cleanly previously;  I think in those cases we shouldn't
>> have torn pages.
>
> Unfortunately that's not true. You can crash, leading to a torn page,
> and then start up the database and shut it down cleanly. The torn page
> is still there, even though the last shutdown was a clean one.

Thinking through this, as I understand it, in order to prevent this
problem, you'd need to be able to predict at recovery time which pages
might have been torn by the unclean shutdown.  In order to do that,
you'd need to know which pages were waiting to be written to disk at
the time of the shutdown.  For ordinary page modifications, that's not
a problem, because there will be WAL records for those pages that need
to be replayed, and we could recompute the CRC at the same time.  But
for hint bit changes, there's no persistent state that would tell us
which hint bits were in the midst of being flipped when the system
went down, so the only way to make sure all the CRCs are correct would
be to rescan every page in the entire cluster and recompute every CRC.

Is that right?

...Robert

pgsql-hackers by date:

From: Aidan Van Dyk
Date: 01 December 2009, 13:26:35
Subject: Re: Block-level CRC checks

From: Robert Haas
Date: 01 December 2009, 13:35:34
Subject: Re: CommitFest status/management

Re: Block-level CRC checks - Mailing list pgsql-hackers

Previous

Next