Re: Block-level CRC checks - Mailing list pgsql-hackers

From Andrew Chernow
Subject Re: Block-level CRC checks
Date
Msg-id 48E2D05E.5030707@esilo.com
In response to Re: Block-level CRC checks  (Joshua Drake <jd@commandprompt.com>)
List pgsql-hackers
Joshua Drake wrote:
> During a 41-month period we observed more than 400,000 instances of
> checksum mismatches, 8% of which were discovered during RAID
> reconstruction, creating the possibility of real data loss.
> 
> They also have a wonderful term they mention, "Silent Data corruptions".

Exactly!
In my experience, the only assumption to be made about storage is that it can 
and will fail ... frequently!  It is unreliable (not to mention slooow) and 
should not be trusted, regardless of the price tag or brand.

Block-level CRC checks could help detect:

- fs corruption
- vfs bug
- raid controller firmware bug
- bad disk sector
- power crash
- weird martian-like raid rebuilds

That said, this idea won't prevent anything.  Everything would still sinisterly 
fail on you.  The difference is that it would no longer fail silently.
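
To make the "no more silence" point concrete, here is a rough sketch of the 
detect-on-read idea.  It is not the proposed patch and not how PostgreSQL lays 
out pages: the names seal_block and open_block, the 4-byte big-endian prefix, 
and the choice of zlib's CRC-32 are all assumptions made for illustration.

```python
import zlib

CRC_BYTES = 4  # assumed: CRC-32 stored as a 4-byte prefix (illustrative layout)


def seal_block(payload: bytes) -> bytes:
    """Return the payload prefixed with its CRC-32 (hypothetical helper)."""
    crc = zlib.crc32(payload) & 0xFFFFFFFF
    return crc.to_bytes(CRC_BYTES, "big") + payload


def open_block(block: bytes) -> bytes:
    """Verify the stored CRC-32 and return the payload.

    Raises ValueError on a mismatch.  The damage is only reported,
    not repaired.
    """
    stored = int.from_bytes(block[:CRC_BYTES], "big")
    payload = block[CRC_BYTES:]
    if (zlib.crc32(payload) & 0xFFFFFFFF) != stored:
        raise ValueError("block checksum mismatch")
    return payload


if __name__ == "__main__":
    block = seal_block(b"some page contents" * 100)

    # Simulate a single flipped bit somewhere in the stored block.
    damaged = bytearray(block)
    damaged[100] ^= 0x01

    try:
        open_block(bytes(damaged))
    except ValueError as exc:
        print("detected:", exc)
```

The check cannot say which layer caused the damage, and it cannot undo it.  
All it does is turn a silent failure into a loud one at read time.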

-- 
Andrew Chernow
eSilo, LLC
every bit counts
http://www.esilo.com/

