On Tue, 2009-12-01 at 16:40 +0200, Heikki Linnakangas wrote:
> It's not hard to imagine that when a hardware glitch happens
> causing corruption, it also causes the system to crash. Recalculating
> the CRCs after crash would mask the corruption.
They are already masked from us, so continuing to mask those errors
would not put us in a worse position.
If we are saying that 99% of page corruptions are caused at crash time
because of torn pages on hint bits, then only WAL logging can help us
find the 1%. I'm not convinced that is an accurate or safe assumption
and I'd at least like to see LOG entries showing what happened.
ISTM we could go for two levels of protection. CRC checks and scanner
for Level 1 protection, then full WAL logging for Level 2 protection.
-- Simon Riggs www.2ndQuadrant.com