Re: Behavior for crash recovery when it detects a corrupt WAL record - Mailing list pgsql-hackers

From Heikki Linnakangas
Subject Re: Behavior for crash recovery when it detects a corrupt WAL record
Date
Msg-id 50742FB9.7040906@vmware.com
Whole thread Raw
In response to Behavior for crash recovery when it detects a corrupt WAL record  (Amit Kapila <amit.kapila@huawei.com>)
Responses Re: Behavior for crash recovery when it detects a corrupt WAL record  (Amit Kapila <amit.kapila@huawei.com>)
List pgsql-hackers
On 09.10.2012 16:42, Amit Kapila wrote:
> I have observed that currently during recovery, while it applies the WAL
> records even if it detects that there is a corrupt record
>
> by crc validation, it proceeds.
>
> Basically ReadRecord(), returns NULL in such cases which makes the behavior
> same as it has reached end of WAL.
>
> After that server get started and user can perform operations normally.

Yeah. We rely on the CRC to detect end of WAL during recovery. If the 
system crashes while the WAL is being flushed to disk, it's normal that 
there's a corrupt (ie. partially written) record at the end of the WAL. 
This is a common technique used by pretty much every system with a 
transaction log / journal.

The other option would be to perform two fsyncs for every commit; one to 
flush the WAL to disk, and another to update some global pointer to 
point to the end of valid WAL (e.g in pg_control).

- Heikki



pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Truncate if exists
Next
From: Tom Lane
Date:
Subject: Re: Truncate if exists