Home > mailing lists

Re: [HACKERS] Checksums by default? - Mailing list pgsql-hackers

From	Ants Aasma
Subject	Re: [HACKERS] Checksums by default?
Date	January 24, 2017 19:06:10
Msg-id	CA+CSw_uS7mGqkDjKgyV5i54gfezU4h3F2d-AriBN7oS3QjoezQ@mail.gmail.com Whole thread Raw
In response to	Re: [HACKERS] Checksums by default? (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: [HACKERS] Checksums by default? (Stephen Frost <sfrost@snowman.net>)
List	pgsql-hackers

Tree view

On Tue, Jan 24, 2017 at 4:07 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Peter Geoghegan <pg@heroku.com> writes:
>> I thought that checksums went in in part because we thought that there
>> was some chance that they'd find bugs in Postgres.
>
> Not really.  AFAICS the only point is to catch storage-system malfeasance.

This matches my understanding. Actual physical media errors are caught
by lower level checksums/error correction codes, and memory errors are
caught by ECC RAM. Checksums do very little for PostgreSQL bugs, which
leaves only filesystem and storage firmware bugs. However the latter
are still reasonably common faults. I have seen multiple cases where,
after reviewing the corruption with a hex editor, the only reasonable
conclusion was a bug in the storage system. Data shifted around by
non-page size amounts, non-page aligned extents that are zeroed out,
etc. Unfortunately none of those customers had checksums turned on at
the time. I feel that reacting to such errors with a non-cryptic and
easily debuggable checksum error is much better than erroring out with
huge memory allocations, crashing or returning bogus data. Timely
reaction to data corruption is really important for minimizing data
loss.

Regards,
Ants Aasma

pgsql-hackers by date:

From: David Steele
Date: 24 January 2017, 19:03:24
Subject: Re: [HACKERS] patch proposal

From: Tom Lane
Date: 24 January 2017, 19:12:07
Subject: Re: [HACKERS] Failure in commit_ts tap tests

Re: [HACKERS] Checksums by default? - Mailing list pgsql-hackers

Previous

Next