Re: fsync error handling in pg_receivewal, pg_recvlogical - Mailing list pgsql-hackers

From Sehrope Sarkuni
Subject Re: fsync error handling in pg_receivewal, pg_recvlogical
Date
Msg-id CAH7T-arrc8KQyKUQYpGYL6XaDH-oA7x=+nCUJDZbgfq6Y7P9BQ@mail.gmail.com
Whole thread Raw
In response to Re: fsync error handling in pg_receivewal, pg_recvlogical  (Michael Paquier <michael@paquier.xyz>)
Responses Re: fsync error handling in pg_receivewal, pg_recvlogical  (Sehrope Sarkuni <sehrope@jackdb.com>)
Re: fsync error handling in pg_receivewal, pg_recvlogical  (Peter Eisentraut <peter.eisentraut@2ndquadrant.com>)
List pgsql-hackers
Hi,

Tried out this patch and it applies, compiles, and passes check-world. Also flipped things around in pg_recvlogical.c to exit-on-success to ensure it's actually being called and that worked too. Outside of a more complicated harness that simulates fsync errors not sure how else to test this further.

I did some searching and found a FUSE based on that looks interesting: CharybdeFS[1]. Rather than being fixed at mount time, it has a client/server interface so you can change the handling of syscalls on the fly[2]. For example you can error out fsync calls halfway through a test rather than always or randomly. Haven't tried it out but leaving it here as it seems relevant.

[1]: https://github.com/scylladb/charybdefs
[2]: https://www.scylladb.com/2016/05/02/fault-injection-filesystem-cookbook/

On Wed, Jun 26, 2019 at 12:11 AM Michael Paquier <michael@paquier.xyz> wrote:
Why using a different error code.  Using EXIT_FAILURE is a more common
practice in the in-core binaries.  The patch looks fine to me except
that, that's a good first cut.

An error code specific to fsync issues could help with tests as the harness could check it to ensure things died for the right reasons. With a generic "messed up fsync" harness you might even be able to run some existing tests that would otherwise pass and check for the fsync-specific exit code.

Regards,
-- Sehrope Sarkuni
Founder & CEO | JackDB, Inc. | https://www.jackdb.com/

 

pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Testing LISTEN/NOTIFY more effectively
Next
From: Sehrope Sarkuni
Date:
Subject: Re: fsync error handling in pg_receivewal, pg_recvlogical