Re: Anti-critical-section assertion failure in mcxt.c reached by walsender - Mailing list pgsql-hackers

From Noah Misch
Subject Re: Anti-critical-section assertion failure in mcxt.c reached by walsender
Date
Msg-id 20210507022831.GA2991266@rfd.leadboat.com
Whole thread Raw
In response to Re: Anti-critical-section assertion failure in mcxt.c reached by walsender  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
On Thu, May 06, 2021 at 09:43:32PM -0400, Tom Lane wrote:
> 2. We evidently need to put a bit more effort into this error
> reporting logic.  More generally, I wonder how we could audit
> the code for similar hazards elsewhere, because I bet there are
> some.  (Or ... could it be sane to run functions included in
> the ereport's arguments in ErrorContext?)

Seems reasonable.  I don't have good ideas for auditing; just making the
palloc work may be easier.

> 3. One might wonder why we're getting an fdatasync failure at
> all, when thorntail is configured to run with fsync = off.
> The answer to that one is that 008_fsm_truncation.pl takes it
> upon itself to force fsync = on, overriding the express wishes
> of the buildfarm owner, not to mention general project policy.
> AFAICT that was added with little if any thought in the initial
> creation of 008_fsm_truncation.pl, and I think we should take
> it out.  There's certainly no visible reason for this one
> TAP script to be running with fsync on when no others do.

I've caught that one test taking ~10min due to its fsync use.  If fsync=on
isn't important to the test, +1 for removing it.



pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Anti-critical-section assertion failure in mcxt.c reached by walsender
Next
From: Amit Kapila
Date:
Subject: Re: Replication slot stats misgivings