Re: Race conditions with checkpointer and shutdown - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Race conditions with checkpointer and shutdown
Date
Msg-id 6528.1555645687@sss.pgh.pa.us
Whole thread Raw
In response to Re: Race conditions with checkpointer and shutdown  (Thomas Munro <thomas.munro@gmail.com>)
Responses Re: Race conditions with checkpointer and shutdown
List pgsql-hackers
Thomas Munro <thomas.munro@gmail.com> writes:
> On Fri, Apr 19, 2019 at 10:22 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Maybe what we should be looking for is "why doesn't the walreceiver
>> shut down"?  But the dragonet log you quote above shows the walreceiver
>> exiting, or at least starting to exit.  Tis a puzzlement.

> ... Is there some way that the exit code could hang
> *after* that due to corruption of libc resources (FILE streams,
> malloc, ...)?  It doesn't seem likely to me (we'd hopefully see some
> more clues) but I thought I'd mention the idea.

I agree it's not likely ... but that's part of the reason I was thinking
about adding some postmaster logging.  Whatever we're chasing here is
"not likely", per the observed buildfarm failure rate.

            regards, tom lane



pgsql-hackers by date:

Previous
From: Thomas Munro
Date:
Subject: Re: Race conditions with checkpointer and shutdown
Next
From: Amit Langote
Date:
Subject: Re: bug in update tuple routing with foreign partitions