Home > mailing lists

Re: when the startup process doesn't (logging startup delays) - Mailing list pgsql-hackers

From	Simon Riggs
Subject	Re: when the startup process doesn't (logging startup delays)
Date	November 14, 2022 12:37:27
Msg-id	CANbhV-FCvygqcWtQpXAVmrktqBGVVPxntfV4=H=gF3WiXaEiyA@mail.gmail.com Whole thread Raw
In response to	Re: when the startup process doesn't (logging startup delays) (Bharath Rupireddy <bharath.rupireddyforpostgres@gmail.com>)
Responses	Re: when the startup process doesn't (logging startup delays)
List	pgsql-hackers

Tree view

On Tue, 8 Nov 2022 at 12:33, Bharath Rupireddy
<bharath.rupireddyforpostgres@gmail.com> wrote:
>
> On Tue, Nov 8, 2022 at 4:35 PM Thomas Munro <thomas.munro@gmail.com> wrote:
> >
> > On Sat, Oct 30, 2021 at 7:44 AM Robert Haas <robertmhaas@gmail.com> wrote:
> > > Committed.
> >
> > Is it expected that an otherwise idle standby's recovery process
> > receives SIGALRM every N seconds, or should the timer be canceled at
> > that point, as there is no further progress to report?
>
> Nice catch. Yeah, that seems unnecessary, see the below standby logs.
> I think we need to disable_timeout(STARTUP_PROGRESS_TIMEOUT, false);,
> something like the attached? I think there'll be no issue with the
> patch since the StandbyMode gets reset only at the end of recovery (in
> FinishWalRecovery()) but it can very well be set during recovery (in
> ReadRecord()). Note that I also added an assertion in
> has_startup_progress_timeout_expired(), just in case.
>
> 2022-11-08 11:28:23.563 UTC [980909] LOG:  SIGALRM handle_sig_alarm received
> 2022-11-08 11:28:23.563 UTC [980909] LOG:
> startup_progress_timeout_handler called
> 2022-11-08 11:28:33.563 UTC [980909] LOG:  SIGALRM handle_sig_alarm received
> 2022-11-08 11:28:33.563 UTC [980909] LOG:
> startup_progress_timeout_handler called
> 2022-11-08 11:28:43.563 UTC [980909] LOG:  SIGALRM handle_sig_alarm received
> 2022-11-08 11:28:43.563 UTC [980909] LOG:
> startup_progress_timeout_handler called
> 2022-11-08 11:28:53.563 UTC [980909] LOG:  SIGALRM handle_sig_alarm received
> 2022-11-08 11:28:53.563 UTC [980909] LOG:
> startup_progress_timeout_handler called
>
> Whilte at it, I noticed that we report redo progress for PITR, but we
> don't report when standby enters archive recovery mode, say due to a
> failure in the connection to primary or after the promote signal is
> found. Isn't it useful to report in this case as well to know the
> recovery progress?

I think your patch disables progress too early, effectively turning
off the standby progress feature. The purpose was to report on things
that take long periods during recovery, not just prior to recovery.

I would advocate that we disable progress only while waiting, as I've done here:
https://www.postgresql.org/message-id/CANbhV-GcWjZ2cmj0uCbZDWQUHnneMi_4EfY3dVWq0-yD5o7Ccg%40mail.gmail.com

--
Simon Riggs                http://www.EnterpriseDB.com/

pgsql-hackers by date:

From: Daniel Gustafsson
Date: 14 November 2022, 12:36:56
Subject: Re: pg_basebackup's --gzip switch misbehaves

From: John Naylor
Date: 14 November 2022, 12:59:51
Subject: Re: [PoC] Improve dead tuple storage for lazy vacuum

Re: when the startup process doesn't (logging startup delays) - Mailing list pgsql-hackers

Previous

Next