Re: bad logging around broken restore_command - Mailing list pgsql-hackers

From Fujii Masao
Subject Re: bad logging around broken restore_command
Date
Msg-id 7feeacc6-9dfb-0b90-23f9-f836bc5b52ad@oss.nttdata.com
Whole thread Raw
In response to Re: bad logging around broken restore_command  (Kyotaro Horiguchi <horikyota.ntt@gmail.com>)
Responses Re: bad logging around broken restore_command  (Pavel Borisov <pashkin.elfe@gmail.com>)
List pgsql-hackers

On 2020/03/10 11:47, Kyotaro Horiguchi wrote:
> At Thu, 6 Feb 2020 23:23:42 +0900, Fujii Masao <masao.fujii@oss.nttdata.com> wrote in
>> On 2020/02/06 1:10, Jeff Janes wrote:
>>> If the restore command claims to have succeeded, but fails to have created
>>> a file with the right name (due to typos or escaping or quoting issues, for
>>> example), no complaint is issued at the time, and it then fails later with
>>> a relatively uninformative error message like "could not locate required
>>> checkpoint record".
> ...
>>> I don't see why ENOENT is thought to deserve the silent treatment.  It
>>> seems weird that success gets logged ("restored log file \"%s\" from
>>> archive"), but one particular type of unexpected failure does not.
>>
>> Agreed.
> 
> In the first place it is not perfectly silent and that problem cannot
> happen.  In the ENOENT case, the function reports "could not restore
> file \"%s\" from archive: %s", but with DEBUG2 then returns false, and
> the callers treat the failure properly.

Yes.

>> I've attached a patch which emits a LOG message for ENOENT.
>>
>> Isn't it better to use "could not stat file" message even in ENOENT
>> case?
>> If yes, something like message that you used in the patch should be
>> logged as DETAIL or HINT message. So, what about the attached patch?
> 
> If you want to see some log messages in the case, it is sufficient to
> raise the loglevel of the existing message from DEBUG2 to LOG.

Isn't it too noisy to log every time when we could not restore
the archived file? In archive recovery case, it's common to fail
to restore archive files and try to replay WAL files in pg_wal.

Regards,

-- 
Fujii Masao
NTT DATA CORPORATION
Advanced Platform Technology Group
Research and Development Headquarters



pgsql-hackers by date:

Previous
From: Amit Kapila
Date:
Subject: Re: error context for vacuum to include block number
Next
From: Justin Pryzby
Date:
Subject: Re: error context for vacuum to include block number