Re: Change log level for notifying hot standby is waiting non-overflowed snapshot - Mailing list pgsql-hackers

From torikoshia
Subject Re: Change log level for notifying hot standby is waiting non-overflowed snapshot
Date
Msg-id a5f271cd82585772becf8967d82c85eb@oss.nttdata.com
Whole thread Raw
In response to Re: Change log level for notifying hot standby is waiting non-overflowed snapshot  (Fujii Masao <masao.fujii@oss.nttdata.com>)
Responses Re: Change log level for notifying hot standby is waiting non-overflowed snapshot
List pgsql-hackers
Hi,

I had another off-list discussion with Fujii-san, and according to the 
following manual[1], it seems that a transaction with an overflowed 
subtransaction is already considered inconsistent:

   Reaching a consistent state can also be delayed in the presence of 
both of these conditions:

   - A write transaction has more than 64 subtransactions
   - Very long-lived write transactions

IIUC, the manual suggests that both conditions must be met -- recovery 
reaching at least minRecoveryPoint and no overflowed subtransactions —- 
for the standby to be considered consistent.

OTOH, the following log message is emitted even when subtransactions 
have overflowed, which appears to contradict the definition of 
consistency mentioned above:

   LOG:  consistent recovery state reached

This log message is triggered when recovery progresses beyond 
minRecoveryPoint(according to CheckRecoveryConsistency()).
However, since this state does not satisfy 'consistency' defined in the 
manual, I think it would be more accurate to log that it has merely 
reached the "minimum recovery point".
Furthermore, it may be better to emit the above log message only when 
recovery has progressed beyond minRecoveryPoint and there are no 
overflowed subtransactions.

Attached patch does this.

Additionally, renaming variables such as reachedConsistency in 
CheckRecoveryConsistency might also be appropriate.
However, in the attached patch, I have left them unchanged for now.


On 2025-03-25 00:55, Fujii Masao wrote:
> -                  case CAC_NOTCONSISTENT:
> +                  case CAC_NOTCONSISTENT_OR_OVERFLOWED:
> 
> This new name seems a bit too long. I'm OK to leave the name as it is.
> Or, something like CAC_NOTHOTSTANDBY seems simpler and better to me.

Beyond just the length issue, given the understanding outlined above, I 
now think CAC_NOTCONSISTENT does not actually need to be changed.


> In high-availability.sgml, the "Administrator's Overview" section 
> already
> describes the conditions for accepting hot standby connections.
> This section should also be updated accordingly.

Agreed.
I have updated this section to mention that the resolution is to close 
the problematic transaction.
OTOH the changes made in v2 patch seem unnecessary, since the concept of 
'consistent' is already explained in the "Administrator's Overview."


-                             errdetail("Consistent recovery state has not been yet 
reached.")));
+                             errdetail("Consistent recovery state has not been yet reached, 
or snappshot is pending because subtransaction is overflowed."),

Given the above understanding, "or" is not appropriate in this context, 
so I left this message unchanged.
Instead, I have added an errhint. The phrasing in the hint message 
aligns with the manual, allowing users to search for this hint and find 
the newly added resolution instructions.


What do you think?

[1] https://www.postgresql.org/docs/devel/hot-standby.html

-- 
Regards,

--
Atsushi Torikoshi
Seconded from NTT DATA GROUP CORPORATION to SRA OSS K.K.
Attachment

pgsql-hackers by date:

Previous
From: wenhui qiu
Date:
Subject: Re: POC: make mxidoff 64 bits
Next
From: Ajin Cherian
Date:
Subject: Re: Historic snapshot doesn't track txns committed in BUILDING_SNAPSHOT state