Thread: Re: BF member drongo doesn't like 035_standby_logical_decoding.pl

Re: BF member drongo doesn't like 035_standby_logical_decoding.pl

From
Bertrand Drouvot
Date:
Hi,

On Fri, Jan 24, 2025 at 11:42:15AM -0500, Tom Lane wrote:
> I realized just now that drongo has been intermittently failing like this:

Thanks for the report!

> PS: FTR, the hits I got on this in the past 90 days were
> 
>  sysname |    branch     |      snapshot       |     stage     |
l                                                     
 
>
---------+---------------+---------------------+---------------+------------------------------------------------------------------------------------------------------------
>  drongo  | HEAD          | 2024-10-29 10:52:07 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
>  drongo  | REL_16_STABLE | 2024-10-31 08:07:11 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_17_STABLE | 2024-11-05 11:11:28 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_16_STABLE | 2024-11-22 08:58:52 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
>  drongo  | HEAD          | 2024-11-29 22:23:47 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | HEAD          | 2024-12-04 20:33:42 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_17_STABLE | 2024-12-20 17:00:28 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
>  drongo  | HEAD          | 2024-12-24 21:07:27 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
>  drongo  | REL_16_STABLE | 2025-01-10 15:48:16 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_17_STABLE | 2025-01-10 18:05:18 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | HEAD          | 2025-01-11 20:40:09 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_16_STABLE | 2025-01-12 14:06:11 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
>  drongo  | REL_16_STABLE | 2025-01-13 07:19:22 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_16_STABLE | 2025-01-14 10:48:16 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | HEAD          | 2025-01-20 20:11:00 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_16_STABLE | 2025-01-24 01:54:55 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
>  drongo  | REL_17_STABLE | 2025-01-24 04:21:44 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 


Out of curiosity, how did you generate this output? (that looks wery useful) 

Regards,

-- 
Bertrand Drouvot
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com



Bertrand Drouvot <bertranddrouvot.pg@gmail.com> writes:
> On Fri, Jan 24, 2025 at 11:42:15AM -0500, Tom Lane wrote:
>> PS: FTR, the hits I got on this in the past 90 days were
>>
>> sysname |    branch     |      snapshot       |     stage     |
l                                                      
>>
---------+---------------+---------------------+---------------+------------------------------------------------------------------------------------------------------------
>> drongo  | HEAD          | 2024-10-29 10:52:07 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r 
>> drongo  | REL_16_STABLE | 2024-10-31 08:07:11 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r 
>> ...

> Out of curiosity, how did you generate this output? (that looks wery useful)

That is from a SQL query on the database that stores the last few
months' worth of buildfarm logs.  I believe current policy is that
only committers will be granted access to that machine, partly
because of security concerns and partly because of resource
limitations.  But in the interests of transparency, what I did was

pgbfprod=> select * from
(select sysname, s.branch, snapshot, stage,
    unnest(string_to_array(log_text, E'\n')) as l
    from build_status s join build_status_log l using (sysname, snapshot)
    where stage != 'OK' and snapshot > localtimestamp - '3 mons'::interval) ss
where l like '%Failed test%activeslot slot invalidation%'
\g outfile

(Other than the specific string-to-search-for, this is a canned
query pattern that I use a lot.  It scans all failed tests' logs
in the past 3 months.)  I did some manual duplicate-removal
and hand filtering afterwards, IIRC.

            regards, tom lane



Re: BF member drongo doesn't like 035_standby_logical_decoding.pl

From
Bertrand Drouvot
Date:
Hi,

On Mon, Jan 27, 2025 at 09:28:57AM -0500, Tom Lane wrote:
> Bertrand Drouvot <bertranddrouvot.pg@gmail.com> writes:
> > On Fri, Jan 24, 2025 at 11:42:15AM -0500, Tom Lane wrote:
> >> PS: FTR, the hits I got on this in the past 90 days were
> >> 
> >> sysname |    branch     |      snapshot       |     stage     |
l                                                      
 
> >>
---------+---------------+---------------------+---------------+------------------------------------------------------------------------------------------------------------
> >> drongo  | HEAD          | 2024-10-29 10:52:07 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_class'\r\r
 
> >> drongo  | REL_16_STABLE | 2024-10-31 08:07:11 | recoveryCheck | #   Failed test 'activeslot slot invalidation is
loggedwith vacuum on pg_authid'\r\r
 
> >> ...
> 
> > Out of curiosity, how did you generate this output? (that looks wery useful) 
> 
> That is from a SQL query on the database that stores the last few
> months' worth of buildfarm logs.  I believe current policy is that
> only committers will be granted access to that machine, partly
> because of security concerns and partly because of resource
> limitations.

Makes sense.

> But in the interests of transparency, what I did was
> 
> pgbfprod=> select * from
> (select sysname, s.branch, snapshot, stage,
>     unnest(string_to_array(log_text, E'\n')) as l
>     from build_status s join build_status_log l using (sysname, snapshot)
>     where stage != 'OK' and snapshot > localtimestamp - '3 mons'::interval) ss
> where l like '%Failed test%activeslot slot invalidation%'
> \g outfile
> 
> (Other than the specific string-to-search-for, this is a canned
> query pattern that I use a lot.  It scans all failed tests' logs
> in the past 3 months.)  I did some manual duplicate-removal
> and hand filtering afterwards, IIRC.
> 

Thanks for the explanation! Having access to this database looks like a big 
time saver in some situations.

Regards,

-- 
Bertrand Drouvot
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com