Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273] - Mailing list pgsql-general

From Matthias Apitz
Subject Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]
Date
Msg-id Zf6jpo0Zb__PrxIk@c720-1400094
In response to Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]  (Ron Johnson <ronljohnsonjr@gmail.com>)
List pgsql-general
On Friday, March 22, 2024 at 01:31:43 PM -0400, Ron Johnson wrote:

> On Fri, Mar 22, 2024 at 1:27 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> 
> > Matthias Apitz <guru@unixarea.de> writes:
> > > We have a PostgreSQL 15.1 server in production at a customer for some
> > > weeks (migrated from an older version) on SuSE SLES 15.
> >
> > > The customer is facing machine locks and before the Linux server does
> > > not respond any more (not even on SSH, only power-cycle reset helps to
> > > get it back), short before the fault a lot of messages are in
> > > /var/log/messages of the content:
> >
> > > # grep watchdog: /var/log/messages
> > > ...
> > > 2024-03-22T13:11:32.056154+01:00 sunrise kernel: [327844.313048][   C25] watchdog: BUG: soft lockup - CPU#25 stuck for 3069s! [migration/25:166]
> > > 2024-03-22T13:12:28.056244+01:00 sunrise kernel: [327900.310267][   C16] watchdog: BUG: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]
> > > 2024-03-22T13:12:28.056340+01:00 sunrise kernel: [327900.311052][   C25] watchdog: BUG: soft lockup - CPU#25 stuck for 3121s! [migration/25:166]
> >
> > Sounds like failing hardware to me :-(
> 
> 
> Updating to 15.6 would rule out any bugs squashed in the last 15 months.

Yesterday the message appeared more than 300 times:

sunrise:~ # xz -dc /var/log/messages-20240323.xz | grep lockup | wc -l
323

Since the WAL copy and backup have been working correctly, the message
has not appeared anymore.
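
A plain count of "lockup" lines does not show which CPUs or tasks were
affected. As a rough sketch only (the function name lockups_by_cpu is
made up for illustration, and it assumes each watchdog message sits on a
single line in the log file, in the format quoted above), the lines could
be grouped per CPU and task like this:

```shell
# Hypothetical helper: summarize soft-lockup messages per CPU and task.
# Reads syslog lines on stdin and prints "<count> CPU#N [task:pid]",
# most frequent combination first.
lockups_by_cpu() {
    grep 'soft lockup' |
        sed -E 's/.*soft lockup - (CPU#[0-9]+) stuck for [0-9]+s! (\[[^]]*\]).*/\1 \2/' |
        sort | uniq -c | sort -rn
}

# Example call, using the rotated log file mentioned in this message:
# xz -dc /var/log/messages-20240323.xz | lockups_by_cpu
```

If one CPU or one task dominates the output, that may help to narrow down
whether the lockups follow a specific process or a specific core.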

    matthias

-- 
Matthias Apitz, ✉ guru@unixarea.de, http://www.unixarea.de/ +49-176-38902045
Public GnuPG key: http://www.unixarea.de/key.pub


