Re: Stuck LSI 9650SE-12 RAID Controller - Mailing list pgsql-admin

From Craig James
Subject Re: Stuck LSI 9650SE-12 RAID Controller
Date
Msg-id CAFwQ8rc_yg14yku_Oid+pDC8UkRXcYn4=YOONjaX6GJre-M3yQ@mail.gmail.com
Whole thread Raw
In response to Stuck LSI 9650SE-12 RAID Controller  (Craig James <cjames@emolecules.com>)
List pgsql-admin
On Tue, Aug 5, 2014 at 9:00 AM, Craig James <cjames@emolecules.com> wrote:
Has anyone seen anything like this?

Our LSI 9650SE-12 RAID Controller dropped the main Postgres disk offline ... it just disappeared as though the disk wasn't there.  It was an 8-disk RAID10 unit. The other unit (RAID1 for Linux & pg_xlog) was still functional.

Using tw_cli, it showed the array as "DEGRADED" and claimed to be verifying it. One disk in the array was "DEGRADED". There was no /dev entry for the device; Linux couldn't see it at all.

Aha. I found this. Check out the first item in the "bugs" section: "RAID-10 arrays going Inoperable/Verifying Mode (SCR-2278)".

A lesson ... keep a device's firmware up to date.

Craig

pgsql-admin by date:

Previous
From: Scott Whitney
Date:
Subject: Re: Stuck LSI 9650SE-12 RAID Controller
Next
From: Murthy Nunna
Date:
Subject: How to determine replication lag