Re: Stuck LSI 9650SE-12 RAID Controller

From: Craig James <cjames(at)emolecules(dot)com>
To: "pgsql-admin(at)postgresql(dot)org" <pgsql-admin(at)postgresql(dot)org>
Subject: Re: Stuck LSI 9650SE-12 RAID Controller
Date: 2014-08-05 17:11:31
Message-ID: CAFwQ8rc_yg14yku_Oid+pDC8UkRXcYn4=YOONjaX6GJre-M3yQ@mail.gmail.com
Lists: pgsql-admin

On Tue, Aug 5, 2014 at 9:00 AM, Craig James <cjames(at)emolecules(dot)com> wrote:

> Has anyone seen anything like this?
>
> Our LSI 9650SE-12 RAID Controller dropped the main Postgres disk offline
> ... it just disappeared as though the disk wasn't there. It was an 8-disk
> RAID10 unit. The other unit (RAID1 for Linux & pg_xlog) was
> still functional.
>
> Using tw_cli, it showed the array as "DEGRADED" and claimed to be
> verifying it. One disk in the array was "DEGRADED". There was no /dev entry
> for the device; Linux couldn't see it at all.
>

Aha. I found these release notes (9.5.5_Release_Notes.pdf):
<http://www.lsi.com/downloads/Public/SATA/SATA%20Common%20Files/9.5.5_Release_Notes.pdf>.
Check out the first item in the "bugs" section: "RAID-10 arrays going
Inoperable/Verifying Mode (SCR-2278)".

The lesson: keep a device's firmware up to date.

Craig
