Drive is failing but LSI MegaRAID controller does not detect it
Solution 1:
To see the RAID controller logs, run this command:
/opt/MegaRAID/MegaCli/MegaCli -AdpEventLog -GetLatest 1000 -f events.log -aALL
The events.log file contained entries like these which indicates a problem with the disk:
Code: 0x0000006e
Class: 0
Locale: 0x02
Event Description: Corrected medium error during recovery on PD 07(e0xfc/s2) at f04cb53
Event Data:
===========
Device ID: 7
Enclosure Index: 252
Slot Number: 2
LBA: 251972435
seqNum: 0x00004f65
Time: Wed Mar 6 05:36:48 2013
Code: 0x00000071
Class: 0
Locale: 0x02
Event Description: Unexpected sense: PD 07(e0xfc/s2) Path 4433221101000000, CDB: 28 00 0f 04 d1 f7 00 01 e0 00, Sense: 3/11/00
Event Data:
===========
Device ID: 7
Enclosure Index: 252
Slot Number: 2
CDB Length: 10
CDB Data:
0028 0000 000f 0004 00d1 00f7 0000 0001 00e0 0000 0000 0000 0000 0000 0000 0000 Sense Length: 18
Sense Data:
00f0 0000 0003 000f 0004 00d2 0046 000a 0000 0000 0000 0000 0011 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
seqNum: 0x00004f64
Time: Wed Mar 6 05:36:43 2013