HP Smart Array; How to safely remove a physcial drive with SMART predictive failure from array so it can be replaced?
hpacucli controller slot=1 ld 1 show detail
Smart Array P400 in Slot 1
array A
Logical Drive: 1
Size: 273.3 GB
Fault Tolerance: RAID 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Stripe Size: 128 KB
Status: OK
Array Accelerator: Enabled
Unique Identifier: xxxx
Disk Name: /dev/cciss/c0d0
Mount Points: /boot 196 MB, / 7.8 GB
Logical Drive Label: xxxxx
Mirror Group 0:
physicaldrive 1I:1:8 (port 1I:box 1:bay 8, SAS, 72 GB, Predictive Failure)
physicaldrive 1I:1:7 (port 1I:box 1:bay 7, SAS, 72 GB, OK)
physicaldrive 1I:1:6 (port 1I:box 1:bay 6, SAS, 72 GB, OK)
physicaldrive 1I:1:5 (port 1I:box 1:bay 5, SAS, 72 GB, OK)
Mirror Group 1:
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, SAS, 72 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, SAS, 72 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, SAS, 72 GB, OK)
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, SAS, 72 GB, OK)
hpacucli controller slot=1 show
Smart Array P400 in Slot 1
Bus Interface: PCI
Slot: 1
Serial Number: xxxx
Cache Serial Number: xxxx
RAID 6 (ADG) Status: Disabled
Controller Status: OK
Chassis Slot:
Hardware Revision: Rev D
Firmware Version: 4.06
Rebuild Priority: Medium
Expand Priority: Medium
Surface Scan Delay: 15 secs
Post Prompt Timeout: 0 secs
Cache Board Present: True
Cache Status: OK
Accelerator Ratio: 100% Read / 0% Write
Drive Write Cache: Disabled
Total Cache Size: 256 MB
Battery Pack Count: 0
SATA NCQ Supported: True
Is it safe to run this sequence of commands?
hpacucli controller slot=1 array A remove drives=1:8
hpacucli controller slot=1 pd 1:8 modify led=on
get remote hands to remove the drive and replace. Then run:
hpacucli controller slot=1 array A add drives=1:8
Will this get the array to rebuild safely?
Solution 1:
It is safe to run those commands. The mirror group can survive the absence of one disk. It should rebuild automatically, but if it doesn't the command you already identified will kick it into gear.
Solution 2:
You can just pull the dead disk and replace it - there's no need for OS involvement at all.
Solution 3:
A drive with prefailure won't necessarily have an LED indicator (sometimes it's a slow amber blink), so identifying it for smart hands is a good idea. You don't need to remove the drive from the array or re-add it, though. Those functions will be handled by the controller automatically. All you will need is the hpacucli controller slot=1 pd 1:8 modify led=on
line.
Solution 4:
The sequence of commands that you specify do not work on our Smart Array 641/642 controllers. A This operation is not supported with the current configuration
error is encounter. On my class of array, these commands do not work, even if all the disks are properly operation. The best solution is to ewwhite's process to blink the drive, and physically replace.