A Dell EqualLogic PS6210 SAN group member experiences multiple disk failures in the same RAID set, taking a critical iSCSI volume offline and causing all VMs on that datastore to pause.
Pattern: RAID_DEGRADATION
Severity: CRITICAL
Confidence: 95%
Remediation: Remote Hands
Test Results

| Metric | Expected | Actual | Result |
| --- | --- | --- | --- |
| Pattern Recognition | RAID_DEGRADATION | RAID_DEGRADATION | |
| Severity Assessment | CRITICAL | CRITICAL | |
| Incident Correlation | Yes | 26 linked | |
| Cascade Escalation | Yes | Yes | |
| Remediation | — | Remote Hands — Corax contacts on-site support via call, email, or API | |
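The cascade-escalation decision recorded above (a CRITICAL incident with 26 linked downstream incidents routed to Remote Hands) can be sketched as simple threshold logic. This is an illustration only: the thresholds, the `Incident` shape, and the channel names are assumptions, not Corax's actual rules.

```python
from dataclasses import dataclass

@dataclass
class Incident:
    severity: str          # e.g. "CRITICAL"
    linked_incidents: int  # correlated downstream incidents (e.g. paused VMs)

def escalation_path(inc: Incident) -> str:
    """Pick a remediation channel; a widely-cascading hardware fault
    needs on-site hands rather than a ticket or a page."""
    if inc.severity == "CRITICAL" and inc.linked_incidents > 10:
        return "remote_hands"   # dispatch on-site support (call/email/API)
    if inc.severity == "CRITICAL":
        return "page_oncall"
    return "ticket"

print(escalation_path(Incident("CRITICAL", 26)))  # remote_hands
```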
Scenario Conditions
Dell EqualLogic PS6210 storage array, 24-disk shelf. Two disks failed in same RAID-6 set within 4 hours. Volume hosting 15 VMware VMs went offline. iSCSI paths severed. No remaining spare disks.
Injected Error Messages (3)
Dell EqualLogic PS6210 volume offline — two disk failures in RAID-6 set (bay 7 and bay 12) within 4-hour window, Dell EqualLogic Group Manager reporting volume 'EQL-VOL-01' status: offline, no spare disks available for rebuild, Dell SAN group health: critical, data unavailable on affected volume, 15 VMs impacted
Dell EqualLogic iSCSI volume 'EQL-VOL-01' not accessible — VMware datastore showing 'All Paths Down (APD)', 15 VMs on datastore paused with 'lost access to volume' error, Dell EqualLogic reporting volume cannot recover without replacement disks, VMware HA unable to restart VMs due to missing storage
Dell EqualLogic PS6210 iSCSI target portal responding but volume unavailable — target IQN iqn.2001-05.com.equallogic:eql-vol-01 returning SCSI check condition, Dell SAN member reporting double-disk failure in RAID group, volume I/O suspended, iSCSI initiators receiving path errors on all multipath connections
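Pattern recognition over messages like the three above can be sketched as a keyword-signature classifier. This is a minimal illustration only — the actual Corax detection engine is not described in this report, and the `PATTERNS` table and `classify` helper below are hypothetical.

```python
import re

# Hypothetical signatures mapping log phrases to incident patterns.
PATTERNS = {
    "RAID_DEGRADATION": [
        r"disk failure", r"RAID[- ]?\d", r"no spare disks",
        r"volume .*offline", r"rebuild",
    ],
    "PATH_FAILURE": [r"All Paths Down", r"\bAPD\b", r"path errors"],
}

def classify(message: str) -> dict:
    """Count signature hits per pattern; highest score wins."""
    scores = {}
    for pattern, signatures in PATTERNS.items():
        scores[pattern] = sum(
            bool(re.search(sig, message, re.IGNORECASE)) for sig in signatures
        )
    return scores

msg = ("Dell EqualLogic PS6210 volume offline -- two disk failures in "
       "RAID-6 set, no spare disks available for rebuild")
scores = classify(msg)
print(max(scores, key=scores.get))  # RAID_DEGRADATION
```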
Neural Engine Root Cause Analysis
The Dell EqualLogic PS6210 SAN is experiencing a catastrophic storage failure: two disk failures in bays 7 and 12 within a 4-hour window on a RAID-6 set. While RAID-6 can tolerate two concurrent disk failures, the lack of available spare disks has prevented an automatic rebuild, leaving the set with no remaining redundancy and causing volume EQL-VOL-01 to go offline. This hardware failure has cascaded to the 15 virtual machines that depend on the volume, creating a critical service outage.
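The fault-tolerance reasoning above can be made concrete with a small state function: dual parity survives up to two concurrent failures, but without a spare the rebuild cannot start, so a third failure would mean data loss (and some arrays, as in this scenario, suspend volume I/O in that exposed state). The function and its state names are an illustrative sketch, not EqualLogic firmware behavior.

```python
def raid6_state(total_disks: int, failed: int, spares: int) -> str:
    """Illustrative RAID-6 health model: two parity stripes tolerate
    two concurrent failures; rebuild requires a spare per failed disk."""
    if failed > 2:
        return "data_loss"            # beyond dual-parity tolerance
    if failed == 0:
        return "optimal"
    if spares >= failed:
        return "rebuilding"           # spares absorb the failures
    return "degraded_no_rebuild"      # exposed: one more failure loses data

# The scenario: 24-disk shelf, bays 7 and 12 failed, no spares left.
print(raid6_state(total_disks=24, failed=2, spares=0))  # degraded_no_rebuild
```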
Remediation Plan
1. Immediately check whether any spare disks can be hot-swapped into the array.
2. If spares are available, initiate a manual rebuild through EqualLogic Group Manager.
3. If no spares are available, arrange emergency procurement of compatible replacement drives (10K/15K SAS drives matching the array specification).
4. Replace the failed drives in bays 7 and 12 with new hardware.
5. Monitor rebuild progress and verify data integrity.
6. Gradually bring the affected VMs back online after the volume is restored.
7. Implement proactive monitoring for the remaining drives and ensure adequate spare inventory.
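Step 5 of the plan (monitor rebuild progress) could be automated with a polling loop. The progress source is deliberately left as a callable parameter, since the real query — an SNMP OID or parsed EqualLogic Group Manager CLI output — is not specified in this report; the fake iterator below stands in for it.

```python
import time
from typing import Callable

def wait_for_rebuild(get_percent: Callable[[], int],
                     poll_seconds: float = 300.0) -> int:
    """Poll a rebuild-progress source until it reports 100%.

    get_percent is a stand-in for the real array query (SNMP, CLI
    scrape, etc. -- hypothetical here). Returns the number of polls.
    """
    polls = 0
    while True:
        polls += 1
        pct = get_percent()
        print(f"rebuild progress: {pct}%")
        if pct >= 100:
            return polls
        time.sleep(poll_seconds)

# Fake progress source for illustration: 40% -> 80% -> 100%.
progress = iter([40, 80, 100])
print(wait_for_rebuild(lambda: next(progress), poll_seconds=0))  # 3
```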