NodeRAIDDegraded #
Meaning #
A RAID array on the node is degraded.
This alert fires when a node's storage configuration includes a RAID array and that array reports a degraded state because one or more disks have failed.
Impact #
The affected node could go offline at any moment if further disk failures cause the RAID array to fail completely.
Diagnosis #
You can open a shell on the node and use the standard Linux utilities to diagnose the issue, but you may need to install additional software in the debug container:
$ NODE_NAME='<value of instance label from alert>'
$ oc debug "node/$NODE_NAME"
$ cat /proc/mdstat
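In `/proc/mdstat`, each array's status line ends with a member map such as `[UU]`, where an underscore (for example `[_U]`) marks a missing or failed member. As a rough sketch, the helper below lists only the arrays whose map contains an underscore. The function name `list_degraded_arrays` is hypothetical, and the parsing assumes the usual Linux software RAID (md) layout, so treat it as a convenience rather than a replacement for reading the file directly.

```shell
# Hypothetical helper: print the name of every md array whose member map
# contains "_" (a missing or failed member).
# Reads /proc/mdstat by default; pass another path to inspect a saved copy.
list_degraded_arrays() {
  awk '
    # Array header line, e.g. "md0 : active raid1 sdb1[1] sda1[0](F)"
    /^md[^ ]* *:/ { name = $1 }
    # Status line, e.g. "1048512 blocks super 1.2 [2/1] [_U]"
    name != "" && /\[[U_]+\]/ {
      if (match($0, /\[[U_]*_[U_]*\]/)) print name
      name = ""
    }
  ' "${1:-/proc/mdstat}"
}
```

Calling `list_degraded_arrays` with no argument inside the debug shell reads the live `/proc/mdstat`; empty output suggests that no array is currently reporting a missing member.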
Mitigation #
Cordon and drain the node if possible, then proceed with RAID recovery.
See the Red Hat Enterprise Linux [documentation][1] for potential steps.
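The cordon and drain step might look like the sketch below. The wrapper name `cordon_and_drain` is hypothetical, and the drain flags are an assumption about what is acceptable on your cluster: `--ignore-daemonsets` skips DaemonSet-managed pods, and `--delete-emptydir-data` allows eviction of pods that use `emptyDir` volumes, discarding that data. Adjust or drop them as your workloads require.

```shell
# Hypothetical helper: mark the node unschedulable, then evict its workloads.
# The drain only runs if the cordon succeeds.
cordon_and_drain() {
  node="$1"
  oc adm cordon "$node" &&
    oc adm drain "$node" --ignore-daemonsets --delete-emptydir-data
}
```

With `NODE_NAME` set as in the diagnosis step, this would be invoked as `cordon_and_drain "$NODE_NAME"`. Once the array has been repaired, `oc adm uncordon "$NODE_NAME"` makes the node schedulable again.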