I received this rather nice email today suggesting one of the drives in a RAID1 array has failed.
A Fail event had been detected on md device /dev/md4.
It could be related to component device /dev/sdc2.
Faithfully yours, etc.
P.S. The /proc/mdstat file currently contains the following:
Personalities : [raid1] md4 : active raid1 sdd2[1] sdc2[2](F)
87667136 blocks [2/1] [_U]
md3 : active raid1 sdd1[1] sdc1[0]
250304 blocks [2/2] [UU]
The strange thing is that sdc2 is just one partition and the other partition has not failed.
Since the server in another country, I can't physically inspect it. Any suggestions as to how to test whether this is really a failure or a glitch?
Best Answer
If your hard is smart-enabled:
smartctl -t long /dev/sdc
and after one or two hours:
smartctl -a /dev/sdc
and have a look if it reports errors.