Defective Disk not reported as Dead

From: Oktay Akbal (oktay.akbal@s-tec.de)
Date: Fri May 23 2003 - 02:23:03 EST


Hello !

We do have some strange problem here.
A Server with some qlogic qla2002f-Adapter (2-channel)is connected to 2
external Raid-Arrays via multipathing and raid1 on top of it. But the
multipathing and raid should not be the problem here.

The Raid-Arrays itself are Fibre-to-Ide and present themselfs as 1 disks
each. Now for the problem: Due to a firmware bug the raid-boxes sometimes
seems to loose the ability to write (and read i think) to their internal
disks. The effect is, that the cache fills up but does not get flushed to
disks. When full, the box is in a strange state. It gets detected when
reloading the kernel-modules but linux can no longer access the disks.
when accessing the partition all processes on that disk hang (like ls or
ps -efa etc.)

The disk is never recognized as defective and never thrown out of raid.
So the processes do not continue to work.

Is this fault simply not detectable, or could this be a problem with the
qlogic-driver ?

The Kernel used is some 2.4.18 Version by SuSE.

Thanks for help

Oktay Akbal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Fri May 23 2003 - 22:00:53 EST