Re: FYI: RAID5 unusably unstable through 2.6.14

From: Martin Drab
Date: Tue Jan 17 2006 - 21:29:23 EST


On Tue, 17 Jan 2006, Michael Loftis wrote:
> --On January 17, 2006 9:13:49 PM +0100 Martin Drab <drab@xxxxxxxxxxxxxxxxxxx> wrote:
>
> > I've discussed this with Mark Salyzyn, because I thought it was a problem
> > of the AACRAID driver. But I was told that there is nothing that AACRAID
> > can possibly do about it, and that it is a problem of the upper Linux
> > layers (block device layer?) that are strictly fault intolerant, and
> > though the problem was just an inconsistency of one particular localized
> > region inside /dev/sda2, Linux was COMPLETELY UNABLE (!!!!!) to read a
> > single byte from the ENTIRE VOLUME (/dev/sda)!
>
> Actually...this is also related to how the controller reports the error. If it
> reports a device level death/failure rather than a read error, Linux is just

Yes, but that wasn't the case here. I've witnessed that a while ago, but
not this time. It was just a read error, no device death nor going off-line.
Otherwise I wouldn't be so surprised that Linux didn't even try.
The controller didn't do anything that would prevent the system from reading.
Windows made use of that and worked; Linux unfortunately didn't even try.
That's why I'm talking about it here.
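
As an illustration of the behaviour being asked for here, the following is a
minimal user-space sketch in Python, not what the kernel block layer or the
AACRAID driver actually does. The function name and the chunk size are made up
for the example. It reads a device or file chunk by chunk and, when one chunk
fails with EIO, records the offset and carries on with the next chunk instead
of giving up on the whole volume.

```python
import errno
import os


def read_skipping_bad_chunks(path, chunk_size=4096):
    """Read `path` chunk by chunk, continuing past chunks that fail with EIO.

    Returns (good, bad): `good` is a list of (offset, bytes_read) pairs,
    `bad` is a list of offsets whose read raised EIO.
    Hypothetical helper for illustration only.
    """
    good, bad = [], []
    fd = os.open(path, os.O_RDONLY)
    try:
        # Seeking to the end yields the size for regular files and block devices.
        size = os.lseek(fd, 0, os.SEEK_END)
        for offset in range(0, size, chunk_size):
            try:
                data = os.pread(fd, chunk_size, offset)
                good.append((offset, len(data)))
            except OSError as exc:
                if exc.errno != errno.EIO:
                    raise  # anything other than a read error is not handled here
                bad.append(offset)  # localized read error: note it and move on
    finally:
        os.close(fd)
    return good, bad
```

On a volume with one damaged region one would expect `bad` to hold only the
offsets inside that region, with everything else readable. That assumes the
controller reports a plain read error for those sectors, as described above.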

> taking that at face value. Yup, it should retry though. Other possibilities
> exist, including the volume going offline at the controller level, which has
> nothing to do with Linux. This is most often the problem I see with RAIDs.

Martin