Re: libata timeout, MCE and kernel panic

From: Kevin Shanahan
Date: Wed Jul 23 2008 - 20:23:37 EST


On Thu, Jul 24, 2008 at 12:49:28AM +0100, Alan Cox wrote:
> > recovery was required. It seems the journal recovery combined with the
> > md device reconstruction was enough to cause an ata command
> > timeout. After that I saw several MCEs and the kernel paniced.
>
> An MCE is amachine check, and indicates the hardware detected a problem.
> Given the sequence of heavy I/O, followed by a timeout and the processors
> detecting inconsistencies - is your PSU sufficient ?

Hmm, you may be on to something there. This machine was designed with
two disksoriginally, but due to a hardware failure on another machine
it's actually running with 5 (off a 450W redundant PSU). I have bigger
replacement PSUs on the way, so here's hoping that resolves it. Thanks
for the reply.

Regards,
Kevin.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/