Re: Machine Check Exception on Opteron 265

From: Espen FjellvÃr Olsen
Date: Tue Apr 17 2007 - 11:06:59 EST


Alan Cox wrote:
> On Sat, 14 Apr 2007 16:58:43 +0200
> Espen FjellvÃr Olsen <espen@xxxxxxxxxxxxxxxxx> wrote:
>
>
>> Hi!
>> Today our Opteron 265, 2x2, paniced after many months uptime, giving
>> only this error message:
>>
>> HARDWARE ERROR
>> CPU 2: Machine Check Exception: 4 Bank 4: b60a200100000813
>> TSC 6bb9fd0142921a ADDR a891e9b8
>> This is not a software problem!
>>
*snip*
> Consult your hardware vendor but if its a single event in a year it might
> be anything - even cosmic rays.
>
Yeah, we have had more crashes now, and have removed some of our DIMMs
in hope of getting a stable system again.
And ofcourse running memtest on those DIMMs. Hope it is one of those,
and not one the CPUs =)

--
Mvh
Espen FjellvÃr Olsen
Drift @ Tihlde
espenfo@xxxxxxxxxx

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/