Re: Pls help me understand this MCE

From: Petr Vandrovec
Date: Thu Aug 11 2005 - 09:10:51 EST


David King wrote:
CPU 0 4 northbridge TSC f03d1e587b
Northbridge Watchdog error
bit57 = processor context corrupt
bit61 = error uncorrected
bus error 'generic participation, request timed out
generic error mem transaction
generic access, level generic'
STATUS b200000000070f0f MCGSTATUS 4

That's all meaningless to me so I'm looking for help understanding what
it means and what parts of my system I should be looking at in order to
try to resolve this MCE.

A Google search found one hit that suggested that "Something tried to
access a physical memory address that was not mapped in the CPU." If
that is indeed the correct interpretation, is there any wany to figure
out what that "something" is?

Try dumping *all* MCE values, as well as a call stack. Even although
MCE is tagged as processor context corrupt, there is rather big chance
that stack trace will point back to the instruction which caused MCE
(it always did in my case), especially if it is single processor system.
Then you'll at least know which subsystem/driver did that.

Best regards,
Petr Vandrovec

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/