logbuf_lock deadlock on NMI

From: Luca Barbieri (ldb@ldb.ods.org)
Date: Sat Aug 31 2002 - 04:12:19 EST


I already mentioned this in an unrelated thread but I got no responses.

NMIs, by definition and documentation, are non maskable. This means that
we can get them anywhere, even when interrupts are disabled.

The problem is that in some paths of the NMI handler (mem_parity_error,
io_check_error, mca_handle_nmi), we call printk without busting
spinlocks first.
As a consequence, if I'm not missing something, if we get e.g. a memory
parity error while inside printk, we deadlock on logbuf_lock.

Apart from removing logbuf_lock and other locks that might be held in
printk, we could solve this by telling the APIC (I think that the same
can be done with the 8259 but I'm not sure) to send an interrupt to the
current CPU.
The interrupt, being maskable, will be triggered only outside
irq-protected spinlocks so we can safely do the NMI printk inside it.

Alternatively we may just reset the locks but since some errors are
non-fatal it probably isn't a good idea.

Any comments?



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sat Aug 31 2002 - 22:00:32 EST