Re: [patch] sched: unlocked context-switches

From: Ingo Molnar
Date: Sun Apr 10 2005 - 01:45:07 EST



* David S. Miller <davem@xxxxxxxxxxxxx> wrote:

> > Yes, of course. The deadlock was due to context-switching, not
> > switch_mm() per se. Hopefully someone else beats me to remembering
> > the details before Monday.
>
> Sparc64 has a deadlock because we hold mm->page_table_lock during
> switch_mm(). I bet IA64 did something similar, as I remember it had a
> very similar locking issue in this area.
>
> So the deadlock was, we held the runqueue locks over switch_mm(),
> switch_mm() spins on mm->page_table_lock, the cpu which does have
> mm->page_table_lock tries to do a wakeup on the first cpu's runqueue.
> Classic AB-BA deadlock.

yeah, i can see that happening - holding the runqueue lock and enabling
interrupts. (it's basically never safe to enable irqs with the runqueue
lock held.)

the patch drops both the runqueue lock and enables interrupts, so this
particular issue should not trigger.

Ingo
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/