Re: softlockup: automatically detect hung TASK_UNINTERRUPTIBLE tasks

From: Andrew Morton
Date: Wed Feb 06 2008 - 19:58:09 EST


On Thu, 7 Feb 2008 01:04:25 +0100
Ingo Molnar <mingo@xxxxxxx> wrote:

>
> * Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote:
>
> > > Does that kernel have:
> > >
> > > commit ed50d6cbc394cd0966469d3e249353c9dd1d38b9
> > > Author: Peter Zijlstra <a.p.zijlstra@xxxxxxxxx>
> > > Date: Sat Feb 2 00:23:08 2008 +0100
> > >
> > > debug: softlockup looping fix
> >
> > yup. It was fetched less than 24 hours ago.
>
> does the patch below improve the situation?
>

Nope.

But I tested it on mainline, and mainline exhibits the never-powers-off
symptom, whereas ed50d6cbc394cd0966469d3e249353c9dd1d38b9 demonstrates the
powers-off-after-20-seconds symptom.

So we _may_ be dealing with two bugs here, and your patch might have fixed
the first, but that success is obscured by the second. I guess I need to
prepare a tree which has ed50d6cbc394cd0966469d3e249353c9dd1d38b9 at its
tip. (Wonders how to do that).

btw, mainline (plus this patch, not that it changed anything) prints

<stopping disk stuff>
Disabling non-boot CPUs
CPU 1 is now offline

and that's it. This machine has eight cpus. Might be a hint?
