Re: Regression in gdm-2.18 since 2.6.24

From: Peter Zijlstra
Date: Tue Apr 08 2008 - 04:43:30 EST


On Tue, 2008-04-08 at 14:20 +0530, Srivatsa Vaddagiri wrote:
> On Mon, Apr 07, 2008 at 12:48:33AM +0100, Ken Moffat wrote:
> > Well, I found your analysis convincing. Unfortunately, my hardware
> > disagreed. Testing -rc8 with CONFIG_GROUP_SCHED disabled (a test is
> > a mixture of 5 attempts to restart and 5 to shutdown):
> >
> > 1. the base version success is 4/10
> >
> > 2. increasing the granularity by a factor of 10 as you requested,
> > success is 8/10
>
> This makes me think that we are just exposing a timing related problem
> in gdm here.
>
> How abt a larger factor?
>
> # echo 200000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
>
> Does that make it 10/10 ?!
>
> Anyway, it would be interesting to analyze the failure scenario more
> (with help from gdm developers). Can you get some more debug data in this
> regard?
>
> Before you shutdown,
>
> # strace -p <gdm-binary-pid1> 2>/tmp/gdmlog1 &
> # strace -p <gdm-binary-pid2> 2>/tmp/gdmlog2 &
>
> Now shutdown and wait few minutes to confirm its not working. Send me
> the strace log files ..Hopefully this will give a hint on what they are
> deadlocked on (in the last log you sent, i can see both gdm-binaries in
> sleep state ..whether that was a momentary state or whether they are
> actually deadlocked, will be confirmed by strace logs above).
>
> > If I was confused earlier, I guess I must be dazed and confused
> > now!
>
> me too!
>
> Ingo/Peter, Any other suggestions you have?

Sounds like a race condition to me; non of these changes affect
correctness in a strict manner of speaking.



--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/