Re: multi-second application stall in open()

From: Vivek Goyal
Date: Fri Jun 22 2012 - 10:15:59 EST


On Thu, Jun 21, 2012 at 02:45:56PM -0700, Rakesh Iyer wrote:
> Hello,
>
> I coded up the watchdog and dropped it in but never did get the time to go
> looking for evidence of stalls so no confirmed evidence of what the cause
> was.
>
> Chad and I did manage to stare at the code long and hard and sort of
> convince ourselves that cfq_cfqq_wait_busy & associated logic could be the
> cause of the stall (strictly in my opinion - that logic can be fully
> folded into the idling logic, but that's a discussion for another
> day).

Rakesh, so your watchdog code just kicked the queue? I am wondering how
that helps. We already called cfq_schedule_dispatch() here, which runs the
queue, and CFQ did not find any pending requests to dispatch. So even if we
kick the queue again later, CFQ will not find any requests to dispatch
unless something changes in the meantime.
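
To make the argument concrete, here is a toy model of it. This is only a
sketch and not CFQ code: struct toy_sched, toy_kick() and
toy_make_eligible() are made-up names for illustration, and the real
dispatch decision in CFQ involves far more state than a single counter.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-in for an I/O scheduler; NOT the kernel's struct cfq_data. */
struct toy_sched {
	size_t pending;     /* requests the scheduler is willing to dispatch now */
	size_t dispatched;  /* requests handed to the driver so far */
};

/*
 * Hypothetical "kick": run the queue once and dispatch one request if
 * any is eligible. Returns true if something was dispatched.
 */
static bool toy_kick(struct toy_sched *s)
{
	if (s->pending == 0)
		return false;   /* nothing eligible, so the kick is a no-op */
	s->pending--;
	s->dispatched++;
	return true;
}

/*
 * Hypothetical state change between kicks, for example a new request
 * arriving or a wait condition being cleared.
 */
static void toy_make_eligible(struct toy_sched *s, size_t n)
{
	s->pending += n;
}
```

In this model, calling toy_kick() repeatedly on a scheduler with nothing
eligible never dispatches anything. A later kick only helps if something
like toy_make_eligible() ran in between, and that intervening change is the
part I would like to understand.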

So I am not sure if it is the same issue as you were facing.

Thanks
Vivek