Re: "deadlock" between smc91x driver and link_watch

From: Andrew Morton
Date: Tue Nov 23 2004 - 18:35:23 EST


Ian Campbell <icampbell@xxxxxxxxx> wrote:
>
> Hi,
>
> I'm seeing a deadlock in linkwatch_event() when bringing down an
> Ethernet interface using the smc91x driver (drivers/net/smc91x.c).
>
> What I am seeing is that smc_close() is calling netif_carrier_off which
> has the call chain:
> netif_carrier_off
> -> linkwatch_fire_event
> -> schedule_work or schedule_delayed_work
> The function that is scheduled is linkwatch_event().
>
> smc_close() then goes on to call flush_scheduled_work() in order to
> ensure that it's own pending workqueue stuff (smc_phy_configure()) is
> completed before powering down the PHY.
>
> What I am seeing is that linkwatch_event() is deadlocking trying take
> rtnl_sem via rtnl_shlock(). The lock appears to already be held by a
> call to rtnl_lock() from devinet_ioctl().
>
> Any ideas? Perhaps smc_phy_configure calls could just check that the
> interface is up before continuing, then there would be no need to flush
> the queue to get rid of it.
>

linkwatch probably doesn't need the flush_scheduled_work(), because it
correctly does refcounting on the device. Presumably that
flush_scheduled_work() in smc_close() is there to force out any pending
calls to smc_phy_configure().

One possible fix would be to remove that flush_scheduled_work() and to do
refcounting around smc_phy_configure(): dev_hold() when scheduling the work
(if schedule_work() returned true), dev_put() in the handler.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/