Re: [PATCH] tg3: netif_carrier_off runs too early; could still bequeued when init fails

From: David S. Miller
Date: Tue Feb 21 2006 - 16:38:02 EST


From: "Michael Chan" <mchan@xxxxxxxxxxxx>
Date: Tue, 21 Feb 2006 08:44:20 -0800

> On Mon, 2006-02-20 at 14:43 -0500, Jeff Mahoney wrote:
> > This patch moves the netif_carrier_off() call from tg3_init_one()->
> > tg3_init_link_config() to tg3_open() as is the convention for most
> > other network drivers.
>
> I think moving netif_carrier_off() later is the right thing to do. We
> can also move it to the end of tg3_init_one() just before returning 0.

Agreed.

> > I was getting a panic after a tg3 device failed to initialize due to DMA
> > failure. The oops pointed to the link watch queue with spinlock debugging
> > enabled. Without spinlock debugging, the Oops didn't occur.
> >
> > I suspect that the link event was getting queued but not executed until
> > after the DMA test had failed and the device was freed. The link event
> > was then operating on freed memory, which could contain anything. With this
> > patch applied, the Oops no longer occurs.
>
> DMA test failed? What NIC device do you have? How did it fail?

I get this too with an old 5700 3COM card on sparc64. I'll get
you some more detailed info later today, hopefully.

Jeff, please get some details for Michael about your failure
case. Thanks.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/