Re: [PATCH v3] usb: dwc3: gadget: Fix NULL pointer dereference in dwc3_gadget_suspend

From: Thinh Nguyen
Date: Thu Feb 08 2024 - 17:55:26 EST


On Wed, Feb 07, 2024, Marek Szyprowski wrote:
> Dear All,
>
> On 19.01.2024 10:48, Uttkarsh Aggarwal wrote:
> > In current scenario if Plug-out and Plug-In performed continuously
> > there could be a chance while checking for dwc->gadget_driver in
> > dwc3_gadget_suspend, a NULL pointer dereference may occur.
> >
> > Call Stack:
> >
> > CPU1: CPU2:
> > gadget_unbind_driver dwc3_suspend_common
> > dwc3_gadget_stop dwc3_gadget_suspend
> > dwc3_disconnect_gadget
> >
> > CPU1 basically clears the variable and CPU2 checks the variable.
> > Consider CPU1 is running and right before gadget_driver is cleared
> > and in parallel CPU2 executes dwc3_gadget_suspend where it finds
> > dwc->gadget_driver which is not NULL and resumes execution and then
> > CPU1 completes execution. CPU2 executes dwc3_disconnect_gadget where
> > it checks dwc->gadget_driver is already NULL because of which the
> > NULL pointer deference occur.
> >
> > Cc: <stable@xxxxxxxxxxxxxxx>
> > Fixes: 9772b47a4c29 ("usb: dwc3: gadget: Fix suspend/resume during device mode")
> > Acked-by: Thinh Nguyen <Thinh.Nguyen@xxxxxxxxxxxx>
> > Signed-off-by: Uttkarsh Aggarwal <quic_uaggarwa@xxxxxxxxxxx>
>
> This patch landed some time ago in linux-next as commit 61a348857e86
> ("usb: dwc3: gadget: Fix NULL pointer dereference in
> dwc3_gadget_suspend"). Recently I found that it causes the following
> warning when no USB gadget is bound to the DWC3 driver and a system
> suspend/resume cycle is performed:
>
> dwc3 12400000.usb: wait for SETUP phase timed out
> dwc3 12400000.usb: failed to set STALL on ep0out
> ------------[ cut here ]------------
> WARNING: CPU: 4 PID: 604 at drivers/usb/dwc3/ep0.c:289
> dwc3_ep0_out_start+0xc8/0xcc
> Modules linked in:
> CPU: 4 PID: 604 Comm: rtcwake Not tainted 6.8.0-rc3-next-20240207 #7979
> Hardware name: Samsung Exynos (Flattened Device Tree)
>  unwind_backtrace from show_stack+0x10/0x14
>  show_stack from dump_stack_lvl+0x58/0x70
>  dump_stack_lvl from __warn+0x7c/0x1bc
>  __warn from warn_slowpath_fmt+0x1a0/0x1a8
>  warn_slowpath_fmt from dwc3_ep0_out_start+0xc8/0xcc
>  dwc3_ep0_out_start from dwc3_gadget_soft_disconnect+0x16c/0x230
>  dwc3_gadget_soft_disconnect from dwc3_gadget_suspend+0xc/0x90
>  dwc3_gadget_suspend from dwc3_suspend_common+0x44/0x30c
>  dwc3_suspend_common from dwc3_suspend+0x14/0x2c
>  dwc3_suspend from dpm_run_callback+0x94/0x288
>  dpm_run_callback from device_suspend+0x130/0x6d0
>  device_suspend from dpm_suspend+0x124/0x35c
>  dpm_suspend from dpm_suspend_start+0x64/0x6c
>  dpm_suspend_start from suspend_devices_and_enter+0x134/0xbd8
>  suspend_devices_and_enter from pm_suspend+0x2ec/0x380
>  pm_suspend from state_store+0x68/0xc8
>  state_store from kernfs_fop_write_iter+0x110/0x1d4
>  kernfs_fop_write_iter from vfs_write+0x2e8/0x430
>  vfs_write from ksys_write+0x5c/0xd4
>  ksys_write from ret_fast_syscall+0x0/0x1c
> Exception stack(0xf1421fa8 to 0xf1421ff0)
> ...
> irq event stamp: 14304
> hardirqs last  enabled at (14303): [<c01a599c>] console_unlock+0x108/0x114
> hardirqs last disabled at (14304): [<c0c229d8>]
> _raw_spin_lock_irqsave+0x64/0x68
> softirqs last  enabled at (13030): [<c010163c>] __do_softirq+0x318/0x4f4
> softirqs last disabled at (13025): [<c012dd40>] __irq_exit_rcu+0x130/0x184
> ---[ end trace 0000000000000000 ]---
>
> IMHO dwc3_gadget_soft_disconnect() requires some kind of a check if
> dwc->gadget_driver is present or not, as it really makes no sense to do

I don't think checking that is sufficient, and I don't think that's the
case here.

> any ep0 related operations if there is no gadget driver at all.
>

If there's indeed no gadget_driver present, then we wouldn't get this
stack trace. (ie. dwc3_ep0_out_start should occurs when gadget_driver is
present). This is a race happened between binding + suspend.

I think something like this should be sufficient. Would you mind giving
it a try?

diff --git a/drivers/usb/dwc3/gadget.c b/drivers/usb/dwc3/gadget.c
index 564976b3e2b9..1990d6371066 100644
--- a/drivers/usb/dwc3/gadget.c
+++ b/drivers/usb/dwc3/gadget.c
@@ -2656,6 +2656,11 @@ static int dwc3_gadget_soft_disconnect(struct dwc3 *dwc)
int ret;

spin_lock_irqsave(&dwc->lock, flags);
+ if (!dwc->pullups_connected) {
+ spin_unlock_irqrestore(&dwc->lock, flags);
+ return 0;
+ }
+
dwc->connected = false;

/*


Thanks,
Thinh