Re: Async suspend-resume patch w/ completions (was: Re: Asyncsuspend-resume patch w/ rwsems)

From: Alan Stern
Date: Sat Dec 19 2009 - 22:49:53 EST


On Sat, 19 Dec 2009, Rafael J. Wysocki wrote:

> On Friday 18 December 2009, Alan Stern wrote:
> > On Fri, 18 Dec 2009, Rafael J. Wysocki wrote:
> >
> > > I didn't manage to do that, but I was able to mark sd and i8042 as async and
> > > see the impact of this.
> >
> > Apparently this didn't do what you wanted. In the nx6325
> > sd+i8042+async+extra log, the 0:0:0:0 device (which is a SCSI disk) was

To be precise, the device is an ATA or SATA disk but it is managed by
the sd driver.

> > suspended by the main thread instead of an async thread.
>
> Hm, that's odd, because there's a noticeable time difference between the
> two cases in which the sd is sync and async. I'll look into it further.

I don't know what the whole story is, but the PID number tells the
tale.

> > There's an important point I neglected to mention before. Your logs
> > don't show anything for devices with no suspend callbacks at all.
> > Nevertheless, these devices sit on the device list and prevent other
> > devices from suspending or resuming as soon as they could.
>
> Unless they are async, that is.

Yes. It would be simpler to make them async. But first we ought to
know what they are. Can you add an extra line to the log for such
devices?

What I'm afraid of is that there might be a "normal" device with a
"normal" ancestor but with "abnormal" devices in between (where
"normal" means there is a suspend or resume routine and "abnormal"
means all the method pointers are NULL). I know that this happens when
there's a USB mass-storage device, for example. If we complete the
intermediate devices immediately, then there won't be anything to
prevent the ancestor from suspending before the device or the device
from resuming before the ancestor. Forcing the "abnormal" devices to
be async, even if they aren't marked that way, would avoid these
problems.

> > For example, the fingerprint sensor (3-1) took the most time to resume.
> > But other devices were delayed until after it finished because it had
> > children with no callbacks, and they delayed the devices following
> > them in the list.
> >
> > What would happen if you completed these devices immediately, as part
> > of the first pass?
>
> OK. How do the PM core is supposed to check if a device has null suspend
> and resume? Check all of the function pointers in the first pass?

All the relevant pointers (including the legacy pointers). That is,
you check only the suspend pointers during the first suspend pass, and
likewise for resume.

Alan Stern

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/