Re: [PATCH 3/6] usb: cdns3-ti: add suspend/resume procedures for J7200

From: Kevin Hilman
Date: Tue Dec 12 2023 - 13:26:14 EST


Théo Lebrun <theo.lebrun@xxxxxxxxxxx> writes:

> Hello,
>
> On Sun Nov 26, 2023 at 11:36 PM CET, Kevin Hilman wrote:
>> Théo Lebrun <theo.lebrun@xxxxxxxxxxx> writes:
>> > On Wed Nov 22, 2023 at 11:23 PM CET, Kevin Hilman wrote:
>> >> Théo Lebrun <theo.lebrun@xxxxxxxxxxx> writes:
>> >> The point is to signal to the power domain the device is in that it can
>> >> power on/off. These IP blocks are (re)used on many different SoCs, so
>> >> the driver should not make any assumptions about what power domain it is
>> >> in (if any.)
>> >
>> > On my platform, when the device is attached to the PD it gets turned on.
>> > That feels logical to me: if a driver is not RPM aware it "just works".
>>
>> It "just works"... until the domain gets turned off.
>>
>> > Are there platforms where RPM must get enabled for the attached
>> > power-domains to be turned on?
>>
>> Yes, but but more importantly, there are platforms where RPM must get
>> enabled for the power domain to *stay* on. For example, the power
>> domain might get turned on due to devices probing etc, but as soon as
>> all the RPM-enabled drivers drop their refcount, the domain will turn
>> off. If there is a device in that domain with a non-RPM enabled driver,
>> that device will be powered off anc cause a crash.
>
> OK, that makes sense, thanks for taking the time to explain. This topic
> makes me see two things that I feel are close to being bugs. I'd be
> curious to get your view on both.

TL;DR; they are features, not bugs. ;)

> - If a device does not use RPM but its children do, it might get its
> associated power-domain turned off. That forces every single driver
> that want to stay alive to enable & increment RPM.
>
> What I naively expect: a genpd with a device attached to it that is
> not using RPM should mean that it should not be powered off at
> runtime_suspend. Benefit: no RPM calls in drivers that do not use
> it, and the behavior is that the genpd associated stays alive "as
> expected".

Your expectation makes sense, but unfortunately, that's not how RPM was
designed.

Also remember that we don't really want specific device drivers to know
which PM domain they are in, or whether they are in a PM domain at
all. The same IP block can be integrated in different ways across
different SoCs, even within the same SoC family, and we want the device
driver to just work.

For that to work well, any driver that might be in any PM domain should
add RPM calls.

> - If a device uses RPM & has a refcount strictly positive, its
> associated power-domain gets turned off either way at suspend_noirq.
> That feels non-intuitive as well.
>
> What I naively expect: check for RPM refcounts of attached devices
> when doing suspend_noirq of power-domains. Benefit: control of what
> power-domains do from attached devices is done through the RPM API.

I agree that this is non-intuitive from an RPM PoV, but remember that
RPM was added on top of existing system-wide suspend support. And from
a system-wide suspend PoV, it might be non-intuitive that a driver
thinks it should be active (non-zero refcount) when user just requested
a system-wide suspend. Traditionally, when a user requests a
system-wide suspend, they expect the whole system to shut down.

On real SoCs in real products, power management is not so black and
white, and I fully understand that, and personally, I'm definitely open
to not forcing RPM-active devices off in suspend, but that would require
changes to core code, and remove some assumptions of core code that
would need to be validated/tested.

Kevin