Re: [PATCH v2 4/4] interconnect: qcom: icc-rpmh: Add BCMs to commit list in pre_aggregate

From: Mike Tipton
Date: Wed Aug 18 2021 - 00:44:06 EST


On 8/11/2021 11:13 AM, Stephen Boyd wrote:
Quoting Alex Elder (2021-08-11 09:01:27)
On 8/10/21 6:31 PM, Stephen Boyd wrote:
Quoting Mike Tipton (2021-07-21 10:54:32)
We're only adding BCMs to the commit list in aggregate(), but there are
cases where pre_aggregate() is called without subsequently calling
aggregate(). In particular, in icc_sync_state() when a node with initial
BW has zero requests. Since BCMs aren't added to the commit list in
these cases, we don't actually send the zero BW request to HW. So the
resources remain on unnecessarily.

Add BCMs to the commit list in pre_aggregate() instead, which is always
called even when there are no requests.

Fixes: 976daac4a1c5 ("interconnect: qcom: Consolidate interconnect RPMh support")
Signed-off-by: Mike Tipton <mdtipton@xxxxxxxxxxxxxx>
---

This patch breaks reboot for me on sc7180 Lazor

If I am using the interface improperly or something in the
IPA driver, please let me know. I actually plan to switch
to using the bulk interfaces soon (FYI).


I suspect I'm seeing a shutdown ordering issue, where we start dropping
interconnect requests in driver shutdown callbacks and then some bus
turns off and the CPU can't access a device. Maybe to fix this problem
(if reverting isn't an option) would be to add a shutdown hook to
rpmh-icc that effectively "props up" the bandwidth requests during
shutdown so that we don't have to think about finding the place that the
interconnect is turned off. We're shutting down/restarting anyway, so
there isn't much point in trying to be power efficient for the last few
moments of runtime.


I wouldn't have expected this change to impact reboot, since this change should only impact places where pre_aggregate() is called without subsequently calling aggregate(). I don't think there are currently any places that can happen other than icc_sync_state().

I suppose what could be happening is we're now disabling certain paths in icc_sync_state() and their associated drivers just aren't used or attempting accesses until they're being torn down in reboot. That doesn't seem particularly likely, but nothing else immediately comes to mind.

We already mark paths critical for the CPU as "keepalive" such that they'll never turn off. This includes the CPU path to DDR and top-level CSRs. Basically just paths that can't actually be turned off while SW is running. That logic is unchanged in this patch. So we generally shouldn't need any shutdown-specific callbacks to place BW votes during this window. Client drivers should still ensure they're sequencing their shutdown logic such that any bus accesses happen before they remove their BW requests.