Re: [PATCH V4] PM / OPP: Pass opp_table to dev_pm_opp_put_regulator()

From: Joonyoung Shim
Date: Wed Nov 30 2016 - 01:19:16 EST


Hi Viresh,

On 11/30/2016 12:59 PM, Viresh Kumar wrote:
> From: Stephen Boyd <sboyd@xxxxxxxxxxxxxx>
>
> Joonyoung Shim reported an interesting problem on his ARM octa-core
> Odoroid-XU3 platform. During system suspend, dev_pm_opp_put_regulator()
> was failing for a struct device for which dev_pm_opp_set_regulator() is
> called earlier.
>
> This happened because an earlier call to
> dev_pm_opp_of_cpumask_remove_table() function (from cpufreq-dt.c file)
> removed all the entries from opp_table->dev_list apart from the last CPU
> device in the cpumask of CPUs sharing the OPP.
>
> But both dev_pm_opp_set_regulator() and dev_pm_opp_put_regulator()
> routines get CPU device for the first CPU in the cpumask. And so the OPP
> core failed to find the OPP table for the struct device.
>
> In order to fix that up properly, we need to revisit APIs like
> dev_pm_opp_set_regulator() and make them talk in terms of cookies
> provided by the OPP core. But such a solution will be hard to backport
> to stable kernels.
>
> This patch attempts to fix this problem by returning a pointer to the
> opp_table from dev_pm_opp_set_regulator() and using that as the
> parameter to dev_pm_opp_put_regulator(). This ensures that the
> dev_pm_opp_put_regulator() doesn't fail to find the opp table.
>
> Note that similar design problem also exists with other
> dev_pm_opp_put_*() APIs, but those aren't used currently by anyone and
> so we don't need to update them for now.
>
> [Viresh]: Written commit log, minor improvements in the patch and tested
> on exynos 5250.
>
> Cc: # v4.4+ <stable@xxxxxxxxxxxxxxx>
> Reported-by: Joonyoung Shim <jy0922.shim@xxxxxxxxxxx>
> Signed-off-by: Stephen Boyd <sboyd@xxxxxxxxxxxxxx>
> Signed-off-by: Viresh Kumar <viresh.kumar@xxxxxxxxxx>
> ---
> V3->V4:
> - Completely different approach, suggested earlier by Stephen.
> - Can be merged safely now as both /me and Stephen agree to this one.
>
> @Joonyoung: Can you please test this last patch please ?
>

Just system suspend/resume is working but i was missing below test case
that you inform when i test for prior patches on my Odroid-XU3 board.

- offline CPU 4
- suspend the system

With this test case, now all patches posted have the problem that is
failed to get clk: -2.

If all CPUs are online, cpufreq_driver->init(policy) is called by
cpufreq_online() only for CPU0 and CPU4 during system resume but it is
called for CPU5, CPU6 and CPU7 during system resume after CPU4 is
offline, and causes error.

Please refer below logs.

# echo 0 > /sys/devices/system/cpu/cpu4/online
[ 145.438931] IRQ54 no longer affine to CPU4
[ 145.439931] CPU4: shutdown
#
#
# rtcwake -m mem -s 3
wakeup from "mem" at Mon Apr 9 11:04:19 2001
[ 148.708773] PM: Syncing filesystems ... done.
[ 148.718591] Freezing user space processes ... (elapsed 0.005 seconds) done.
[ 148.727749] Freezing remaining freezable tasks ... (elapsed 0.002 seconds) done.
[ 148.753406] wake enabled for irq 135
[ 148.754006] smsc95xx 1-1.1:1.0 eth0: entering SUSPEND2 mode
[ 148.844402] PM: suspend of devices complete after 100.196 msecs
[ 148.869158] PM: late suspend of devices complete after 19.793 msecs
[ 148.891948] PM: noirq suspend of devices complete after 17.803 msecs
[ 148.897067] Disabling non-boot CPUs ...
[ 148.920201] IRQ51 no longer affine to CPU1
[ 148.921414] CPU1: shutdown
[ 148.990114] IRQ52 no longer affine to CPU2
[ 148.991158] CPU2: shutdown
[ 149.049416] IRQ53 no longer affine to CPU3
[ 149.050477] CPU3: shutdown
[ 149.114297] IRQ55 no longer affine to CPU5
[ 149.115318] CPU5: shutdown
[ 149.184539] IRQ56 no longer affine to CPU6
[ 149.185421] CPU6: shutdown
[ 149.257561] IRQ57 no longer affine to CPU7
[ 149.258458] CPU7: shutdown
exynos_suspend_enter: suspending the system...
exynos_suspend_enter: wakeup masks: fffffffd,ffffffef
UART[f7020000]: ULCON=0003, UCON=f3c5, UFCON=0131, UBRDIV=001b
[ 149.295339] Enabling non-boot CPUs ...
[ 149.314454] CPU1 is up
[ 149.329717] CPU2 is up
[ 149.350140] CPU3 is up
[ 149.370365] cpu cpu5: cpufreq_init: failed to get clk: -2
[ 149.374803] IRQ55 no longer affine to CPU5
[ 149.375126] CPU5: shutdown
[ 149.399737] Error taking CPU5 up: -2
[ 149.425424] cpu cpu6: cpufreq_init: failed to get clk: -2
[ 149.429886] IRQ56 no longer affine to CPU6
[ 149.430242] CPU6: shutdown
[ 149.454745] Error taking CPU6 up: -2
[ 149.480397] cpu cpu7: cpufreq_init: failed to get clk: -2
[ 149.484869] IRQ57 no longer affine to CPU7
[ 149.485186] CPU7: shutdown
[ 149.509718] Error taking CPU7 up: -2
[ 149.512392] s3c-i2c 12c60000.i2c: slave address 0x00
[ 149.516787] s3c-i2c 12c60000.i2c: bus frequency set to 65 KHz
[ 149.522554] s3c-i2c 12c80000.i2c: slave address 0x00
[ 149.527461] s3c-i2c 12c80000.i2c: bus frequency set to 65 KHz
[ 149.535818] PM: noirq resume of devices complete after 23.970 msecs
[ 149.543853] PM: early resume of devices complete after 3.020 msecs
[ 149.571356] s3c2410-wdt 101d0000.watchdog: watchdog disabled
[ 149.575811] usb usb1: root hub lost power or was reset
[ 149.641026] usb usb2: root hub lost power or was reset
[ 149.646353] exynos-tmu 10060000.tmu: More trip points than supported by this TMU.
[ 149.652405] exynos-tmu 10060000.tmu: 2 trip points should be configured in polling mode.
[ 149.663667] wake disabled for irq 135
[ 149.680093] Suspended for 2.879 seconds
[ 149.715816] usb usb3: root hub lost power or was reset
[ 149.719482] usb usb4: root hub lost power or was reset
[ 149.728432] s3c-rtc 101e0000.rtc: rtc disabled, re-enabling
[ 149.932034] usb 1-1: reset high-speed USB device number 2 using exynos-ehci
[ 150.387017] usb 1-1.1: reset high-speed USB device number 3 using exynos-ehci
[ 150.566376] PM: resume of devices complete after 995.669 msecs
[ 150.575402] Restarting tasks ... done.

Thanks.