[PATCH v2 0/4] thermal: intel: hfi: Fix memory corruption on resume from hibernation

From: Ricardo Neri
Date: Tue Jan 02 2024 - 23:13:30 EST


Hi,

This is v2 of this series. You can read the overview and motivation in the
cover letter of v1 [1].

I smoke-retested this version on a Meteor Lake system. It completed 50
cycles of suspend-to-disk and resume. I completed longer testing for v1.

Changes since v1:
* Added locking to hfi_pm_notify() to serialize RMW operations on the
MSR_IA32_HW_FEEDBACK_CONFIG register. (Rafael)
* Relocated a comment regarding the reallocation of HFI memory to
patch 3/4. (Rafael)
* Clarified that patch 1/4 does not introduce functional changes.
(Rafael)
* Indicated the first stable version on which this patchset should be
backported.
* Renamed hfi_do_pm_[enable|disable]() as hfi_do_[enable|disable]() for
future reuse. (Stan)
* Registered the HFI suspend notifier towards the end of
intel_hfi_init(). (Stan)

Thanks and BR,
Ricardo

[1]. https://lore.kernel.org/all/20231227062940.10780-1-ricardo.neri-calderon@xxxxxxxxxxxxxxx/

Ricardo Neri (4):
thermal: intel: hfi: Refactor enabling code into helper functions
thermal: intel: hfi: Enable an HFI instance from its first online CPU
thermal: intel: hfi: Disable an HFI instance when all its CPUs go
offline
thermal: intel: hfi: Add a suspend notifier

drivers/thermal/intel/intel_hfi.c | 153 +++++++++++++++++++++++++-----
1 file changed, 127 insertions(+), 26 deletions(-)

--
2.25.1