[GIT PULL] RAS updates for 6.5

From: Borislav Petkov
Date: Mon Jun 26 2023 - 08:52:48 EST


Hi Linus,

please pull the set of RAS updates for 6.5.

Thx.

---

The following changes since commit f1fcbaa18b28dec10281551dfe6ed3a3ed80e3d6:

Linux 6.4-rc2 (2023-05-14 12:51:40 -0700)

are available in the Git repository at:

git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git tags/ras_core_for_v6.5

for you to fetch changes up to 4251566ebc1cf95ae26a1e5a24cdac1ac25e942f:

EDAC/amd64: Cache and use GPU node map (2023-06-19 13:01:44 +0200)

----------------------------------------------------------------
- Add initial support for RAS hardware found on AMD server GPUs (MI200).
Those GPUs and CPUs are connected together through the coherent fabric
and the GPU memory controllers report errors through x86's MCA so EDAC
needs to support them. The amd64_edac driver supports now HBM (High
Bandwidth Memory) and thus such heterogeneous memory controller
systems

- Other small cleanups and improvements

----------------------------------------------------------------
Borislav Petkov (AMD) (1):
x86/amd_nb: Re-sort and re-indent PCI defines

Muralidhara M K (2):
EDAC/amd64: Document heterogeneous system enumeration
EDAC/amd64: Add support for AMD heterogeneous Family 19h Model 30h-3Fh

Osama Muhammad (1):
ras/debugfs: Fix error checking for debugfs_create_dir()

Yazen Ghannam (4):
x86/MCE: Check a hw error's address to determine proper recovery action
x86/amd_nb: Add MI200 PCI IDs
x86/MCE/AMD, EDAC/mce_amd: Decode UMC_V2 ECC errors
EDAC/amd64: Cache and use GPU node map

Documentation/driver-api/edac.rst | 120 ++++++++++++
arch/x86/kernel/amd_nb.c | 50 ++---
arch/x86/kernel/cpu/mce/amd.c | 6 +-
arch/x86/kernel/cpu/mce/core.c | 2 +-
drivers/edac/amd64_edac.c | 386 +++++++++++++++++++++++++++++++++++---
drivers/edac/amd64_edac.h | 1 +
drivers/edac/mce_amd.c | 3 +-
drivers/ras/debugfs.c | 2 +-
include/linux/pci_ids.h | 1 +
9 files changed, 513 insertions(+), 58 deletions(-)

--
Regards/Gruss,
Boris.

https://people.kernel.org/tglx/notes-about-netiquette