[PATCH v12 0/8] Add support for Sub-NUMA cluster (SNC) systems

From: Tony Luck
Date: Wed Nov 29 2023 - 19:34:30 EST


The Sub-NUMA cluster feature on some Intel processors partitions the CPUs
that share an L3 cache into two or more sets. This plays havoc with the
Resource Director Technology (RDT) monitoring features. Prior to this
patch Intel has advised that SNC and RDT are incompatible.

Some of these CPU support an MSR that can partition the RMID counters in
the same way. This allows monitoring features to be used. With the caveat
that users must be aware that Linux may migrate tasks more frequently
between SNC nodes than between "regular" NUMA nodes, so reading counters
from all SNC nodes may be needed to get a complete picture of activity
for tasks.

Cache and memory bandwidth allocation features continue to operate at
the scope of the L3 cache.

Signed-off-by: Tony Luck <tony.luck@xxxxxxxxx>

Changes since v11:

Global: (comment from Reinette)
Reorder tags with Signed-off-by: first, then Reviewed/Tested

Patch1: (comment from Reinette)
Add error message to domain_remove_cpu() [matching the one in
domain_add_cpu()] for the case where get_cpu_cacheinfo_id()
failed to find a cache ID for the current CPU.

Patch3: (comment from Reinette)
When splitting the domain_add_cpu() and domain_remove_cpu()
functions add "control" and "monitor" to the warning messages.
Fix the:
pr_warn("Couldn't find control scope id=%d for CPU %d\n", id, cpu);
message:
s/Couldn't/Can't/
s/control scope/control domain with/
Add resource name.
Ditto for similar monitor message.

Patch6: (comment from Reinette)
Used Reinette's rewrite into imperative mood for latter part
of commit message.

Patch8: (comment from Randy)
s/have/has/ s/cache. But/cache, but/

Added Reinette's "Reviewed-by:" to all patches except patch 3.

Added Shaopeng Tan's Reviewed and Tested to all patches.

Rebased to v6.7-rc3

Tony Luck (8):
x86/resctrl: Prepare for new domain scope
x86/resctrl: Prepare to split rdt_domain structure
x86/resctrl: Prepare for different scope for control/monitor
operations
x86/resctrl: Split the rdt_domain and rdt_hw_domain structures
x86/resctrl: Add node-scope to the options for feature scope
x86/resctrl: Introduce snc_nodes_per_l3_cache
x86/resctrl: Sub NUMA Cluster detection and enable
x86/resctrl: Update documentation with Sub-NUMA cluster changes

Documentation/arch/x86/resctrl.rst | 25 +-
include/linux/resctrl.h | 85 +++--
arch/x86/include/asm/msr-index.h | 1 +
arch/x86/kernel/cpu/resctrl/internal.h | 66 ++--
arch/x86/kernel/cpu/resctrl/core.c | 411 +++++++++++++++++-----
arch/x86/kernel/cpu/resctrl/ctrlmondata.c | 58 +--
arch/x86/kernel/cpu/resctrl/monitor.c | 68 ++--
arch/x86/kernel/cpu/resctrl/pseudo_lock.c | 26 +-
arch/x86/kernel/cpu/resctrl/rdtgroup.c | 149 ++++----
9 files changed, 607 insertions(+), 282 deletions(-)


base-commit: 2cc14f52aeb78ce3f29677c2de1f06c0e91471ab
--
2.41.0