[PATCH v2 0/2] drm/msm: Add tracking for faults associated with an address space

From: Rob Clark
Date: Tue Feb 01 2022 - 11:16:10 EST


From: Rob Clark <robdclark@xxxxxxxxxxxx>

Currently, for GL_EXT_robustness userspace uses the global and per-
submitqueue fault counters to determine GUILTY_CONTEXT_RESET_EXT vs
INNOCENT_CONTEXT_RESET_EXT. But that is a bit overly paranoid, in
that a fault in a different process's context (when it has its own
isolated address space) should not hurt anything.

This is particularly annoying on CrOS with chrome's exit_on_context_lost
quirk when running deqp in the android container: the deqp-egl suite has
tests that intentionally trigger gpu hangs (for the purpose of testing
the robustness extension), which triggers chrome to restart, which
restarts the android container!

But chrome doesn't need to know about these faults, thanks to address
space isolation.
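
As a rough illustration of the classification described above, here is a
minimal sketch in plain C. It is not the Mesa or msm code: every name in it
(fault_snapshot, visible_global_faults, classify_reset) is hypothetical, and
the way the "global" counter is narrowed to one address space is a simplified
assumption about the intent of this series, not the exact arithmetic of the
patches.

```c
#include <stdint.h>

/* Hypothetical names throughout; this is not the Mesa or msm UAPI. */
enum reset_status {
    NO_RESET,
    GUILTY_CONTEXT_RESET,
    INNOCENT_CONTEXT_RESET,
};

/* Fault counters as seen by one userspace context at one point in time. */
struct fault_snapshot {
    uint32_t queue_faults;  /* faults attributed to this submitqueue */
    uint32_t global_faults; /* "global" faults reported to this context */
};

/*
 * Assumed model of the counter reported as "global" faults: a context with
 * its own isolated address space only sees faults from that address space,
 * while a context without one still sees the GPU-wide count.
 */
static uint32_t
visible_global_faults(uint32_t gpu_wide_faults, uint32_t aspace_faults,
                      int has_private_aspace)
{
    return has_private_aspace ? aspace_faults : gpu_wide_faults;
}

/* Compare two snapshots the way a robustness query might. */
static enum reset_status
classify_reset(const struct fault_snapshot *prev,
               const struct fault_snapshot *cur)
{
    if (cur->queue_faults != prev->queue_faults)
        return GUILTY_CONTEXT_RESET;   /* our own queue faulted */
    if (cur->global_faults != prev->global_faults)
        return INNOCENT_CONTEXT_RESET; /* something else we can see faulted */
    return NO_RESET;
}
```

Under this model, a hang triggered by deqp in its own address space leaves
the counters visible to chrome unchanged, so classify_reset() reports
NO_RESET for chrome and exit_on_context_lost is never triggered.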

Applies on top of https://patchwork.freedesktop.org/series/98907/

Rob Clark (2):
  drm/msm/gpu: Add ctx to get_param()
  drm/msm/gpu: Track global faults per address-space

 drivers/gpu/drm/msm/adreno/adreno_gpu.c | 5 +++--
 drivers/gpu/drm/msm/adreno/adreno_gpu.h | 3 ++-
 drivers/gpu/drm/msm/msm_drv.c           | 3 ++-
 drivers/gpu/drm/msm/msm_gem.h           | 3 +++
 drivers/gpu/drm/msm/msm_gpu.c           | 8 +++++++-
 drivers/gpu/drm/msm/msm_gpu.h           | 8 ++++++--
 drivers/gpu/drm/msm/msm_rd.c            | 6 ++++--
 7 files changed, 27 insertions(+), 9 deletions(-)

--
2.34.1