[PATCHv3 0/2] *** Detect interrupt storm in softlockup ***

From: Bitao Hu
Date: Wed Jan 31 2024 - 12:18:35 EST


Hi, guys.
I have implemented a low-overhead method for detecting interrupt storms
in softlockup. Please review it; all comments are welcome.

Changes from v2 to v3:

- From Liu Song, use an enum instead of macros for cpu_stats, shorten
the name 'idx_to_stat' to 'stats', add 'get_16bit_precision' instead
of open-coded right shift operations, and use 'struct irq_counts'.

- From the kernel test robot, use '__this_cpu_read' and
'__this_cpu_write' instead of accessing a per-cpu array directly, in
order to avoid this warning:
'sparse: incorrect type in initializer (different modifiers)'
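
To illustrate the first item above, here is a minimal userspace sketch
of an enum index plus a small helper that wraps the right shift. This is
not the patch code: the enum member names, the shift amount of 24, and
'delta_16bit' are assumptions made only for this example.

```c
#include <stdint.h>

/* Hypothetical stat indices: an enum gives the count for free (NUM_STATS). */
enum stat_idx {
	STAT_SYSTEM,
	STAT_SOFTIRQ,
	STAT_HARDIRQ,
	STAT_IDLE,
	NUM_STATS,
};

/* Assumed shift amount: 2^24 ns is roughly 16.8 ms per unit. */
#define PRECISION_SHIFT 24

/* Keep 16 bits of a 64-bit nanosecond counter, starting at bit 24. */
static uint16_t get_16bit_precision(uint64_t data_ns)
{
	return (uint16_t)(data_ns >> PRECISION_SHIFT);
}

/* Difference of two truncated samples; unsigned wraparound keeps it valid. */
static uint16_t delta_16bit(uint64_t old_ns, uint64_t new_ns)
{
	return (uint16_t)(get_16bit_precision(new_ns) -
			  get_16bit_precision(old_ns));
}
```

Naming the shift in one helper keeps the chosen precision in a single
place, so callers do not repeat a magic shift count.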

Changes from v1 to v2:

- From Douglas, reduce the memory used by cpustats. With the maximum
number of CPUs (8192), the total is now:
2 * 8192 * 4 + 1 * 8192 * 5 * 4 + 1 * 8192 = 237,568 bytes.

- From Liu Song, clean up the code formatting and add necessary comments.

- From Douglas, use interrupt counts instead of interrupt time to
determine the cause of a softlockup.

- Remove the cmdline parameter added in PATCHv1.
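
For anyone who wants to double-check the cpustats memory figure in the
first item above, the sum can be reproduced with a short userspace
sketch. The comments on what each term stores are my reading of the
formula and should be taken as assumptions; 'cpustats_footprint' is a
name made up for this example.

```c
#include <stddef.h>

#define MAX_CPUS 8192	/* maximum CPU count used in the figure above */

/* Total bytes for ncpus CPUs, term by term as written in the changelog. */
static size_t cpustats_footprint(size_t ncpus)
{
	size_t term1 = 2 * ncpus * 4;		/* assumed: 2-byte entries, 4 per CPU */
	size_t term2 = 1 * ncpus * 5 * 4;	/* assumed: 1-byte entries, 5 x 4 per CPU */
	size_t term3 = 1 * ncpus;		/* assumed: 1 byte per CPU */

	return term1 + term2 + term3;
}
```

With ncpus = MAX_CPUS this gives 65,536 + 163,840 + 8,192 = 237,568
bytes, i.e. 29 bytes per CPU.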

Bitao Hu (2):
watchdog/softlockup: low-overhead detection of interrupt storm
watchdog/softlockup: report the most frequent interrupts

kernel/watchdog.c | 240 ++++++++++++++++++++++++++++++++++++++++++++++
1 file changed, 240 insertions(+)

--
2.37.1 (Apple Git-137.1)