Re: Failed to create a rescuer kthread for the amdgpu-reset-dev workqueue

From: Christian König
Date: Fri Jan 12 2024 - 03:17:54 EST


Well the driver load is interrupted for some reason.

Have you set any timeout for modprobe?

Regards,
Christian.

Am 12.01.24 um 09:11 schrieb Thomas Perrot:
Hello,

We are updating the kernel from the 6.1 to the 6.6 and we observe an
amdgpu’s regression with Radeon RX580 8GB and SiFive Unmatched:
“workqueue: Failed to create a rescuer kthread for wq 'amdgpu-reset-
dev': -EINTR
[drm:amdgpu_reset_create_reset_domain [amdgpu]] *ERROR* Failed to
allocate wq for amdgpu_reset_domain!
amdgpu 0000:07:00.0: amdgpu: Fatal error during GPU init
amdgpu 0000:07:00.0: amdgpu: amdgpu: finishing device.
amdgpu: probe of 0000:07:00.0 failed with error -12”

We tried to figure it out without success for the moment, do you have
some advice to identify the root cause and to fix it?

Kind regards,
Thomas Perrot