Re: kernel 5.15 does not boot with 3ware card (never had this issue <= 5.14) - scsi 0:0:0:0: WARNING: (0x06:0x002C) : Command (0x12) timed out, resetting card

From: Douglas Miller
Date: Mon Nov 01 2021 - 16:03:15 EST


I have seen a similar problem with a different adapter and architecture: 5.14 worked and 5.15 did not. It was tracked down to a change in IRQ domain handling between the two kernels, which left an IRQ effectively non-functional. The fix was arch-specific (not x86_64), but it might be of interest:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5a4b0320783a

On 11/1/21 14:48, Justin Piszcz wrote:
On Mon, Nov 1, 2021 at 6:36 AM Justin Piszcz <jpiszcz@xxxxxxxxxxxxxxx> wrote:
On Sun, Oct 31, 2021 at 7:52 PM Bart Van Assche <bvanassche@xxxxxxx> wrote:
On 10/31/21 16:19, Justin Piszcz wrote:
Diff between 5.14 and 5.15 .config files-- could it be something to do with
CONFIG_IOMMU_DEFAULT_DMA_LAZY=y?
That's hard to say. Is CONFIG_MAGIC_SYSRQ enabled? If not, please enable
it and hit Alt-Printscreen-t (dump task list; see also
Documentation/admin-guide/sysrq.rst) and share the contents of the
kernel log. If that would not be convenient, please try to bisect this
issue.
[ .. ]
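
For the .config comparison raised above, here is a minimal Python sketch (not from the thread) that lists the options whose values differ between two kernel .config files, so a candidate such as CONFIG_IOMMU_DEFAULT_DMA_LAZY can be seen alongside every other change. The names parse_config and diff_configs are hypothetical, and treating "is not set" as "n" is an assumption made for easier comparison. The kernel tree's own scripts/diffconfig does a similar job.

```python
import re

# "CONFIG_FOO=value" lines and "# CONFIG_FOO is not set" lines.
_SET = re.compile(r"^(CONFIG_\w+)=(.*)$")
_UNSET = re.compile(r"^# (CONFIG_\w+) is not set$")


def parse_config(text):
    """Return {option: value}; options marked 'is not set' map to 'n'."""
    options = {}
    for line in text.splitlines():
        line = line.strip()
        match = _SET.match(line)
        if match:
            options[match.group(1)] = match.group(2)
            continue
        match = _UNSET.match(line)
        if match:
            options[match.group(1)] = "n"
    return options


def diff_configs(old_text, new_text):
    """Return sorted (option, old, new) tuples; None means the option is absent."""
    old = parse_config(old_text)
    new = parse_config(new_text)
    changes = []
    for name in sorted(set(old) | set(new)):
        if old.get(name) != new.get(name):
            changes.append((name, old.get(name), new.get(name)))
    return changes


def print_diff(old_path, new_path):
    """Print one 'OPTION: old -> new' line per differing option."""
    with open(old_path) as f_old, open(new_path) as f_new:
        for name, before, after in diff_configs(f_old.read(), f_new.read()):
            print(f"{name}: {before} -> {after}")
```

Calling print_diff() with the paths of the saved 5.14 and 5.15 config files (wherever they are kept) prints the differing options. Options that are new in 5.15 show "None" as the old value, which helps separate newly introduced defaults from deliberate changes.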

It appears that at this point in the boot process the keyboards (USB and
PS/2) are not yet available and/or do not respond in this scenario (I
do have CONFIG_MAGIC_SYSRQ enabled and have used it in the past). I'll
build the earlier 5.15-rc kernels (rc1 through rc7) to check where it
stopped working and reply to the list when I have that info.
[..]

I have tried all of the -rc's and they all hang at boot; keyboard
input (USB/PS/2) is not working at this stage in the boot process.
Are there any thoughts on how to debug this further?

[9.305954] 3w-sas: scsi0: Found an LSI 3ware 9750-24i4e Controller at
0xfb760000, IRQ: 45.
[9.617970] 3w-sas: scsi0: Firmware FH9X 5.12.00.016, BIOS BE9X
5.11.00.007, Phys: 28.
[30.498007] scsi 0:0:0:0: WARNING: (0x06:0x002C) : Command (0x12)
timed out, resetting card
[71.441958] scsi 0:0:0:0: WARNING: (0x06:0x002C) : Command (0x0) timed
out, resetting card.

# lilo
Added 5.14.8-1
Added 5.15.0-1 - hangs with the error above
Added 5.15.0-rc1-1 - hangs with the error above
Added 5.15.0-rc2-1 - hangs with the error above
Added 5.15.0-rc3-1 - hangs with the error above
Added 5.15.0-rc4-1 - hangs with the error above
Added 5.15.0-rc5-1 - hangs with the error above
Added 5.15.0-rc6-1 - hangs with the error above
Added 5.15.0-rc7-1 * - hangs with the error above

Regards,

Justin.