[PROBLEM] Lockup of >=2.4.20 on Tyan Thunder K7x S2469 + 3Ware Escalade 7800

From: Vincent Touquet (vincent@ulyssis.org)
Date: Wed Apr 16 2003 - 06:32:47 EST


Problem Description:
---------------------
I have a brandnew Tyan Thunder K7x S2469GN SMP motherboard
with a 3Ware Escalade 7800 RAID card, which has a RAID5
array of 7 Western Digital 120Gb [Caviar 1200JB SE MB]
and one hot spare.

Large amount of IO to the array cause the machine to
hangup. Quickly switching to console when I notice
that the machine is about to hang reveals the following
message:
Unit #0: Command (f6f27000) timed out, resetting card

I tried booting with noapic, as was suggested by the
great 3ware support team, but to no avail.

I tried different kernels (self compiled, standard Debian ones),
but nothing seems to help. Probably switching to a UP kernel
would solve the problem, but I would like to make use
of the second CPU too.

Useful Information:
---------------------
Some useful info is included in text attachments
(some lines are over 80 charachters so it would
 wrap horribly):

The output of lspci -vvvx
The output of cat /proc/interrupts
The output of dmesg

I included these data both for the case where the kernel
was booted with the noapic and without the noapic option.

I also included the .config of the kernel I used
(2.4.21-pre7). Note that plain 2.4.20 fails too.

Call for help :)
-----------------
Please CC: me in any reply. I follow lkml through the
online archives, so if you forget to CC, it won't be a
disaster :)

If there is any info lacking in this email, please tell
me so, this is my first problem report on lkml, and probably
I still have a lot to learn in that regard (and then some).

Should I compile in the magic SysReq support and then
try to grab some info out of the kernel when it hangs up ?
Some pointers on how to take this on would be helpful,
I will start by reading in more detail the lkml faq
entries at tux.org.

PCI hangs happened before on this mobo ?:
------------------------------------------
I found some other Tyan Thunder K7x related posts on lkml:
http://www.ussg.iu.edu/hypermail/linux/kernel/0108.1/0724.html
This one is interesting, cause I think it is also the PCI
bus which hangs in my case, causing the 3Ware card to time out
and then of course the whole system locks up.
Then again I don't use any sound related code in the kernel ...

Thanks for any help provided,

Vincent Touquet















-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Wed Apr 23 2003 - 22:00:18 EST