kernel soft lockup ECS-P6VEM3 VIA C3 GIGAPRO CPU motherboard

From: Calvin Arndt
Date: Mon Mar 30 2009 - 20:06:21 EST



Hi, we have an issue with 2.6 kernels on the CentaurHauls CPU of the ECS-P6VEM3 VIA C3 GIGAPRO motherboard.
The system stops responding to pings and drops ssh connections on all network interfaces.
This occurs after 10 hours and 37 minutes of uptime, and it reproduces every time.
The interesting part is that it is recoverable: walk up to the system, hit any key, and everything comes back to life.
In fact, any hardware input will make it start responding again, e.g. input via the serial console.
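
For anyone who wants to compare that interval against timer or counter wrap periods: uptime only reports minutes, so the hang happens somewhere between 38220 and 38279 seconds after boot. A minimal sketch of the conversion follows; the helper name uptime_to_seconds is made up for illustration.

```shell
#!/bin/sh
# Convert an "H:MM" uptime string (as printed by uptime) to seconds.
# uptime_to_seconds is a hypothetical helper name, not an existing tool.
uptime_to_seconds() {
    hours=${1%%:*}      # text before the colon
    minutes=${1##*:}    # text after the colon
    # Drop one leading zero so "08" or "09" is not parsed as invalid octal.
    minutes=${minutes#0}
    minutes=${minutes:-0}
    echo $(( hours * 3600 + minutes * 60 ))
}

uptime_to_seconds 10:37   # prints 38220
```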

I'm guessing something has disabled soft interrupts until a hardware interrupt is received, but I don't have a
clue how to troubleshoot this any further. This has been tested with 2.6.25.20 and with later kernels up to the current release.
The same kernel works flawlessly on our other hardware (wrap 2c, AMD 2600+ and several others).

Any thoughts?
TIA

Calvin....


root@tester:~# cat /proc/cpuinfo
processor : 0
vendor_id : CentaurHauls
cpu family : 6
model : 7
model name : VIA Ezra
stepping : 8
cpu MHz : 734.957
cache size : 64 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu de tsc msr cx8 mtrr pge mmx 3dnow
bogomips : 1469.91
clflush size : 32
power management:


ssh root@xxxxxxxxxxxxx 'while true;do uptime;sleep 10;done'
01:17:05 up 10:34, load average: 0.00, 0.00, 0.00
01:17:15 up 10:35, load average: 0.00, 0.00, 0.00
01:17:25 up 10:35, load average: 0.00, 0.00, 0.00
01:17:35 up 10:35, load average: 0.00, 0.00, 0.00
01:17:45 up 10:35, load average: 0.00, 0.00, 0.00
01:17:55 up 10:35, load average: 0.00, 0.00, 0.00
01:18:05 up 10:35, load average: 0.00, 0.00, 0.00
01:18:15 up 10:36, load average: 0.00, 0.00, 0.00
01:18:25 up 10:36, load average: 0.00, 0.00, 0.00
01:18:35 up 10:36, load average: 0.00, 0.00, 0.00
01:18:45 up 10:36, load average: 0.00, 0.00, 0.00
01:18:55 up 10:36, load average: 0.00, 0.00, 0.00
01:19:05 up 10:36, load average: 0.00, 0.00, 0.00
01:19:15 up 10:37, load average: 0.00, 0.00, 0.00
01:19:25 up 10:37, load average: 0.00, 0.00, 0.00
01:19:35 up 10:37, load average: 0.00, 0.00, 0.00
Read from remote host 192.168.200.1: Connection timed out
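
A possible next step, offered as an assumption rather than something already tried on this board: log /proc/interrupts with a timestamp on the box itself, so the last snapshot before the hang can be compared with the first one after a keypress revives the system. A timer counter that barely moved across the gap would hint that timer interrupts stopped arriving. The function name snapshot_irqs and its optional file argument are illustrative only.

```shell
#!/bin/sh
# snapshot_irqs: print a timestamp header followed by the interrupt counters.
# The optional argument (default /proc/interrupts) exists only so the
# function can be exercised against a saved copy of the file.
snapshot_irqs() {
    irq_file=${1:-/proc/interrupts}
    echo "=== $(date '+%H:%M:%S') ==="
    cat "$irq_file"
}
```

Run locally (for example from the serial console) so the log survives the ssh drop:
while true; do snapshot_irqs >> /tmp/irq.log; sleep 10; done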



