System crashes on startup with 2.3.29

Charlie Hedlin (charlie@hedlin.com)
Fri, 3 Dec 1999 20:51:06 -0600 (CST)


I am not subscribed to the list, but I checked some web archives and
didn't see anything on this. Please copy any responses to me at
charlie@hedlin.com.

Thanks

When my machine starts init it crashes immediately (the file system still
has the clean bit set when I boot 2.2.13 afterwards).

The system is a current Debian potato machine. I will include output of
ver_linux and relevant proc files at the end of this message.

When I first tried 2.3.29 the machine hung during the initialization of
the IDE driver. It would recognize the interfaces and hang before
printing information about the drives. I recompiled without IDE support
(my root is on sda1), and got the output below when init started. The APIC
error messages repeated for some time before the watchdog stepped in.

I repeated with init=/bin/sash and booted to a prompt, but when I ran ls
(I never tried anything else) the system did the same thing. There was no
watchdog that time, so I never got the message. Here is the output from
the normal boot:

APIC error interrupt on CPU#12, should never happen.
... APIC ESR0: 00000008
... APIC ESR1: 00000008
APIC error interrupt on CPU#0, should never happen.
... APIC ESR0: 00000004
... APIC ESR1: 00000004
NMI Watchdog detected LOCKUP on CPU0, registers:
CPU: 0
EIP: 0010:[<c01cf285>]
EFLAGS: 00000002
eax: 00000005 ebx: bffffafc ecx: bffff9bc edx: 00000018
esi: 00000002 edi: bffff9bc ebp: bffffb3c esp: c12f7fb0
ds: 0018 es: 0018 ss: 0018
Process init (pid: 1, stackpage=c12f7000)
Stack: bffffafc bffff9bc bffff93c 00000002 bffff9bc bffffb3c 00000005 bfff002b
00000018 00000043 c010a92d 00000010 00000246 0000002b 00000002 40096987
00000023 00000202 bffff8f0 0000002b
Call Trace: [<c010a92d>]
Code: f6 05 80 6a 25 c0 01 75 f7 e9 e2 15 f4 ff f0 0f ba 35 84 f6
console shuts up ...
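
The two ESR values in the log can be read bit by bit. The sketch below is
a minimal decoder, assuming the bit layout Intel documents for the
P6-family local APIC Error Status Register. The ESR_BITS table and the
decode_esr helper are illustrative names, not taken from the kernel source.

```python
# Assumed layout: P6-family local APIC Error Status Register.
# Bit 4 is treated as reserved here and is therefore not listed.
ESR_BITS = {
    0: "send checksum error",
    1: "receive checksum error",
    2: "send accept error",
    3: "receive accept error",
    5: "send illegal vector",
    6: "received illegal vector",
    7: "illegal register address",
}


def decode_esr(value):
    """Return the names of the error bits set in an ESR value, lowest bit first."""
    return [name for bit, name in sorted(ESR_BITS.items()) if value & (1 << bit)]


# ESR values copied from the boot log.
for cpu, esr in (("CPU#12", 0x00000008), ("CPU#0", 0x00000004)):
    print(cpu, hex(esr), decode_esr(esr))
```

Under that assumption, 00000008 on CPU#12 reads as a receive accept error
and 00000004 on CPU#0 as a send accept error.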

Here is the file piped through ksymoops -K -L -O -m /boot/System.map-2.3.29
while running 2.2.13.

ksymoops 0.7c on i686 2.2.13. Options used
-V (default)
-K (specified)
-L (specified)
-O (specified)
-m /boot/System.map-2.3.29 (specified)

CPU: 0
EIP: 0010:[<c01cf285>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00000002
eax: 00000005 ebx: bffffafc ecx: bffff9bc edx: 00000018
esi: 00000002 edi: bffff9bc ebp: bffffb3c esp: c12f7fb0
ds: 0018 es: 0018 ss: 0018
Process init (pid: 1, stackpage=c12f7000)
Stack: bffffafc bffff9bc bffff93c 00000002 bffff9bc bffffb3c 00000005 bfff002b
00000018 00000043 c010a92d 00000010 00000246 0000002b 00000002 40096987
00000023 00000202 bffff8f0 0000002b
Call Trace: [<c010a92d>]
Code: f6 05 80 6a 25 c0 01 75 f7 e9 e2 15 f4 ff f0 0f ba 35 84 f6

>>EIP; c01cf285 <stext_lock+435/7a70> <=====
Trace; c010a92d <restore_all+8/f>
Code; c01cf285 <stext_lock+435/7a70>
00000000 <_EIP>:
Code; c01cf285 <stext_lock+435/7a70> <=====
0: f6 05 80 6a 25 c0 01 testb $0x1,0xc0256a80 <=====
Code; c01cf28c <stext_lock+43c/7a70>
7: 75 f7 jne 0 <_EIP>
Code; c01cf28e <stext_lock+43e/7a70>
9: e9 e2 15 f4 ff jmp fff415f0 <_EIP+0xfff415f0> c0110875 <smp_error_interrupt+1/78>
Code; c01cf293 <stext_lock+443/7a70>
e: f0 0f ba 35 84 f6 00 lock btrl $0x0,0xf684
Code; c01cf29a <stext_lock+44a/7a70>
15: 00 00
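
For readers unfamiliar with ksymoops, the symbol+offset annotations come
from a nearest-symbol lookup against System.map, and the jmp target comes
from adding the signed 32-bit displacement to the address of the next
instruction. The sketch below reproduces both steps. It is only an
illustration: the three symbol addresses are derived from the output
above (address minus offset), not read from the real System.map, and
resolve and jmp_rel32_target are hypothetical helper names.

```python
import bisect

# Symbol start addresses derived from the ksymoops output (address - offset).
# A real System.map has thousands of entries; three are enough here.
SYMBOLS = sorted([
    (0xC010A925, "restore_all"),
    (0xC0110874, "smp_error_interrupt"),
    (0xC01CEE50, "stext_lock"),
])
ADDRS = [addr for addr, _ in SYMBOLS]


def resolve(addr):
    """Map an address to 'symbol+offset' using the nearest symbol at or below it."""
    i = bisect.bisect_right(ADDRS, addr) - 1
    if i < 0:
        return hex(addr)  # below the first known symbol
    base, name = SYMBOLS[i]
    return f"{name}+{addr - base:x}"


def jmp_rel32_target(insn_addr, rel32):
    """Target of a 5-byte 'e9 rel32' jump, wrapped to 32 bits."""
    return (insn_addr + 5 + rel32) & 0xFFFFFFFF


print(resolve(0xC01CF285))   # the EIP from the oops
print(resolve(0xC010A92D))   # the single call-trace entry
# Displacement bytes e2 15 f4 ff are little-endian, i.e. 0xfff415e2.
target = jmp_rel32_target(0xC01CF28E, 0xFFF415E2)
print(hex(target), resolve(target))
```

The lookup places the EIP inside stext_lock, and the jump leads back to
the start of smp_error_interrupt, matching the annotations in the
ksymoops output.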

The rest of this message describes my computer; all of it was collected
while running kernel 2.2.13.

First, output from ver_linux
-- Versions installed: (if some fields are empty or look
-- unusual then possibly you have very old versions)
Linux rodent 2.2.13 #1 SMP Wed Dec 1 23:23:05 CST 1999 i686 unknown
Kernel modules 2.3.7
Gnu C 2.95.2
Binutils 2.9.5.0.19
Linux C Library 2.1.2
Dynamic linker ldd: version 1.9.11
Procps 2.0.6
Mount 2.10
Net-tools 2.05
Kbd 0.99
Sh-utils 2.0
Sh-utils Parker.
Sh-utils
Sh-utils Inc.
Sh-utils NO
Sh-utils PURPOSE.
Modules Loaded nfsd lockd sunrpc isofs ip_masq_vdolive ip_masq_raudio ip_masq_quake ip_masq_irc ip_masq_ftp ip_masq_cuseeme lm_sensors sr_mod sg ide-cd cdrom lp parport_pc parport tuner bttv videodev i2c 3c59x eepro100 serial

Here is /proc/cpuinfo

processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 1
model name : Pentium Pro
stepping : 9
cpu MHz : 198.669532
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
sep_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
bogomips : 198.25

processor : 12
vendor_id : GenuineIntel
cpu family : 6
model : 1
model name : Pentium Pro
stepping : 6
cpu MHz : 198.669532
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
sep_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
bogomips : 198.25

/proc/pci

PCI devices found:
Bus 0, device 0, function 0:
Host bridge: Intel 82441FX Natoma (rev 2).
Medium devsel. Fast back-to-back capable. Master Capable. Latency=32.
Bus 0, device 6, function 0:
Ethernet controller: Intel 82557 (rev 2).
Medium devsel. Fast back-to-back capable. IRQ 18. Master Capable. Latency=72. Min Gnt=8.Max Lat=56.
Prefetchable 32 bit memory at 0xffbef000 [0xffbef008].
I/O at 0xff20 [0xff21].
Non-prefetchable 32 bit memory at 0xff800000 [0xff800000].
Bus 0, device 7, function 0:
ISA bridge: Intel 82371SB PIIX3 ISA (rev 1).
Medium devsel. Fast back-to-back capable. Master Capable. No bursts.
Bus 0, device 7, function 1:
IDE interface: Intel 82371SB PIIX3 IDE (rev 0).
Medium devsel. Fast back-to-back capable. Master Capable. Latency=32.
I/O at 0xffa0 [0xffa1].
Bus 0, device 7, function 2:
USB Controller: Intel 82371SB PIIX3 USB (rev 1).
Medium devsel. Fast back-to-back capable. IRQ 9. Master Capable. Latency=32.
I/O at 0xff40 [0xff41].
Bus 0, device 9, function 0:
SCSI storage controller: Adaptec AIC-7880U (rev 0).
Medium devsel. Fast back-to-back capable. IRQ 17. Master Capable. Latency=72. Min Gnt=8.Max Lat=8.
I/O at 0xfc00 [0xfc01].
Non-prefetchable 32 bit memory at 0xffbeb000 [0xffbeb000].
Bus 0, device 11, function 0:
VGA compatible controller: NVidia Unknown device (rev 3).
Vendor id=10de. Device id=20.
Medium devsel. Fast back-to-back capable. IRQ 16. Master Capable. Latency=48. Min Gnt=5.Max Lat=1.
Non-prefetchable 32 bit memory at 0xfb000000 [0xfb000000].
Prefetchable 32 bit memory at 0xfa000000 [0xfa000008].
Bus 0, device 15, function 0:
Ethernet controller: 3Com 3C590 10bT (rev 0).
Medium devsel. IRQ 17. Master Capable. Latency=248. Min Gnt=3.Max Lat=8.
I/O at 0xff80 [0xff81].
Bus 0, device 17, function 0:
Multimedia video controller: Brooktree Bt848 (rev 17).
Medium devsel. Fast back-to-back capable. IRQ 18. Master Capable. Latency=136. Min Gnt=16.Max Lat=40.
Prefetchable 32 bit memory at 0xffbea000 [0xffbea008].

/proc/dma
4: cascade

/proc/interrupts
CPU0 CPU1
0: 152642 0 XT-PIC timer
1: 438 428 IO-APIC-edge keyboard
2: 0 0 XT-PIC cascade
4: 14695 15816 IO-APIC-edge serial
7: 24166 1 IO-APIC-edge parport0
8: 0 2 IO-APIC-edge rtc
12: 76 26 IO-APIC-edge PS/2 Mouse
13: 1 0 XT-PIC fpu
14: 2628 1440 IO-APIC-edge ide0
17: 8453 8697 IO-APIC-level aic7xxx, eth1
18: 39273 40349 IO-APIC-level bttv, Intel EtherExpress Pro 10/100 Ethernet
NMI: 0
ERR: 0

/proc/meminfo

total: used: free: shared: buffers: cached:
Mem: 164462592 95125504 69337088 64626688 20840448 45547520
Swap: 68153344 0 68153344
MemTotal: 160608 kB
MemFree: 67712 kB
MemShared: 63112 kB
Buffers: 20352 kB
Cached: 44480 kB
SwapTotal: 66556 kB
SwapFree: 66556 kB

The memory is ECC and the system hasn't recorded any errors since August.
It has logged 3 single-bit errors in 2 years, so I might have a problem,
but I doubt it (I had never checked the count before).

If I can do anything else to help, please let me know,
Thanks,
Charlie Hedlin
