Double oops with 2.3.9pre5+lm_sensors, netscape, and Seti@Home

Bradley M Keryan (keryan@andrew.cmu.edu)
Thu, 1 Jul 1999 02:02:46 -0400 (EDT)


There's a lot of info in this message, 99% of which is probably
irrelevant, but I don't know which 1% to post so please bear with me :)

[1.] Summary
This oops occurred while running netscape-navigator and Seti@Home (plus
some gnome-terminals and ssh).

[2.] Description
Seti@Home had been running for about a day. Netscape had been running for
about 30 minutes. While I was using ssh, the oops occurred (I heard it
get written to the syslog), netscape locked up, and Seti@Home segfaulted.

I had some of the lm_sensors 2.3.4 modules loaded, and I think this might
be related to lm_sensors. When I run Seti@Home, the CPU temperature
increases significantly (it usually goes up to 57 or 59C). I have 60C set
in the BIOS as the "alarm temperature" and 65C set as the "shutdown
temperature". After the oopses occurred, running "sensors" told me the
CPU temperature had hit 61C, which is why I mentioned lm_sensors.

This is on a Tyan Trinity 100AT S1590S with version 1.16
BIOS and an AMD K6-2 450. I would assume that the BIOS temperature
checking is done with SMM code; could that be a problem?

Also, I was for some stupid reason running "yes" in the background (don't
ask).

This doesn't seem to be an egcs-vs-kernel problem, because I compiled it
with gcc 2.7.2.3 (albeit, that copy of gcc was compiled with egcs 1.1.2,
and I'm unaware of any problems with doing that).

Also, UHCI, V4L, and CPIA are compiled in and a CPIA camera is plugged in,
but it has not been used since I booted.

[4.] /proc/version
Linux version 2.3.9 (root@fatale.keryan.org) (gcc version 2.7.2.3) #8 Tue
Jun 29 22:39:54 CDT 1999

[5.] Oops
Jul 1 00:14:10 fatale kernel: Unable to handle kernel paging request at virtual address a0194f1c
Jul 1 00:14:10 fatale kernel: current->tss.cr3 = 0159f000, %cr3 = 0159f000
Jul 1 00:14:10 fatale kernel: *pde = 00000000
Jul 1 00:14:10 fatale kernel: Oops: 0000
Jul 1 00:14:10 fatale kernel: CPU: 0
Jul 1 00:14:10 fatale kernel: EIP: 0010:[<a0194f1c>]
Jul 1 00:14:10 fatale kernel: EFLAGS: 00010206
Jul 1 00:14:10 fatale kernel: eax: 00000040 ebx: c381ded0 ecx: c2de2920 edx: 00000040
Jul 1 00:14:10 fatale kernel: esi: c381df00 edi: c0185a90 ebp: 00000000 esp: c381de94
Jul 1 00:14:10 fatale kernel: ds: 0018 es: 0018 ss: 0018
Jul 1 00:14:10 fatale kernel: Process netscape-naviga (pid: 8368, process nr: 49, stackpage=c381d000)
Jul 1 00:14:10 fatale kernel: Stack: c0185a90 c2425a80 c2de2920 50011bc0 c015c0fa c2425a80 c381df00 00000918
Jul 1 00:14:10 fatale kernel: c381ded0 c2425a80 c2425a80 00000918 00000002 c381ded0 c0185a90 000020b0
Jul 1 00:14:10 fatale kernel: 000001f4 000001f4 00000000 00000000 c015c3cc c2425a80 c381df00 00000918
Jul 1 00:14:11 fatale kernel: Call Trace: [unix_stream_sendmsg+0/672] [sock_sendmsg+138/176] [unix_stream_sendmsg+0/672] [sock_readv_writev+140/160] [do_readv_writev+334/512] [handle_IRQ_event+49/112] [do_8259A_IRQ+127/172]
Jul 1 00:14:11 fatale kernel: [do_8259A_IRQ+159/172] [sys_writev+89/112] [system_call+52/56]
Jul 1 00:14:11 fatale kernel: Code: <1>Unable to handle kernel paging request at virtual address a0194f1c
Jul 1 00:14:11 fatale kernel: current->tss.cr3 = 0159f000, %cr3 = 0159f000
Jul 1 00:14:11 fatale kernel: *pde = 00000000
Jul 1 00:14:11 fatale kernel: Oops: 0000
Jul 1 00:14:11 fatale kernel: CPU: 0
Jul 1 00:14:11 fatale kernel: EIP: 0010:[show_registers+631/688]
Jul 1 00:14:11 fatale kernel: EFLAGS: 00010086
Jul 1 00:14:11 fatale kernel: eax: a0194f1c ebx: 00000000 ecx: c023e2d4 edx: c367a000
Jul 1 00:14:11 fatale kernel: esi: 0000002b edi: c381e000 ebp: c4000000 esp: c381ddd4
Jul 1 00:14:11 fatale kernel: ds: 0018 es: 0018 ss: 0018
Jul 1 00:14:11 fatale kernel: Process netscape-naviga (pid: 8368, process nr: 49, stackpage=c381d000)
Jul 1 00:14:11 fatale kernel: Stack: a0194f1c c381c000 c023e2d4 c381df00 c0185a90 00000000 00000040 c381ded0
Jul 1 00:14:11 fatale kernel: c2de2920 00000040 a0194f1c 00010206 c47f0000 c4800000 c0109990 c381de58
Jul 1 00:14:11 fatale kernel: c0207678 c0208ac9 00000000 00000000 c010f2c6 c0208ac9 c381de58 00000000
Jul 1 00:14:11 fatale kernel: Call Trace: [unix_stream_sendmsg+0/672] [soundcore:mod_firmware_load+4037120/6113200] [soundcore:mod_firmware_load+4102656/6047664] [die+48/64] [stext_lock+6220/12040] [stext_lock+11421/12040] [do_page_fault+902/1056]
Jul 1 00:14:11 fatale kernel: [stext_lock+11421/12040] [unix_stream_sendmsg+0/672] [error_code+45/52] [unix_stream_sendmsg+0/672] [unix_stream_sendmsg+0/672] [sock_sendmsg+138/176] [unix_stream_sendmsg+0/672] [sock_readv_writev+140/160]
Jul 1 00:14:11 fatale kernel: [do_readv_writev+334/512] [handle_IRQ_event+49/112] [do_8259A_IRQ+127/172] [do_8259A_IRQ+159/172] [sys_writev+89/112] [system_call+52/56]
Jul 1 00:14:11 fatale kernel: Code: 0f b6 0c 03 89 4c 24 38 51 68 70 76 20 c0 e8 b6 b0 00 00 83

soundcore:mod_firmware_load looks sorta suspicious, considering I have an
SB16 which I don't think has firmware.

[7.1.] Software
Kernel modules 2.1.121
Gnu C egcs-2.91.66
Binutils 2.9.1.0.23
Linux C Library 2.1.1
Dynamic linker ldd (GNU libc) 2.1.1
Procps 2.0.2
Mount 2.9o
Net-tools 1.51
Kbd [option...]
Sh-utils 1.16
Modules Loaded sr_mod bsd_comp ppp slhc gl518sm sensors smbus
i2c-via algo-bit i2c-proc i2c-core st opl3 sb uart401 sound soundcore

[7.2.] Processor info
processor : 0
vendor_id : AuthenticAMD
cpu family : 5
model : 8
model name : AMD-K6(tm) 3D processor
stepping : 12
cpu MHz : 451.033204
fdiv_bug : no
hlt_bug : no
sep_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr mce cx8 sep mtrr pge mmx 3dnow
bogomips : 897.84

[7.3.] Module info
sr_mod 18656 1 (autoclean)
bsd_comp 3752 1 (autoclean)
ppp 21056 2 (autoclean) [bsd_comp]
slhc 4508 1 (autoclean) [ppp]
gl518sm 7544 0 (unused)
sensors 5232 0 [gl518sm]
smbus 1820 0 [gl518sm sensors]
i2c-via 1188 0 (unused)
algo-bit 8708 0 [i2c-via]
i2c-proc 2732 0
i2c-core 4464 0 [gl518sm sensors smbus algo-bit i2c-proc]
st 25404 0 (unused)
opl3 10928 1
sb 35012 1
uart401 5908 1 [sb]
sound 60984 0 [opl3 sb uart401]
soundcore 2476 6 [sb sound]

[7.4.] /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: SEAGATE Model: ST32155N Rev: 0318
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: WangDAT Model: Model 1300 Rev: 02.5
Type: Sequential-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 03 Lun: 00
Vendor: PLEXTOR Model: CD-ROM PX-8XCS Rev: 1.07
Type: CD-ROM ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 04 Lun: 00
Vendor: SONY Model: CD-R CDU948S Rev: 1.0h
Type: CD-ROM ANSI SCSI revision: 02

scsi0 is an aic7xxx, but I was not accessing it at the time.

Mounted filesystems:
/dev/hda2 on / type ext2 (rw)
none on /proc type proc (rw)
/dev/hda6 on /home type ext2 (rw)
/dev/sda1 on /mnt/data type ext2 (rw)
none on /dev/pts type devpts (rw,mode=0622)

.config (which shows I have way too much stuff loaded)
CONFIG_EXPERIMENTAL=y
CONFIG_M586TSC=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_TSC=y
CONFIG_1GB=y
CONFIG_MTRR=y
CONFIG_MODULES=y
CONFIG_KMOD=y
CONFIG_NET=y
CONFIG_PCI=y
CONFIG_PCI_GOANY=y
CONFIG_PCI_BIOS=y
CONFIG_PCI_DIRECT=y
CONFIG_PCI_QUIRKS=y
CONFIG_PCI_OLD_PROC=y
CONFIG_SYSVIPC=y
CONFIG_SYSCTL=y
CONFIG_BINFMT_AOUT=m
CONFIG_BINFMT_ELF=y
CONFIG_BINFMT_MISC=m
CONFIG_PARPORT=y
CONFIG_PARPORT_PC=y
CONFIG_PNP=y
CONFIG_PNP_PARPORT=y
CONFIG_BLK_DEV_FD=y
CONFIG_BLK_DEV_IDE=y
CONFIG_BLK_DEV_IDEDISK=y
CONFIG_BLK_DEV_IDECD=y
CONFIG_BLK_DEV_CMD640=y
CONFIG_BLK_DEV_RZ1000=y
CONFIG_BLK_DEV_IDEPCI=y
CONFIG_BLK_DEV_IDEDMA_PCI=y
CONFIG_IDEDMA_PCI_AUTO=y
CONFIG_BLK_DEV_VIA82C586=y
CONFIG_BLK_DEV_IDEDMA=y
CONFIG_IDEDMA_AUTO=y
CONFIG_BLK_DEV_LOOP=m
CONFIG_BLK_DEV_RAM=m
CONFIG_PARIDE_PARPORT=y
CONFIG_BLK_DEV_IDE_MODES=y
CONFIG_PACKET=y
CONFIG_FIREWALL=y
CONFIG_UNIX=y
CONFIG_INET=y
CONFIG_IP_FIREWALL=y
CONFIG_IP_ALWAYS_DEFRAG=y
CONFIG_IP_MASQUERADE=y
CONFIG_IP_MASQUERADE_ICMP=y
CONFIG_IP_ALIAS=y
CONFIG_SKB_LARGE=y
CONFIG_SCSI=y
CONFIG_BLK_DEV_SD=y
CONFIG_CHR_DEV_ST=m
CONFIG_BLK_DEV_SR=m
CONFIG_BLK_DEV_SR_VENDOR=y
CONFIG_CHR_DEV_SG=m
CONFIG_SCSI_MULTI_LUN=y
CONFIG_SCSI_CONSTANTS=y
CONFIG_SCSI_AIC7XXX=y
CONFIG_AIC7XXX_TCQ_ON_BY_DEFAULT=y
CONFIG_AIC7XXX_CMDS_PER_DEVICE=8
CONFIG_AIC7XXX_RESET_DELAY=5
CONFIG_NETDEVICES=y
CONFIG_DUMMY=m
CONFIG_NET_ETHERNET=y
CONFIG_NET_EISA=y
CONFIG_DEC_ELCP=y
CONFIG_PPP=m
CONFIG_SLIP=m
CONFIG_SLIP_COMPRESSED=y
CONFIG_SLIP_SMART=y
CONFIG_VT=y
CONFIG_VT_CONSOLE=y
CONFIG_SERIAL=y
CONFIG_UNIX98_PTYS=y
CONFIG_UNIX98_PTY_COUNT=256
CONFIG_PRINTER=m
CONFIG_PRINTER_READBACK=y
CONFIG_MOUSE=y
CONFIG_PSMOUSE=y
CONFIG_RTC=y
CONFIG_VIDEO_DEV=y
CONFIG_JOYSTICK=m
CONFIG_JOY_ANALOG=m
CONFIG_USB=y
CONFIG_USB_UHCI=y
CONFIG_USB_HUB=y
CONFIG_USB_MOUSE=y
CONFIG_USB_KBD=y
CONFIG_USB_CPIA=y
CONFIG_AUTOFS_FS=y
CONFIG_FAT_FS=m
CONFIG_MSDOS_FS=m
CONFIG_VFAT_FS=m
#(note: didn't build, but I wasn't surprised or alarmed)
CONFIG_ISO9660_FS=y
CONFIG_JOLIET=y
CONFIG_PROC_FS=y
CONFIG_DEVPTS_FS=y
CONFIG_EXT2_FS=y
CONFIG_NFS_FS=y
CONFIG_SUNRPC=y
CONFIG_LOCKD=y
CONFIG_BSD_DISKLABEL=y
CONFIG_NLS=y
CONFIG_NLS_CODEPAGE_437=m
CONFIG_NLS_ISO8859_1=m
CONFIG_VGA_CONSOLE=y
CONFIG_VIDEO_SELECT=y
CONFIG_FB=y
CONFIG_DUMMY_CONSOLE=y
CONFIG_VIDEO_SELECT=y
CONFIG_FB_MATROX=y
CONFIG_FB_MATROX_MILLENIUM=y
CONFIG_FBCON_ADVANCED=y
CONFIG_FBCON_MFB=y
CONFIG_FBCON_CFB2=y
CONFIG_FBCON_CFB4=y
CONFIG_FBCON_CFB8=y
CONFIG_FBCON_CFB16=y
CONFIG_FBCON_CFB24=y
CONFIG_FBCON_CFB32=y
CONFIG_FBCON_VGA_PLANES=y
CONFIG_FBCON_VGA=y
CONFIG_FONT_8x8=y
CONFIG_FONT_8x16=y
CONFIG_SOUND=m
CONFIG_SOUND_OSS=m
CONFIG_SOUND_SB=m
CONFIG_SOUND_ADLIB=m
CONFIG_SOUND_SOFTOSS=m
CONFIG_SOUND_YM3812=m
CONFIG_MAGIC_SYSRQ=y

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/