Kernel Panic with raid0 and wu-ftpd on 2.3.5ac1

synFlood (synflood@endor.sick.cl)
Thu, 1 Jul 1999 01:11:47 -0400


[1.] One line summary of the problem:
Kernel Panic with raid0 on 2.3.5ac1

[2.] Full description of the problem/report:
i run a ftp server wich had 1 raid0 disk wich consist on a bunch o
IDE and SCSI disks. all running into 1 single md device. from time to
time, when users try to download a file from the FTP the kernel fails with
a kernel Oops and the ftp process (proftpd 1.2.0pre3) fall into D state and
ps -auxwl shows :

100 598 3804 241 0 0 1352 0 wait_on_pag DW ? 0:00
(in.ftpd)

also system load start to raise really fast, but the machine is completely usable

[3.] Keywords (i.e., modules, networking, kernel):
raid0, ftp
[4.] Kernel version (from /proc/version):
Linux version 2.3.5ac1 (root@lot.sick.cl) (gcc version egcs-2.90.29 980515 (egcs-1.0.3 release)) #1 Sun Jun 13 22:42:12 CLT 1999

[5.] Output of Oops.. message (if applicable) with symbolic information
resolved (see Documentation/oops-tracing.txt)
ksymoops 0.7c on i686 2.3.5ac1. Options used
-V (default)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.3.5ac1/ (default)
-m /usr/src/linux/System.map (default)

Warning: You did not tell me where to find symbol information. I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc. ksymoops -h explains the options.

Unable to handle kernel paging request at virtual address c40484a0
current->tss.cr3 = 03264000, pr3 = 03264000
*pde = 003df063
*pte = 00000000
Oops: 0000
CPU: 0
EIP: 0010:[<c4014494>]
EFLAGS: 00010202
eax: 0332c4c3 ebx: c40484a0 ecx: 00000008 edx: 00000006
esi: 00000002 edi: 0ccb130e ebp: 00000004 esp: c2d55e2c
ds: 0018 es: 0018 ss: 0018
Process in.ftpd (pid: 7228, process nr: 144, stackpage=c2d55000)
Stack: 00000004 00000009 c4018090 1996261c c4016000 c0127f4e 00000001 00000000
c017b9bf c0236180 c36434ee c36434f0 00000002 c01764ad 00000000 c36434ee
c36434f0 00000002 c36434e0 266fa203 00000000 00000100 00000400 c0128538
Call Trace: [<c4018090>] [<c4016000>] [<c0127f4e>] [<c017b9bf>] [<c01764ad>]
[<c0128538>] [<c01286df>]
[<c011de98>] [<c011833d>] [<c011dfdf>] [<c011df38>] [<c0125e19>] [<c0109e34>]
Code: 8b 33 8b 06 03 46 08 39 c7 7c 1f 8b 5b 04 85 db 75 16 57 68
Reading Oops report from the terminal
Unable to handle kernel paging request at virtual address c40484a0
current->tss.cr3 = 03264000, pr3 = 03264000
*pde = 003df063
Oops: 0000
CPU: 0
EIP: 0010:[<c4014494>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010202
eax: 0332c4c3 ebx: c40484a0 ecx: 00000008 edx: 00000006
esi: 00000002 edi: 0ccb130e ebp: 00000004 esp: c2d55e2c
ds: 0018 es: 0018 ss: 0018
Process in.ftpd (pid: 7228, process nr: 144, stackpage=c2d55000)
Stack: 00000004 00000009 c4018090 1996261c c4016000 c0127f4e 00000001 00000000
c017b9bf c0236180 c36434ee c36434f0 00000002 c01764ad 00000000 c36434ee
c36434f0 00000002 c36434e0 266fa203 00000000 00000100 00000400 c0128538
Call Trace: [<c4018090>] [<c4016000>] [<c0127f4e>] [<c017b9bf>] [<c01764ad>]
[<c0128538>] [<c01286df>]
[<c011de98>] [<c011833d>] [<c011dfdf>] [<c011df38>] [<c0125e19>] [<c0109e34>]
Code: 8b 33 8b 06 03 46 08 39 c7 7c 1f 8b 5b 04 85 db 75 16 57 68

>>EIP; c4014494 <[raid0]raid0_map+a4/138> <=====
Trace; c4018090 <[raid0]__module_using_checksums+38fc/f8b8>
Trace; c4016000 <[raid0]__module_using_checksums+186c/f8b8>
Trace; c0127f4e <create_buffers+62/298>
Trace; c017b9bf <md_map+5b/60>
Trace; c01764ad <ll_rw_block+e9/1fc>
Trace; c0128538 <brw_page+280/354>
Trace; c01286df <generic_readpage+7f/8c>
Trace; c011de98 <do_generic_file_read+4f0/590>
Trace; c011833d <do_getitimer+a5/b0>
Trace; c011dfdf <generic_file_read+57/70>
Trace; c011df38 <file_read_actor+0/50>
Trace; c0125e19 <sys_read+b1/c8>
Trace; c0109e34 <system_call+34/38>
Code; c4014494 <[raid0]raid0_map+a4/138>
00000000 <_EIP>:
Code; c4014494 <[raid0]raid0_map+a4/138> <=====
0: 8b 33 movl (%ebx),%esi <=====
Code; c4014496 <[raid0]raid0_map+a6/138>
2: 8b 06 movl (%esi),%eax
Code; c4014498 <[raid0]raid0_map+a8/138>
4: 03 46 08 addl 0x8(%esi),%eax
Code; c401449b <[raid0]raid0_map+ab/138>
7: 39 c7 cmpl %eax,%edi
Code; c401449d <[raid0]raid0_map+ad/138>
9: 7c 1f jl 2a <_EIP+0x2a> c40144be
<[raid0]raid0_map+ce/138>
Code; c401449f <[raid0]raid0_map+af/138>
b: 8b 5b 04 movl 0x4(%ebx),%ebx
Code; c40144a2 <[raid0]raid0_map+b2/138>
e: 85 db testl %ebx,%ebx
Code; c40144a4 <[raid0]raid0_map+b4/138>
10: 75 16 jne 28 <_EIP+0x28> c40144bc
<[raid0]raid0_map+cc/138>
Code; c40144a6 <[raid0]raid0_map+b6/138>
12: 57 pushl %edi
Code; c40144a7 <[raid0]raid0_map+b7/138>
13: 68 00 00 00 00 pushl $0x0

1 warning issued. Results may not be reliable.

[6.] A small shell script or example program which triggers the
problem (if possible)
[7.] Environment

[7.1.] Software (add the output of the ver_linux script here)
Linux lot.sick.cl 2.3.5ac1 #1 Sun Jun 13 22:42:12 CLT 1999 i686 unknown
Kernel modules 2.1.121
Gnu C egcs-2.90.29 980515 (egcs-1.0.3 release)
Binutils 2.9.1.0.15
Linux C Library 2.0.7
Dynamic linker ldd (GNU libc) 2.0.7
Linux C++ Library 2.8.0
Procps 1.2.9
Mount 2.8a
Net-tools 1.50
Kbd 0.96
Sh-utils 1.16
Modules Loaded 3c59x raid0 aha152x

[7.2.] Processor information (from /proc/cpuinfo):
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 3
model name : Pentium II (Klamath)
stepping : 4
cpu MHz : 267.279122
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
sep_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov mmx
bogomips : 266.24

[7.3.] Module information (from /proc/modules):
3c59x 17252 1 (autoclean)
raid0 1744 2 (autoclean)
aha152x 25068 1

[7.4.] SCSI information (from /proc/scsi/scsi)
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: SEAGATE Model: ST15230N Rev: 0638
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: QUANTUM Model: PD1050iS Rev: 3110
Type: Direct-Access ANSI SCSI revision: 02

[7.5.] Other information that might be relevant to the problem
(please look in /proc and include all information that you
think to be relevant):

/proc/mdstat
Personalities : [2 raid0]
read_ahead 128 sectors
md0 : active raid0 sda1 hdd1 hdb3 hdc1 11035504 blocks 4k chunks
md1 : active raid0 hdb1 hdc2 2270992 blocks 4k chunks
md2 : inactive
md3 : inactive

Partition Tables:
Device Boot Start End Blocks Id System
/dev/sda1 1 4095 4193264 83 Linux native
/dev/hdd1 1 8338 4202320+ 83 Linux native
/dev/hdc1 1 4162 2097616+ 83 Linux native
/dev/hdc2 4163 7752 1809360 83 Linux native
/dev/hdb1 1 229 461632+ 83 Linux native
/dev/hdb3 332 600 542304 83 Linux native

Raid Device:
/dev/md0 10628407 6964389 3664018 66% /ftp

[X.] Other notes, patches, fixes, workarounds:

Thank you

-- 

SynFl00d

email : synflood@endor.sick.cl Efax Number : (815) 366-3177 ICQ : 6994327

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/