1.3.69 oopses

Thomas =?ISO-8859-1?Q?K=F6nig?= (Thomas.Koenig@ciw.uni-karlsruhe.de)
Tue, 27 Feb 1996 20:48:16 +0100 (MET)


1.3.69 oopsed on me when I ran ssh-keyscan on 6000 hosts (same
test which led to the crash on 1.3.68). As last time, all
of the hosts were on the same 'local' network.

1.3.69 oopsed after some time:

Oops: 0000
CPU: 0
EIP: 0010:[<0014e24f>]
EFLAGS: 00010246
eax: 20202020 ebx: 00000000 ecx: 00000000 edx: 00000000
esi: 0114f098 edi: 0000004f ebp: 01142f34 esp: 01142f04
ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Process cat (pid: 151, process nr: 28, stackpage=01142000)
Stack: 0114e000 0014f114 0141f8a0 00000400 0000003a 00000000 0000004f 33323130
37363534 42413938 46454443 00000000 303a3034 44303a30 3a30383a 313a3130
00000030 00000293 0141f8a0 00001000 001582cd 0114e000 01142f90 00001c00
Call Trace: [<0014f114>] [<001582cd>] [<0014f114>] [<00123888>] [<0010a849>]
Code: 0f b6 40 57 39 c1 7d 2e 8a 44 31 18 c0 e8 04 25 ff 00 00 00

0014dd14 T _arp_query
0014ddd4 t _arp_set_predefined
0014de50 T _arp_find
0014e1ec T _arp_get_info <===============
0014e3a4 T _arp_bind_cache
0014e77c t _arp_run_bh
0014e990 t _arp_req_set
0014ec04 t _arp_req_get

Immediately after, a second oops:

kfree of non-kmalloced memory: 0114f390, next= 30313a31, order=538978848
Oops: 0000
CPU: 0
EIP: 0010:[<00112bd2>]
EFLAGS: 00010017
eax: ffffffff ebx: 00000286 ecx: 01ef4f24 edx: 32783020
esi: 01efd018 edi: ffffffff ebp: 01ef4f08 esp: 01ef4f04
ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Process bash (pid: 121, process nr: 5, stackpage=01ef4000)
Stack: 01df7018 01ef4f38 001126fb 01ef4f24 0128bc90 01ef4f68 01b20001 001b58d4
00000000 00000000 ffffffff 01efd018 00112410 01b8f000 00191ddf 01b8f000
0128bc90 01b2fd90 00000001 00000000 00186577 00000000 00000000 00000001
Call Trace: [<001126fb>] [<00112410>] [<00191ddf>] [<00186577>] [<00186491>] [<00123888>] [<0010a849>]
Code: 39 42 08 72 f9 89 11 8b 42 04 89 41 04 89 4a 04 8b 41 04 89

00112b00 T _sleep_on
00112bbc T _add_timer
00112bf0 T _del_timer <==================
00112c28 t _count_active_tasks
00112cc8 t _second_overflow
00112ecc t _timer_bh
00112f60 T _tqueue_bh
00112fb0 T _immediate_bh

and a third one, in the same routine:

task not on run-queue
Oops: 0002
CPU: 0
EIP: 0010:[<00112be5>]
EFLAGS: 00010006
eax: 20202020 ebx: 00000216 ecx: 0129117c edx: 0114f03c
esi: 00000258 edi: 01291018 ebp: 001b50d0 esp: 001b50cc
ds: 0018 es: 0018 fs: 002b gs: 0018 ss: 0018
Process swapper (pid: 0, process nr: 0, stackpage=001b3210)
Stack: 00000001 0129117c 0014b81e 0129117c 01291018 01285078 00000246 001b5144
0014b88c 01291018 00000001 00000258 01291018 0014b8a7 01291018 00000000
0014baef 01291018 00000000 01291018 00000001 001d2f20 0014bbba 01291018
Call Trace: [<0014b81e>] [<0014b88c>] [<0014b8a7>] [<0014baef>] [<0014bbba>] [<0014bb2c>] [<00112f04>]
[<00118a02>] [<0010a7cd>] [<00109a14>] [<0010a849>] [<001096c4>] [<001094e7>] [<001180f8>] [<0011270d>]
Code: 89 08 53 9d 8b 5d fc 89 ec 5d c3 55 89 e5 53 8b 4d 08 31 c0

Plus a few more goodies:

Aiee, killing interrupt handler
kfree of non-kmalloced memory: 001b5250, next= 001b5258, order=1790552
kfree of non-kmalloced memory: 001b5240, next= 001b5258, order=1790096
kfree of non-kmalloced memory: 001b5674, next= 001b5258, order=1790552
idle task may not sleep
Feb 27 20:21:43 mvmap66 last message repeated 4 times
arp_expire_request: 556e0d81 postponed
arp_expire_request: 3a560d81 postponed

And annother Oops:

Oops: 0000
CPU: 0
EIP: 0010:[<0011f532>]
EFLAGS: 00010006
eax: 00000048 ebx: 0114f000 ecx: 20202020 edx: 00000002
esi: 00000246 edi: 00000001 ebp: 001b0800 esp: 01350d40
ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Process ssh-keyscan (pid: 140, process nr: 24, stackpage=01350000)
Stack: 00000000 00092898 c36d0d81 001b0800 01272d14 0114f000 001b6c00 011a4878
00000002 ffaa0055 ffffffff 00000246 0014e5ad 0000004c 00000001 011a4878
000000be c36d0d81 001be92c 001d2f40 00000000 2c780d81 00000246 2c780d81
Call Trace: [<001b0800>] [<0014e5ad>] [<0013904d>] [<0013f27a>] [<0013f566>] [<0013f629>] [<0014324a>]
[<00146fed>] [<00150b7d>] [<00139e94>] [<001504eb>] [<0013931e>] [<00139905>] [<0013aa3e>] [<0010a849>]
Code: 81 39 aa ff 55 00 0f 84 f8 00 00 00 56 9d 51 68 14 f4 11 00

0011f2a4 T _sys_munlockall
0011f31c T _kmalloc_init
0011f380 T _get_order
0011f464 T _kmalloc <=====================
0011f6ec T _kfree
0011f8a8 t _free_area_pages

arp_expire_request: 586f0d81 postponed
arp_expire_request: 28560d81 postponed
arp_expire_request: 45770d81 postponed
double lock on socket at 00146a40

And another one:

Oops: 0002
CPU: 0
EIP: 0010:[<00112c08>]
EFLAGS: 00010006
eax: 20202020 ebx: 00000246 ecx: 0129117c edx: 0114f03c
esi: 00000005 edi: 01291018 ebp: 01f9ef58 esp: 01f9ef54
ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Process bash (pid: 120, process nr: 4, stackpage=01f9e000)
Stack: 01291018 01f9ef98 00150016 0129117c 01291018 01291018 01291018 00000005
001d2f20 00140646 01291018 0014058c 012911b4 00112f04 01291018 00000001
ffffffff 00000001 00118a02 00000000 01f9efbc 00000002 00052806 bffff114
Call Trace: [<00150016>] [<00140646>] [<0014058c>] [<00112f04>] [<00118a02>] [<0010a7cd>]
Code: 89 10 c7 41 04 00 00 00 00 c7 01 00 00 00 00 b8 01 00 00 00
Aiee, killing interrupt handler

00112a44 T _interruptible_sleep_on
00112b00 T _sleep_on
00112bbc T _add_timer
00112bf0 T _del_timer <==============
00112c28 t _count_active_tasks
00112cc8 t _second_overflow
00112ecc t _timer_bh
00112f60 T _tqueue_bh
00112fb0 T _immediate_bh

followed by a few thousand arp_expire_request lines (faster than
kerneld could handle them). I hit reset.

-- 
Thomas Koenig, Thomas.Koenig@ciw.uni-karlsruhe.de, ig25@dkauni2.bitnet.
The joy of engineering is to find a straight line on a double
logarithmic diagram