Running Out Of Memory?!?

Greg Estabrooks (greyfoxx@hfx.andara.com)
Thu, 11 Jun 1998 21:53:56 -0300


I'm hoping someone out there can shed some light on a problem I'm having.
I've noticed that ALL of my memory is in use, even though very few
processes are actually running. This seems to have caused a kernel fault
when the system completely ran out of physical memory. I've searched through
my logs and noticed that it has happened twice before. Two of the times the
system actually rebooted itself after the kernel error, but once it just
halted.

When I add up what's listed in 'ps aux' or top, it only comes to
approx 10 meg, but it seems that something is grabbing the leftovers. The
machine is running a named 8.1.12 name server and is doing the
routing/firewall for a small network. We are vhosting about 5 IPs on the
machine and there are only about 15 ipfwadm firewall rules, so I don't see
exactly what's stealing all the memory. I also noticed that the swap doesn't
seem to be getting touched at all.

I've "enclosed" a copy of my ps, top and /proc/meminfo output, as well as a
copy of the latest kernel error that showed up in my logs from the most
recent freak-out, in hopes that someone out there might have a clue as to
what the cause could be. I've since upgraded to kernel 2.0.34 in hopes that
it may help, but I still see the memory as being in use and am hoping to
squish this bug soon :)

System specs:

P133, 32 meg RAM, 12 meg swap file, 1.1 gig HD, two PCI NICs connected to a
hub.

Thx in advance.
======================================================================

8:08pm up 6:02, 1 user, load average: 0.05, 0.02, 0.00
24 processes: 23 sleeping, 1 running, 0 zombie, 0 stopped
CPU states: 0.5% user, 0.3% system, 0.0% nice, 99.1% idle
Mem: 31072K av, 29648K used, 1424K free, 7860K shrd, 420K buff
Swap: 14076K av, 0K used, 14076K free 25536K cached

PID USER PRI NI SIZE RSS SHARE STAT LIB %CPU %MEM TIME COMMAND
700 root 13 0 572 572 448 R 0 0.9 1.8 0:00 top
1 root 0 0 404 404 340 S 0 0.0 1.3 0:01 init
2 root 0 0 0 0 0 SW 0 0.0 0.0 0:00 kflushd
3 root -12 -12 0 0 0 SW< 0 0.0 0.0 0:00 kswapd
609 root 1 0 892 892 576 S 0 0.0 2.8 0:00 sshd
339 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
340 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
52 root 0 0 352 352 300 S 0 0.0 1.1 0:00 kerneld
197 root 0 0 460 460 384 S 0 0.0 1.4 0:00 syslogd
206 root 0 0 504 504 328 S 0 0.0 1.6 0:00 klogd
217 daemon 0 0 396 396 320 S 0 0.0 1.2 0:00 atd
228 root 0 0 472 472 392 S 0 0.0 1.5 0:00 crond
239 root 0 0 656 656 432 S 0 0.0 2.1 0:00 snmpd
251 root 0 0 424 424 356 S 0 0.0 1.3 0:00 inetd
276 root 0 0 352 352 304 S 0 0.0 1.1 0:00 gpm
328 root 9 0 552 552 460 S 0 0.0 1.7 0:09 sshd
341 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
342 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
343 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
344 root 0 0 316 316 260 S 0 0.0 1.0 0:00 mingetty
611 root 8 0 800 800 616 S 0 0.0 2.5 0:00 bash
346 root 0 0 236 236 196 S 0 0.0 0.7 0:00 update
531 adren 0 0 352 352 304 S 0 0.0 1.1 0:00 bnc
675 root 0 0 984 984 556 S 0 0.0 3.1 0:00 named

====================================================

USER PID %CPU %MEM SIZE RSS TTY STAT START TIME COMMAND
adren 531 0.0 1.1 716 352 ? S 16:16 0:00 bnc
daemon 217 0.0 1.2 768 396 ? S 14:07 0:00 /usr/sbin/atd
root 1 0.0 1.3 748 404 ? S 14:06 0:01 init [3]
root 2 0.0 0.0 0 0 ? SW 14:06 0:00 (kflushd)
root 3 0.0 0.0 0 0 ? SW< 14:06 0:00 (kswapd)
root 52 0.0 1.1 724 352 ? S 14:07 0:00 kerneld
root 197 0.0 1.4 784 460 ? S 14:07 0:00 syslogd
root 206 0.0 1.6 856 504 ? S 14:07 0:00 klogd
root 228 0.0 1.5 832 472 ? S 14:07 0:00 crond
root 239 0.0 2.1 1116 656 ? S 14:07 0:00 /usr/sbin/snmpd -f
root 251 0.0 1.3 768 424 ? S 14:07 0:00 inetd
root 276 0.0 1.1 732 352 ? S 14:07 0:00 gpm -t ms
root 328 0.0 1.7 1132 552 ? S 14:07 0:09 /usr/local/sbin/sshd
root 339 0.0 1.0 712 316 1 S 14:07 0:00 /sbin/mingetty tty1
root 340 0.0 1.0 712 316 2 S 14:07 0:00 /sbin/mingetty tty2
root 341 0.0 1.0 712 316 3 S 14:07 0:00 /sbin/mingetty tty3
root 342 0.0 1.0 712 316 4 S 14:07 0:00 /sbin/mingetty tty4
root 343 0.0 1.0 712 316 5 S 14:07 0:00 /sbin/mingetty tty5
root 344 0.0 1.0 712 316 6 S 14:07 0:00 /sbin/mingetty tty6
root 346 0.0 0.7 708 236 ? S 14:07 0:00 update (bdflush)
root 609 0.1 2.8 1224 892 ? S 19:59 0:00 /usr/local/sbin/sshd
root 611 0.0 2.5 1196 800 p0 S 19:59 0:00 -bash
root 675 0.0 3.1 1388 984 ? S 20:00 0:00 named
root 701 0.0 1.5 812 480 p0 R 20:12 0:00 ps aux
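
As a rough check on the "approx 10 meg" figure, the RSS column of the ps aux listing above can be totalled with a short script. This is only a sketch: the function name total_rss_kb and the PS_SAMPLE constant are made up for illustration, and the data is simply the listing above pasted in with wrapped lines rejoined. Note that RSS counts pages shared between processes once per process, so the sum is likely an overestimate of what the processes really hold.

```python
# Sum the RSS column (in kB) of `ps aux` output.
# PS_SAMPLE is the listing from this post; total_rss_kb is a hypothetical helper.
PS_SAMPLE = """\
USER PID %CPU %MEM SIZE RSS TTY STAT START TIME COMMAND
adren 531 0.0 1.1 716 352 ? S 16:16 0:00 bnc
daemon 217 0.0 1.2 768 396 ? S 14:07 0:00 /usr/sbin/atd
root 1 0.0 1.3 748 404 ? S 14:06 0:01 init [3]
root 2 0.0 0.0 0 0 ? SW 14:06 0:00 (kflushd)
root 3 0.0 0.0 0 0 ? SW< 14:06 0:00 (kswapd)
root 52 0.0 1.1 724 352 ? S 14:07 0:00 kerneld
root 197 0.0 1.4 784 460 ? S 14:07 0:00 syslogd
root 206 0.0 1.6 856 504 ? S 14:07 0:00 klogd
root 228 0.0 1.5 832 472 ? S 14:07 0:00 crond
root 239 0.0 2.1 1116 656 ? S 14:07 0:00 /usr/sbin/snmpd -f
root 251 0.0 1.3 768 424 ? S 14:07 0:00 inetd
root 276 0.0 1.1 732 352 ? S 14:07 0:00 gpm -t ms
root 328 0.0 1.7 1132 552 ? S 14:07 0:09 /usr/local/sbin/sshd
root 339 0.0 1.0 712 316 1 S 14:07 0:00 /sbin/mingetty tty1
root 340 0.0 1.0 712 316 2 S 14:07 0:00 /sbin/mingetty tty2
root 341 0.0 1.0 712 316 3 S 14:07 0:00 /sbin/mingetty tty3
root 342 0.0 1.0 712 316 4 S 14:07 0:00 /sbin/mingetty tty4
root 343 0.0 1.0 712 316 5 S 14:07 0:00 /sbin/mingetty tty5
root 344 0.0 1.0 712 316 6 S 14:07 0:00 /sbin/mingetty tty6
root 346 0.0 0.7 708 236 ? S 14:07 0:00 update (bdflush)
root 609 0.1 2.8 1224 892 ? S 19:59 0:00 /usr/local/sbin/sshd
root 611 0.0 2.5 1196 800 p0 S 19:59 0:00 -bash
root 675 0.0 3.1 1388 984 ? S 20:00 0:00 named
root 701 0.0 1.5 812 480 p0 R 20:12 0:00 ps aux
"""


def total_rss_kb(ps_output):
    """Return the sum of the RSS column, located by its header name."""
    lines = ps_output.strip().splitlines()
    rss_col = lines[0].split().index("RSS")
    total = 0
    for line in lines[1:]:
        fields = line.split()
        if len(fields) <= rss_col:
            continue  # skip short or wrapped continuation lines
        total += int(fields[rss_col])
    return total


if __name__ == "__main__":
    kb = total_rss_kb(PS_SAMPLE)
    print("total RSS: %d kB (about %.1f MB)" % (kb, kb / 1024.0))
```

The columns before RSS never contain spaces in this listing, so splitting on whitespace is enough here; commands with arguments only affect the trailing fields.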

==========================================================

total: used: free: shared: buffers: cached:
Mem: 31817728 30281728 1536000 7839744 430080 26148864
Swap: 14413824 0 14413824
MemTotal: 31072 kB
MemFree: 1500 kB
MemShared: 7656 kB
Buffers: 420 kB
Cached: 25536 kB
SwapTotal: 14076 kB
SwapFree: 14076 kB
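
One way to break these figures down is to separate the memory counted as buffers and cache from the rest of what is reported as used. The sketch below does only that arithmetic on the /proc/meminfo dump above. The helper names parse_meminfo and breakdown are hypothetical, and treating buffers and cache as a separate category from process memory is an assumption made for illustration, not a diagnosis of the oops further down.

```python
# Split "used" memory into buffers/cache and everything else.
# MEMINFO_SAMPLE is the dump from this post; the helpers are hypothetical.
MEMINFO_SAMPLE = """\
total: used: free: shared: buffers: cached:
Mem: 31817728 30281728 1536000 7839744 430080 26148864
Swap: 14413824 0 14413824
MemTotal: 31072 kB
MemFree: 1500 kB
MemShared: 7656 kB
Buffers: 420 kB
Cached: 25536 kB
SwapTotal: 14076 kB
SwapFree: 14076 kB
"""


def parse_meminfo(text):
    """Collect the 'Name: <n> kB' lines into a dict of kB values."""
    values = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        # Only accept lines of the exact form "Name: 1234 kB".
        if sep and len(parts) == 2 and parts[1] == "kB" and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def breakdown(info):
    """Return used memory, buffers plus cache, and the remainder, in kB."""
    used = info["MemTotal"] - info["MemFree"]
    cache = info["Buffers"] + info["Cached"]
    return {
        "used": used,
        "buffers_and_cache": cache,
        "used_excluding_cache": used - cache,
    }


if __name__ == "__main__":
    for key, value in breakdown(parse_meminfo(MEMINFO_SAMPLE)).items():
        print("%-22s %6d kB" % (key, value))
```

The byte-valued "Mem:" and "Swap:" summary lines are skipped on purpose, since they do not match the "Name: value kB" shape the parser looks for.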

=======================================================

Jun 11 14:05:36 cronos kernel: Warning: kfree_skb passed an skb still on a list (from 005ffaf4).
Jun 11 14:05:36 cronos kernel: Unable to handle kernel paging request at virtual address c319001a
Jun 11 14:05:36 cronos kernel: current->tss.cr3 = 00101000,
Jun 11 14:05:36 cronos kernel: *pde = 00000000
Jun 11 14:05:36 cronos kernel: Oops: 0000
Jun 11 14:05:36 cronos kernel: CPU: 0
Jun 11 14:05:36 cronos kernel: EIP: 0010:[<0319001a>]
Jun 11 14:05:36 cronos kernel: EFLAGS: 00010202
Jun 11 14:05:36 cronos kernel: eax: 0319001a ebx: 001a008a ecx: 005ff9fc edx: 005ff000
Jun 11 14:05:36 cronos kernel: esi: 00000184 edi: 00000000 ebp: 001a6c5c esp: 001a6bf4
Jun 11 14:05:36 cronos kernel: ds: 0018 es: 0018 fs: 002b gs: 0018 ss: 0018
Jun 11 14:05:36 cronos kernel: Process swapper (pid: 0, process nr: 0, stackpage=001a4d48)
Jun 11 14:05:36 cronos kernel: Stack: 00136ef3 001a008a 001a008a 005ffaf4 001371f3 001a008a 005ffaf4 005fff3c
Jun 11 14:05:36 cronos kernel: 00ab0074 0000011c 0013f35d 005ffaf4 00000000 00000000 005fff3c 001aa6bc
Jun 11 14:05:36 cronos kernel: 00000002 00007100 00000000 005fff8c 00000001 001b2700 ffffff00 00000100
Jun 11 14:05:36 cronos kernel: Call Trace: [sock_wfree+35/44] [kfree_skb+183/244] [ip_rcv+1197/1416] [net_bh+252/284] [do_bottom_ha$
Jun 11 14:05:36 cronos kernel: [sys_idle+59/112] [system_call+85/124] [init+0/612] [start_kernel+424/436]
Jun 11 14:05:36 cronos kernel: Code: <1>Unable to handle kernel paging request at virtual address c319001a
===============================================
GreyFoxx

http://users.andara.com/~greyfoxx

"Unix _is_ user friendly. 
It's just very selective about who its friends are. 
And sometimes even best friends have fights. "
