Re: OOM Killer killing whole system

From: Anton Titov
Date: Fri Jan 20 2006 - 16:46:35 EST


On Fri, 2006-01-20 at 14:04 -0600, Chase Venters wrote:
> On Fri, 20 Jan 2006, Andrew Morton wrote:
> >> Jan 15 06:05:09 vip 216477 pages slab
> >
> > It's all in slab. 800MB.
> >
> > I'd be suspecting a slab memory leak. If it happens again, please take a
> > copy of /proc/slabinfo, send it.
> >
>
> Andrew & Anton,
> The culprit was 1.5 million SCSI commands in the scsi command cache.
>
> Thanks,
> Chase

I currently have this:
scsi_cmd_cache 1458778 1458790 384 10 1 : tunables 54 27
8 : slabdata 145879 145879 0

in /proc/slabinfo, which is pretty close to 1.5 million. The system is
working fine but it should be not very loaded anyway, so a mem leakage
will not show up early. Just checked, that scsi_cmd_cache on other
machines of mine is under 100, so it seems like a problem.

Unfortunately, while being a programmer, I'm totally unaware
what /proc/slabinfo means, but I'm perfectly willing to provide a shell
(in case of Andrew or other famous developer it may be even root) on
this machine.

I'm attaching the /proc/slabinfo

Thanks for help,
Anton


Attachment: slab.gz
Description: GNU Zip compressed data