Re: swapper: page allocation failure. - random reboot problem

From: Haar János
Date: Tue May 23 2006 - 06:16:34 EST



----- Original Message -----
From: "Nick Piggin" <nickpiggin@xxxxxxxxxxxx>
To: "Haar János" <djani22@xxxxxxxxxxxx>
Cc: "Con Kolivas" <kernel@xxxxxxxxxxx>; <cw@xxxxxxxx>;
<linux-kernel@xxxxxxxxxxxxxxx>
Sent: Tuesday, May 23, 2006 11:17 AM
Subject: Re: swapper: page allocation failure. - random reboot problem


> Haar János wrote:
>
> > OK, it is enough to switch to 64-bit, thanks!
> >
> > But I have a little problem.
> > My node #3 reboots again.
> >
> > At this point I have run out of ideas. :-(
> >
> > This is checked already:
> >
> > - the complete hardware, except the 12 HDDs (SMART reports no errors at
> > all; 4x IDE + 8x SATA, all 300GB)
> > - the SMP race (checked with a non-SMP kernel)
> > - APIC/ACPI (tested with non... kernel)
> > - the e1000 driver (tested with a Realtek GigE adapter)
> > - the complete filesystem and OS (NFS-ROOT, and copy between nodes)
> > - the memory allocation problem (checked with a debug kernel and a raised
> > min_free_kbytes)
> >
> > The system's only service is nbd. (nbd-server serving md0, a RAID4 array)
> >
> > Does anybody have an idea?
>
> Bad hardware. Run memtest overnight. Can your power supply deal with
> that many drives? etc.

Sorry, I have already done these things.

The motherboard + CPU + RAM passed the overnight memtest, but I replaced
this group with other pre-tested parts anyway, with no change!
(Additionally, I replaced the NIC [e1000 to e1000, and e1000 to Realtek],
the SATA cards [Promise to Promise], the SATA and IDE cables, MB, CPU, RAM,
PSU, and the power cable too.
Only the 12 HDDs are the same, but SMART reports no errors at all!)
This is already the 3rd power supply, and the problem is the same.

This really is a software bug, but I don't know exactly where.

Cheers,
Janos

>
> --
> SUSE Labs, Novell Inc.
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/