Re: Stuck TCP sockets in 2.1.1xx SMP

Alex Korobka (korobka@galaxy.ams.sunysb.edu)
Tue, 20 Oct 1998 09:33:29 -0400 (EDT)


Andi Kleen <ak@muc.de> writes:
> In article <199810191841.OAA18489@galaxy.ams.sunysb.edu>,
> Alex Korobka <korobka@galaxy.ams.sunysb.edu> writes:
> > We have a few dual PII 400 machines (P6DBE boards, eepro100 NICs)
> > that we'd like to use in a Beowulf-like cluster. However, all recent
> > kernels have exhibited the same problem, NPB2.3 MPI benchmarks keep
> > getting stuck waiting for incoming data. This happens only when
> > there are 2 MPI processes running on the same machine, there are
> > no problems with one process per machine. This is the output
> > of netstat -a -t for a job consisting of 8 MPI processes running
> > on star1, star2, star3, and star4 nodes.
>
> Could you define 'all recent kernels'. Was there a version when it started,
> but in the old one worked?
>

There was a number of kernels (2.1.3x - 2.1.9x) that didn't boot
on this hardware. All following releases have had this problem.
One more thing, three simultaneous NetPIPE processes usually kill
the machine before the block size gets to 1Mb.

Alex Korobka

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/