Re: Strange problem with a Linux server

G.W. Wettstein (greg@wind.enjellic.com)
Thu, 8 Oct 1998 05:07:15 -0500


On Oct 7, 3:17pm, Dominik Weis wrote:
} Subject: Strange problem with a Linux server

> We had a Linux server that lost all the network connections and nobody
> was able to connect to it. I was not at
> the server when it happend but another person rebooted the server and
> after it restarted it worked again. We did not change anything on the
> server for more than two weeks. It worked really fine. The strange part is
> I have nothing in the logs about errors and there where no error messages
> on the console.

> The server is a SMP system (400Mhz) and has 256MB Ram. We use kernel
> version 2.0.35 and the ethernet card that we use is:
> <4>de4x5.c:V0.535 1998/2/21 davies@maniac.ultranet.com
> <4>eth0: media is 100Mb/s.
>
> Does anyone now what could have been the problem or where should I search
> for the reason? Could the problem be related to the ethernet driver?

This thread sort of seemed to get lost in a discussion about whether
or not Dominik has a dual-PII system or a dual-PPro system. I am very
much interested in whether anyone has seen the behavior he is
describing.

I am running the IMAP server component of a messaging system that I
developed for the university on a dual-PII 300 system with 256
megabyte of RAM. We have seen 2-3 networking stalls similar to what
Dominik has reported in the last month or so. Downing and upping the
interface brings the machine back to life.

The NIC cards are SMC Etherpower II's both sharing a single IRQ. I
have reported the behavior to Donald. Interestingly the non-stalled
NIC continues to function but both NICS need to be downed and upped in
order to restore operation.

We have also seen one instance of the fatal deadlock condition that
Leonard Zubkoff discusses in kernel/sched.c in the context of the
allow_interrupts function. I noticed in Alan's release notes that
there might be a fix forthcoming for this so I am very interested in
seeing the completed patch.

I didn't mean to interrupt the flow of the thread but this behavior
may be of interest to others running 2.0.x boxes in SMP production
environments.

> Thanks
>
> Dominik Weis

A pleasant day to everyone.

Greg

}-- End of excerpt from Dominik Weis

As always,
Dr. G.W. Wettstein Enjellic Systems Development - Specialists in
4206 N. 19th Ave. intranet based enterprise information solutions.
Fargo, ND 58102 WWW: http://www.enjellic.com
Phone: 701-281-1686 EMAIL: greg@wind.enjellic.com
------------------------------------------------------------------------------
"Things fall apart; the centre cannot hold. Mere anarchy is loosed upon
the world; The blood-dimmed tide is loosed, and everywhere the ceremony
of innocence is drowned; The best lack all conviction, while the worst
are full of passionate intensity."
-- Yeats, Second Coming

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/