Strange hard hangs, probably hd spin down related

Hans (J.W.R.deGoede@ITS.TUDelft.NL)
Wed, 02 Sep 1998 10:26:35 +0200 (METDST)


Hi all,

I've been using linux 2.0.xx for 2 years now and it has never crashed
on me like this before.

It all started yesterday when I tried to telnet home from school
and my machine wasn't there.

When I got home it turned out that it hanged hard, no keyboard, no
numlock, no ping. Nothing on the screen (it was blanked before the crash I
guess). Only the reset button helped.

So i had it building kernels with the burnin script all night,
and none failed and all .o files where identical.

So I guessed that just a bit had fallen over.

Then this morning 20 minutes after I stopped the burnin script
I logged in on tty2. I heared the hd spin up, since it was sleeping.
At that time I wondered wether I had killed X last night or wether it was
still running. So when the hd was spinning up I switched to the console of
X (7). Nothing happened amd the password promt just stood there and stood
there. Again it hang hard.

No this might be coincedence + coincedence.
Basicly being that I shouldn't switch to X when the hd is spinning up.
and that a bit fall over. But I don't believe in that much coincedence.

I changed the following things lately:

About 2 weeks ago I got a new video card and did some other things too:
-changed ide kabels for shorter ones (since I wanted to enable udma)
-enabled udma on my via appolo board using the jumbo-9 patch
-changed my s3 trio64 2mb for a s3virge/dx 4mb
-removed my 1 gig scsi ibm drive to use it in my brothers sparc ipx ;)

About 2 months ago :
-got a new hd made this secondary master and moved my linux install to it.

I suspect the spinning and the new hd because:
-the hd made a strange noice when spinning up
-When I first got it I had bought 2 (one for a friend) and this one
wouldn't be detected by any of 3 pc's (with each their own ide cabel)
while the other one worked fine in all 3 (with the same cabels)
So I returned it to the store. Then I got it back (the same serial at
least ;) and they said it worked fine for them.
And indeed it did work , but I still don't trust it.

Here's my systemconfig:

vxpro+ (via appolo) motherboard
cyrix6x86L-PR200+
48 mb 60ns-edo (2x 8 + 2x 16)
1gig seagate as primary master (pio 4)
6.5 gig quantum fireball secondary master (udma)
1.44 & 1.2 mb floppy's
ncr53c810 scsi2-controller
pioner 4x scsi cdrom
etherexpress pro10
sb32 + 2mb
s3virge/dx 4mb

Here's some more detailed info:

rnel: ide: VIA VT82C586B (split FIFO) UDMA Bus Mastering IDE
rnel: Controller on PCI bus 0 function 57
rnel: ide: timings == ba09c20b
rnel: ide0: BM-DMA at 0x6000-0x6007
rnel: ide1: BM-DMA at 0x6008-0x600f
rnel: hda: ST51080A, 1033MB w/256kB Cache, CHS=525/64/63
rnel: hdc: QUANTUM FIREBALL ST6.4A, 6149MB w/81kB Cache, CHS=13328/15/63,
UDMA
rnel: ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
rnel: ide1 at 0x170-0x177,0x376 on irq 15
rnel: Floppy drive(s): fd0 is 1.44M, fd1 is 1.2M
rnel: FDC 0 is an 8272A
rnel: Partition check:
rnel: hda: hda1 hda2 hda3
rnel: hdc: [PTBL] [784/255/63] hdc1 hdc2

PCI devices found:
Bus 0, device 11, function 0:
VGA compatible controller: S3 Inc. ViRGE/DX or /GX (rev 1).
Medium devsel. IRQ 255. Master Capable. Latency=32. Min
Gnt=4.Max Lat=255.
Non-prefetchable 32 bit memory at 0xe0000000.
Bus 0, device 9, function 0:
Non-VGA device: NCR 53c810 (rev 1).
Medium devsel. IRQ 10. Master Capable. Latency=64.
I/O at 0x6200.
Non-prefetchable 32 bit memory at 0xe4000000.
Bus 0, device 7, function 1:
IDE interface: VIA Technologies VT 82C586 Apollo IDE (rev 6).
Medium devsel. Fast back-to-back capable. Master Capable.
Latency=32.
I/O at 0x6000.
Bus 0, device 7, function 0:
ISA bridge: VIA Technologies VT 82C586 Apollo ISA (rev 37).
Medium devsel. Master Capable. No bursts.
Bus 0, device 0, function 0:
Host bridge: VIA Technologies VT 82C585 Apollo VP1/VPX (rev 35).
Medium devsel. Fast back-to-back capable. Master Capable.
Latency=32.

My current kernel = 2.0.35 (clean) + jumbo-9 + modular-sound-3 +
awe-fix for modular-sound-3.

Well I'll try to reproduce this this evening when I'm back home
I hope that it will crash often if I set my hdsleep time to 1 minute.
I've got it disabled for now and it's still running ;)

If I can reproduce this I'll also try it with 2.0.34 clean.

Well I hope any of you got a clue. Could enabling apm in the kernel help?

I read linux-kernel regular through the web-archives.
But I'm not subscribed so please cc replies to me personally

Thanks,

Hans

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.altern.org/andrebalsa/doc/lkml-faq.html