Hard Disc Trouble

Kirill A. Kornilov (ciro@dawntech.zgrad.ru)
Sun, 18 May 1997 15:28:47 +0400 (MSD)


Hi.

I had a severe filesystem corruption.
Later, I played with my laptop's HD a bit to find the reason for it.

When the disc enters stand-by mode and is then woken up, the driver says:

hda: write_intr: Status=0x50 { DriveReady SeekComplete }

After that (sometimes) it becomes completely inaccessible:

hda irq timeout: status=0xd0 { Busy }
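
For readers decoding the two status values quoted above, here is a minimal sketch in Python of how the status byte maps to the names in the braces. The bit names follow the ones the Linux IDE driver prints; the function name decode_status is only illustrative. It assumes that, once Busy is set, the other bits are not meaningful and are therefore not reported, which matches the two messages shown.

```python
# Bit names as printed by the Linux IDE driver for the ATA status register.
STATUS_BITS = [
    (0x80, "Busy"),
    (0x40, "DriveReady"),
    (0x20, "WriteFault"),
    (0x10, "SeekComplete"),
    (0x08, "DataRequest"),
    (0x04, "CorrectedError"),
    (0x02, "Index"),
    (0x01, "Error"),
]

def decode_status(status):
    """Return the flag names set in an ATA status byte.

    While Busy is set the remaining bits are not meaningful, so only
    "Busy" is reported, mirroring the driver messages quoted above.
    """
    if status & 0x80:
        return ["Busy"]
    return [name for mask, name in STATUS_BITS if status & mask]

for value in (0x50, 0xD0):
    print(f"0x{value:02x} {{ {' '.join(decode_status(value))} }}")
```

So 0x50 is a normal "ready, seek complete" state, while 0xd0 is the same value with the Busy bit added: the drive never stopped being busy before the driver's timeout expired.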

Such an error leaves the files on a read-write mounted filesystem in a mess.
If / is mounted rw from the same disc, the only way to feel safe is to
hold a backup tape in your hand.

The easiest way to trigger it is "hdparm -Y /dev/hda", but the HD timeout
in BIOS Setup does the same.

If you can explain to me what's going on, it will make me feel better.
Can you give me any hints as to what it could be?

Thanks.

P.S. I'm not on the mailing list. Please reply to kirill@qdn.miee.ru

Here are some details about my laptop:

----------------------------------------------------
BIOS setup
----------------------------------------------------

32 bit access = yes (actually I don't know what it is; I didn't
try to change it)
----------------------------------------------------
Processor
----------------------------------------------------
P100, Intel.

----------------------------------------------------
dmesg:
----------------------------------------------------
Console: 16 point font, 400 scans
Console: colour VGA+ 80x25, 1 virtual console (max 63)
pcibios_init : BIOS32 Service Directory structure at 0xc00f6e30
pcibios_init : BIOS32 Service Directory entry at 0xfd674
pcibios_init : PCI BIOS revision 2.10 entry at 0xfd817
Probing PCI hardware.
Calibrating delay loop.. ok - 39.83 BogoMIPS
Memory: 14888k/16384k available (600k kernel code, 392k reserved, 492k data, 12k init)
Swansea University Computer Society NET3.039 for Linux 2.1
NET3: Unix domain sockets 0.16 for Linux NET3.038.
Swansea University Computer Society TCP/IP for NET3.037
IP Protocols: IGMP, ICMP, UDP, TCP
Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
Checking 'hlt' instruction... Ok.
Linux version 2.1.36 (kirill@linh) (gcc version 2.7.2.1) #3 Thu May 1 12:23:03 MSD 1997
POSIX conformance testing by UNIFIX
Serial driver version 4.24 with no serial options enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
MSR device installed.
APM BIOS version 1.1 Flags 0x03 (Driver version 1.2)
Entry f000:c24b cseg16 f000 dseg 40 cseg len ffff, dseg len ffff
Connection version 1.1
AC on line, battery status charging, battery life unknown
battery flag 0x08, battery life unknown
Real Time Clock Driver v1.07
hda: HITACHI_DK223A-11, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: HITACHI_DK223A-11, 1031MB w/128kB Cache, LBA, CHS=523/64/63
Partition check:
hda:Started kswapd v 1.23
hda1 hda2 < hda5 hda6 hda7 > hda3 hda4
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 12k freed
Adding Swap: 32220k swap-space (priority -1)
Replacing keyboard beeper
Swansea University Computer Society IPX 0.38 for NET3.037
IPX Portions Copyright (c) 1995 Caldera, Inc.
lo UP fl=0000004b pa=0100007F/000000FF brd=FFFFFF7F dst=00000000
Linux PCMCIA Card Services 2.9.5
Intel PCIC probe:
Vadem VG-468 at 0x3e0, 2 sockets
irq mask (scanned) = 0x8a98, status change on irq 15
cs: IO port probe 0x0100-0x03ff: excluding 0x378-0x37f 0x398-0x39f 0x3b0-0x3e7
cs: IO port probe 0x0a20-0x0a27: excluding 0xa20-0xa27
PS/2 auxiliary pointing device detected -- driver installed.
---------------------------------------------------------------
cat /proc/pci
----------------------------------------------------------------
PCI devices found:
Bus 0, device 18, function 1:
IDE interface: UMC UM8673F (rev 162).
Medium devsel. IRQ 14.
Bus 0, device 18, function 0:
ISA bridge: UMC UM8886N (rev 163).
Medium devsel. Master Capable. No bursts.
Bus 0, device 17, function 0:
VGA compatible controller: Chips & Technologies 65548 (rev 0).
Medium devsel. Fast back-to-back capable.
Non-prefetchable 32 bit memory at 0xfe000000.
Bus 0, device 0, function 0:
Host bridge: UMC UM8891N (rev 176).
Medium devsel. Master Capable. No bursts.
-----------------------------------------------------------------
Note: the failure happens even if I don't use any special settings under
Linux (such as ide0=autotune or hdparm calls)
hdparm -i
-----------------------------------------------------------------

/dev/hda:

Model=HITACHI_DK223A-11, FwRev=00M0A0R7, SerialNo=043831
Config={ HardSect NotMFM HdSw>15uSec Fixed DTR>10Mbs }
RawCHS=2095/16/63, TrkSize=36162, SectSize=574, ECCbytes=4
BuffType=3(DualPortCache), BuffSize=128kB, MaxMultSect=16, MultSect=8
DblWordIO=no, maxPIO=2(fast), DMA=yes, maxDMA=2(fast)
CurCHS=2095/16/63, CurSects=2111760, LBA=yes, LBAsects=2112360
tDMA={min:120,rec:120}, DMA modes: sword0 sword1 sword2 mword0 mword1 *mword2
IORDY=yes, tPIO={min:120,w/IORDY:120}, PIO modes: mode3 mode4
---------------------------------------------------------------------
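
As a sanity check on the geometry figures above, the following small Python sketch compares the sector counts implied by the different CHS values with the LBA count and the 1031MB size printed at boot. The 512-byte logical sector size is an assumption (it is not stated anywhere in the output above), and the helper name chs_sectors is only illustrative.

```python
SECTOR_BYTES = 512  # assumed logical sector size, not taken from the output above

def chs_sectors(cylinders, heads, sectors_per_track):
    """Total number of sectors addressable with a given CHS geometry."""
    return cylinders * heads * sectors_per_track

raw = chs_sectors(2095, 16, 63)        # RawCHS / CurCHS from hdparm -i
translated = chs_sectors(523, 64, 63)  # CHS printed by the kernel at boot
lba_sectors = 2112360                  # LBAsects from hdparm -i

# Size in units of 1024*1024 bytes, rounded down.
size_mb = lba_sectors * SECTOR_BYTES // (1024 * 1024)

print(raw)         # 2111760, equal to CurSects
print(translated)  # 2108736
print(size_mb)     # 1031, matching the size reported at boot
```

Under that assumption the numbers are mutually consistent: the raw geometry reproduces CurSects exactly, and the LBA sector count gives the 1031MB shown in dmesg, so the reported geometry itself does not look like the source of the trouble.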

----------------------------------
E-Mail: Kirill A. Kornilov <ciro@dawntech.zgrad.ru>
E-Mail1: kirill@qdn.miee.ru
Date: 18-May-97
Time: 15:28:49
----------------------------------