Re: disk errors

Lee Willis (lee@gbdirect.co.uk)
08 Oct 1999 14:32:27 +0100


Igor Mozetic <igor.mozetic@uni-mb.si> writes:

> I'm running 2.2.12 with Debian 2.1 on several machines, and
> (after heavy disk activity) keep getting the following messages:
>
> Oct 8 10:31:50 gubesa kernel: hda: dma_intr: error=0x10
> { SectorIdNotFound }, LBAsect=9139240, sector=689050
> Oct 8 10:31:50 gubesa kernel: hda: dma_intr: status=0x71
> { DriveReady DeviceFault SeekComplete Error }

Probably unrelated but with the following setup:

$ cat /proc/ide/hdc/model
FUJITSU MPA3017ATU

$ uname -a
Linux beehive.gbdirect.co.uk 2.2.12 #5 Wed Oct 6 12:46:07 BST 1999 i586 unknown

Hmm, well actually the kernel was compiled with SMP support before but
on a UP board (Don't ask why, just one of those blonde-moments ...)

I got the following a few days ago ...

Oct 6 10:11:59 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:11:59 beehive kernel: hdc: DMA disabled
Oct 6 10:11:59 beehive kernel: hdc: drive not ready for command
Oct 6 10:12:29 beehive kernel: ide1: reset timed-out, status=0x80
Oct 6 10:12:29 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:12:29 beehive kernel: hdc: drive not ready for command
Oct 6 10:12:53 beehive kernel: ide1: reset: success
Oct 6 10:13:04 beehive kernel: hdc: irq timeout: status=0x80 { Busy }
Oct 6 10:13:34 beehive kernel: ide1: reset timed-out, status=0x80
Oct 6 10:13:34 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:13:34 beehive kernel: hdc: drive not ready for command
Oct 6 10:14:04 beehive kernel: ide1: reset timed-out, status=0x80
Oct 6 10:14:04 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:14:04 beehive kernel: end_request: I/O error, dev 16:02 (hdc), sector 5025
Oct 6 10:14:04 beehive kernel: hdc: drive not ready for command
Oct 6 10:14:04 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:14:04 beehive kernel: hdc: drive not ready for command
Oct 6 10:14:06 beehive kernel: ide1: reset: success
Oct 6 10:14:18 beehive kernel: hdc: lost interrupt
Oct 6 10:14:18 beehive kernel: hdc: read_intr: status=0x50 { DriveReady SeekComplete }
Oct 6 10:14:29 beehive kernel: hdc: lost interrupt
Oct 6 10:14:29 beehive kernel: hdc: read_intr: status=0x50 { DriveReady SeekComplete }
Oct 6 10:14:42 beehive kernel: hdc: lost interrupt
Oct 6 10:14:44 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:14:44 beehive kernel: hdc: drive not ready for command
Oct 6 10:14:47 beehive kernel: ide1: reset: success
Oct 6 10:15:04 beehive kernel: hdc: lost interrupt
Oct 6 10:15:04 beehive kernel: hdc: read_intr: status=0x50 { DriveReady SeekComplete }
Oct 6 10:25:20 beehive kernel: cat uses obsolete /proc/pci interface
Oct 6 10:26:46 beehive kernel: hdc: status timeout: status=0x80 { Busy }
Oct 6 10:26:46 beehive kernel: hdc: drive not ready for command
Oct 6 10:26:49 beehive kernel: ide1: reset: success
Oct 6 10:26:59 beehive kernel: hdc: lost interrupt
Oct 6 10:26:59 beehive kernel: hdc: read_intr: status=0x50 { DriveReady SeekComplete }
Oct 6 10:27:10 beehive kernel: hdc: lost interrupt
Oct 6 10:27:10 beehive kernel: hdc: read_intr: status=0x50 { DriveReady SeekComplete }
Oct 6 10:27:18 beehive kernel: hdc: read_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
Oct 6 10:27:18 beehive kernel: hdc: read_intr: error=0x40 { UncorrectableError }, LBAsect=229468, sector=229405
Oct 6 10:27:18 beehive kernel: end_request: I/O error, dev 16:01 (hdc), sector 229405
Oct 6 10:27:18 beehive kernel: EXT2-fs error (device ide1(22,1)): ext2_write_inode: unable to read inode block - inode=28750, block=114702
Oct 6 10:27:27 beehive kernel: hdc: read_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
Oct 6 10:27:27 beehive kernel: hdc: read_intr: error=0x40 { UncorrectableError }, LBAsect=229468, sector=229405
Oct 6 10:27:27 beehive kernel: end_request: I/O error, dev 16:01 (hdc), sector 229405
Oct 6 10:27:27 beehive kernel: EXT2-fs error (device ide1(22,1)): ext2_write_inode: unable to read inode block - inode=28750, block=114702
Oct 6 10:27:42 beehive kernel: hdc: lost interrupt
Oct 6 10:27:45 beehive kernel: hdc: read_intr: status=0x59 { DriveReady SeekComplete DataRequest Error }
Oct 6 10:27:45 beehive kernel: hdc: read_intr: error=0x40 { UncorrectableError }, LBAsect=229468, sector=229405
Oct 6 10:27:45 beehive kernel: end_request: I/O error, dev 16:01 (hdc), sector 229405
Oct 6 10:27:45 beehive kernel: EXT2-fs error (device ide1(22,1)): ext2_write_inode: unable to read inode block - inode=28750, block=114702
Oct 6 10:27:56 beehive kernel: hdc: lost interrupt

And so on ...
After a while I ended up with a distintly unbootable machine (My /lib/*
had gone very bad :( ) I presumed this was a hardware fault, as the
machine is no spring chicken and it's working fine now after a
re-install of init and some shared libraries and a quick fsck -c.

Is there a chance this is a kernel problem, I can post the entire output
if it'd help.

Lee.

-- 
Developer
GBdirect Ltd, 27 Park Drive, Heaton, Bradford, UK, BD9 4DS
Tel: +44 (0)1274 772277

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/