IDE error causes crash?

From: Steve Ruby (steve@rubysolutions.com)
Date: Mon Apr 17 2000 - 10:48:23 EST


My, mostly, stock redhat 6.1 installation (kernel 2.2.12) has been
crashing
on a nearly weekly basis, this is the only of my linux servers that is
having problems with the same distribution installed.

Most of the time there is nothining in the syslog to indicate the
cause of the crash but two times I have seen:

Apr 15 23:01:35 georgia kernel: hda: write_intr: status=0x00 { }
Apr 15 23:01:35 georgia kernel: ide0: reset: success
Apr 15 23:01:41 georgia kernel: hda: read_intr: status=0x00 { }
Apr 15 23:01:41 georgia kernel: ide0: reset: success

Searches for similar errors in the archive seem to be assocated
with a status that is NOT 0 and some message in the braces. Searches
for ide0: reset: sucess messages in the archives are also not
associated with crashed, just people trying to determine if the
error is a bad thing.

Of the last 5 crashes only two have posted similar messages.
3/5 crashes have happened at 11PM when I am running a backup to
hdc. One crash happened while I was accessing the tape drive,
another was not at 11:00 and the tape drive should not
have been being accessed at that time.

The computer contains only
/dev/hda:
 multcount = 0 (off)
 I/O support = 0 (default 16-bit)
 unmaskirq = 0 (off)
 using_dma = 0 (off)
 keepsettings = 0 (off)
 nowerr = 0 (off)
 readonly = 0 (off)
 readahead = 8 (on)
 geometry = 1583/255/63, sectors = 25434228, start = 0

and /dev/hdc a 5gb HP TRAVAN drive.

(on boot drives are recognized as):
Apr 17 08:10:12 georgia kernel: ide0: BM-DMA at 0xf000-0xf007, BIOS
settings: hda:pio, hdb:pio
Apr 17 08:10:12 georgia kernel: ide1: BM-DMA at 0xf008-0xf00f, BIOS
settings: hdc:pio, hdd:pio
Apr 17 08:10:12 georgia kernel: hda: ST313030A, ATA DISK drive
Apr 17 08:10:12 georgia kernel: hdc: HP COLORADO 5GB, ATAPI TAPE drive
Apr 17 08:10:12 georgia kernel: hda: ST313030A, 12419MB w/512kB Cache,
CHS=1583/255/63
Apr 17 08:10:12 georgia kernel: hda: hda1 hda2 < hda5 hda6 hda7 hda8 >

I am _unable_ to cause the machine to crash by running the backup by
hand
(it runs fine), and as I said only a fraction of the time do I get
meaningful info in the syslog before the crash. The only reason I am
assuming
the time of crash is by the last entry in the syslog from vpop3d or
ftpd or other final entry.

All other linux, and HP-UX boxes I work on have SCSI drives only so
I am not very familiar with settings and control of IDE drives.

Is it wrong to be running the two IDE devices as master/master, should
I be running master/slave? DMA appears to be off which should be the
'SAFE' setting, no?

Thanks for any help,
Steve

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sun Apr 23 2000 - 21:00:11 EST