aic7xxx 5.0.19 /dev/st0 problems?

Theodore Y. Ts'o (phil@Stimpy.netroedge.com)
Tue, 23 Jun 1998 12:19:06 -0700 (PDT)


I've got a dual Pent2 machine running 2.0.34 SMP with aic7xxx 5.0.19 and
my tape backups have been failing (with any tape) the last few days
without me noticing. I think they started failing after I upgraded the
aic7xxx driver (not sure). I popped in the cleaning-tape (can't hurt!),
but the kernel spun like a top loading the machine instantly to 9+ after I
did a "mt -f /dev/st0 status". I did some poking around on the news
groups, and there are some other people which are having similar problems
with their "/dev/st0" while using aic7xxx 5.0.19. (My tape drive is a
Sony SDX-S300C which worked fine for weeks after I got it under diferent
system configurations.)

Has anyone else run into this? Is there a fix? Is there a previous
release of aic7xxx which is nice and stable under 2.0.34 SMP?

Thanks!

Phil

Here are some logs on the topic (I hope this isn't overkill):

Sample tape backup output:
--------------------------

***********************************************
**** Start Level-0 at Mon Jun 22 22:00:00 PDT 1998 into file
/etc/tape-backup/level-0/1998_06_22_Mon
**** Running under user admin
SCSI 2 tape drive:
File number=0, block number=0, partition=0.
Tape block size 512 bytes. Density code 0x30 (unknown).
Soft error count since last status=0
General status bits on (41010000):
BOT ONLINE IM_REP_EN
/bin/tar: Removing leading `/' from absolute path names in the archive
Total bytes written: 23933952
/bin/tar: Cannot write to /dev/st0: Input/output error
/bin/tar: Error is not recoverable: exiting now
**** Ending Level-0 at Tue Jun 23 00:26:10 PDT 1998 into file
/etc/tape-backup/level-0/1998_06_22_Mon
***********************************************

Here's /var/log/messages:
-------------------------

Jun 23 10:54:56 Teller kernel: st0: Error with sense data: Current error
st09:00: sense key Medium Error
Jun 23 10:54:56 Teller kernel: Additional sense indicates Write error
Jun 23 10:59:44 Teller kernel: (scsi0:0:8:0) Parity error during phase
Command.
Jun 23 11:00:04 Teller kernel: scsi : aborting command due to timeout :
pid 278495, scsi0, channel 0, id 0, lun 0
Write (6) 0e fb dd 02 00
Jun 23 11:00:09 Teller kernel: scsi : aborting command due to timeout :
pid 278496, scsi0, channel 0, id 0, lun 0
Write (6) 00 c0 4b 02 00
Jun 23 11:00:24 Teller kernel: scsi : aborting command due to timeout :
pid 278495, scsi0, channel 0, id 0, lun 0
Write (6) 0e fb dd 02 00
Jun 23 11:00:24 Teller kernel: scsi : aborting command due to timeout :
pid 278497, scsi0, channel 0, id 0, lun 0
Write (10) 00 00 b7 60 04 00 00 02 00
[... Lots more of the same...]
Jun 23 11:02:04 Teller kernel: scsi : aborting command due to timeout :
pid 278497, scsi0, channel 0, id 0, lun 0
Write (10) 00 00 b7 60 04 00 00 02 00
Jun 23 11:02:09 Teller kernel: scsi : aborting command due to timeout :
pid 278496, scsi0, channel 0, id 0, lun 0
Write (6) 00 c0 4b 02 00
Jun 23 11:02:09 Teller kernel: (scsi0:0:8:-1) Unexpected busfree,
LASTPHASE = 0x0, SEQADDR = 0x59
Jun 23 11:02:28 Teller kernel: (scsi0:0:0:-1) Unexpected busfree,
LASTPHASE = 0x1, SEQADDR = 0x151
Jun 23 11:02:30 Teller kernel: scsi0 channel 0 : resetting for second half
of retries.
Jun 23 11:02:30 Teller kernel: SCSI bus is being reset for host 0 channel
0.
Jun 23 11:02:30 Teller kernel: (scsi0:0:-1:-1) Yikes!! There is a loop in
the free list!
Jun 23 11:02:30 Teller kernel: st0: Error 80000.
Jun 23 11:02:33 Teller kernel: (scsi0:0:0:0) Using wide (16 bit)
transfers.
Jun 23 11:02:33 Teller kernel: (scsi0:0:0:0) Synchronous at 20.0MHz,
offset 8.

/proc/scsi/aic7xxx/0
--------------------

Adaptec AIC7xxx driver version: 5.0.19/3.2.4
Compile Options:
AIC7XXX_RESET_DELAY : 5
AIC7XXX_TAGGED_QUEUEING: Adapter Support Enabled
Check below to see which
devices use tagged queueing
AIC7XXX_PAGE_ENABLE : Enabled (This is no longer an option)
AIC7XXX_PROC_STATS : Enabled

Adapter Configuration:
SCSI Adapter: Adaptec AHA-294X Ultra SCSI host adapter
Ultra Wide Controller
PCI MMAPed I/O Base: 0xfebff000
Adaptec SCSI BIOS: Disabled
IRQ: 10
SCBs: Active 0, Max Active 2,
Allocated 30, HW 16, Page 255
Interrupts: 297114
BIOS Control Word: 0x1832
Adapter Control Word: 0x005f
Extended Translation: Disabled
Disconnect Enable Flags: 0xffff
Ultra Enable Flags: 0xfeff
Tag Queue Enable Flags: 0x0000
Ordered Queue Tag Flags: 0x0000
Default Tag Queue Depth: 8
Tagged Queue By Device array for aic7xxx host instance 0:
{255,255,255,255,255,255,255,255,255,255,255,255,255,255,255,255}
Actual queue depth per device for aic7xxx host instance 0:
{1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1}

Statistics:
(scsi0:0:0:0)
nxfers 295176 (142166 read;153010 written)
blks(512) rd=2689779; blks(512) wr=5000168
< 512 512-1K 1-2K 2-4K 4-8K 8-16K 16-32K 32-64K 64-128K
>128K
Reads: 0 1 74824 15470 24255 10702 2208 3992 10714
0
Writes: 0 0 114964 8787 1249 9187 341 156 18326
0

(scsi0:0:8:0)
nxfers 1375 (633 read;742 written)
blks(512) rd=40512; blks(512) wr=46746
< 512 512-1K 1-2K 2-4K 4-8K 8-16K 16-32K 32-64K 64-128K
>128K
Reads: 0 0 0 0 0 0 0 633 0
0
Writes: 0 0 0 0 0 0 371 371 0
0

/proc/scsi/scsi:
----------------

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: MicroNet Model: DataDock 7000 Rev: C1-1
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 08 Lun: 00
Vendor: SONY Model: SDX-300C Rev: 0300
Type: Sequential-Access ANSI SCSI revision: 02

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu