Concern about 2.2.15 Adaptec 7xxx drivers

From: Robert A. Hayden (rhayden@geek.net)
Date: Sat May 13 2000 - 01:02:29 EST


I've been encountering some problems with my system since upgrading
Geek.NET to 2.2.15. Since the upgrade, I have not been able to make a
successful backup of my system to my Sony DDS-3 tape drive. Tonight, I
ran a manual backup to watch it and all hell broke loose.

First, some stats:

        System: Dual PII-400 w/ 256 MB
                        ASUS P2B-DS Motherboard with integrated
                                Adaptec 7890 Controller
        Drives: 1 Quantum 9GB Ultra 2
                        1 CMD 5440 External RAID Controller w/
                                3x9gb IBM UW SCSI (RAID 5)
                        1 Sony DDS-3 Tape Drive
                        1 NEC SCSI CDRom
        Network: 1 Intel EtherExpress Pro 100
        Video: Nothin special, in console mode anyways

---

Symptoms and Errors:

I've not been able to get BRU to complete a full system backup since upgrading to 2.2.15. However, I hadn't assumed a kernel problem, because I've had some niggling power issues in the last week (UPS install tomorrow!) and I've always had to re-insert the tape if the machine reboots. The reboots have also meant about a half-dozen fscks, but they have always completed without problem.

Tonight, I popped in a brand new tape and ran a manual backup. I then had the following pass by the console:

...
May 13 00:01:01 geek kernel: scsi0: MEDIUM ERROR on channel 0, id 1, lun 0, CDB: Read (10) 00 01 62 83 c7 00 00 08 00
May 13 00:01:01 geek kernel: Info fld=0x16283c8, Current sd08:11: sense key Medium Error
May 13 00:01:01 geek kernel: Additional sense indicates Unrecovered read error - recommend rewrite the data
May 13 00:01:01 geek kernel: scsidisk I/O error: dev 08:11, sector 23233416
May 13 00:01:04 geek kernel: scsi0: MEDIUM ERROR on channel 0, id 1, lun 0, CDB: Read (10) 00 01 62 83 c7 00 00 08 00
May 13 00:01:04 geek kernel: Info fld=0x16283c8, Current sd08:11: sense key Medium Error
May 13 00:01:04 geek kernel: Additional sense indicates Unrecovered read error - recommend rewrite the data
May 13 00:01:04 geek kernel: scsidisk I/O error: dev 08:11, sector 23233416
...
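
As a sanity check on the log above, the numbers in the error lines can be decoded with a little arithmetic. The following is only a minimal Python sketch under stated assumptions: that "08:11" is a hexadecimal major:minor pair, that SCSI disks live on major 8 with 16 minors per disk, and that the nine bytes printed after "Read (10)" are the CDB bytes following the opcode. The function names are made up for illustration.

```python
# Sketch only: decode the fields of the MEDIUM ERROR report quoted above.
# Assumptions: "08:11" is hex major:minor; SCSI disks use major 8 with
# 16 minors per disk; the bytes after "Read (10)" exclude the opcode byte.

def decode_dev(dev):
    """Turn 'MM:mm' (hex) into (major, minor, guessed device name)."""
    major_hex, minor_hex = dev.split(":")
    major, minor = int(major_hex, 16), int(minor_hex, 16)
    disk, part = divmod(minor, 16)          # disk index, partition number
    name = "sd" + chr(ord("a") + disk) + (str(part) if part else "")
    return major, minor, name

def decode_read10(cdb_tail):
    """Return (start LBA, block count) from the bytes after the opcode."""
    b = [int(x, 16) for x in cdb_tail.split()]
    lba = int.from_bytes(bytes(b[1:5]), "big")     # 4-byte big-endian LBA
    length = int.from_bytes(bytes(b[6:8]), "big")  # 2-byte transfer length
    return lba, length

major, minor, name = decode_dev("08:11")
lba, length = decode_read10("00 01 62 83 c7 00 00 08 00")
info = int("0x16283c8", 16)      # "Info fld" from the sense data
sector = 23233416                # sector reported by "scsidisk I/O error"

print(major, minor, name)        # 8 17 sdb1
print(lba, length)               # 23233479 8
print(lba <= info < lba + length)  # True: failing block is inside the request
print(lba - sector)              # 63
```

Under those assumptions the error is on the first partition of the second SCSI disk (sdb1, the device at id 1), not on the tape drive. The difference of 63 between the absolute block address and the reported sector would be consistent with a partition starting at sector 63, but that is an inference and not something the log states.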

After a minute, the errors stopped, the machine unlocked and resumed normal operation, and the backup continued.

A short time later, I got another error that unfortunately didn't log and I didn't get the specifics. It was some kind of an "unable to read" followed by an "unable to write" to both the single hard drive (/dev/sda) and the RAID array (/dev/sdb). This time the machine totally locked and I was forced to power cycle. I don't know if I could have left it for a while and had the machine recover operation. I was more in panic mode at that point ;-).

After reboot, the system did its fsck just fine. I looked at uk.kernel.org for the 2.2.15 release notes and saw that updates had been made to the 7xxx drivers, which seemed to fit the fact that my inability to back up began with my upgrade to 2.2.15.

Tomorrow I have a planned outage to install the UPS and other things. At that time, I'm going to try another manual backup using the 2.2.14 kernel and see if I'm successful. For the time being I've downgraded back to 2.2.14 just to ease my mind overnight.

Any thoughts here? I'm including a copy of dmesg, as reported by 2.2.14, below.

Thanks much

---

Linux version 2.2.14 (root@geek.net) (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release)) #4 SMP Sat May 13 00:44:06 CDT 2000
Intel MultiProcessor Specification v1.1
Virtual Wire compatibility mode.
OEM ID: OEM00000 Product ID: PROD00000000 APIC at: 0xFEE00000
Processor #1 Pentium(tm) Pro APIC version 17
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 2
mapped APIC to ffffe000 (fee00000)
mapped IOAPIC to ffffd000 (fec00000)
Detected 400918721 Hz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 399.77 BogoMIPS
Memory: 257704k/262080k available (1016k kernel code, 420k reserved, 2896k data, 44k init)
Dentry hash table entries: 32768 (order 6, 256k)
Buffer cache hash table entries: 262144 (order 8, 1024k)
Page cache hash table entries: 65536 (order 6, 256k)
VFS: Diskquotas version dquot_6.4.0 initialized
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
per-CPU timeslice cutoff: 100.22 usecs.
CPU1: Intel Pentium II (Deschutes) stepping 03
calibrating APIC timer ...
..... CPU clock speed is 400.9190 MHz.
..... system bus clock speed is 100.2296 MHz.
Booting processor 0 eip 2000
Calibrating delay loop... 400.59 BogoMIPS
OK.
CPU0: Intel Pentium II (Deschutes) stepping 03
Total of 2 processors activated (800.36 BogoMIPS).
enabling symmetric IO mode... ...done.
ENABLING IO-APIC IRQs
init IO_APIC IRQs
IO-APIC (apicid-pin) 2-0, 2-10, 2-11, 2-12, 2-13, 2-18, 2-20, 2-21, 2-22, 2-23 not connected.
number of MP IRQ sources: 15.
number of IO-APIC #2 registers: 24.
testing the IO APIC.......................

IO APIC #2......
.... register #00: 02000000
....... : physical APIC id: 02
.... register #01: 00170011
....... : max redirection entries: 0017
....... : IO APIC version: 0011
.... register #02: 00000000
....... : arbitration: 00
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 000 00  1    0    0   0   0    0    0    00
01 000 00  0    0    0   0   0    1    1    59
02 0FF 0F  0    0    0   0   0    1    1    51
03 000 00  0    0    0   0   0    1    1    61
04 000 00  0    0    0   0   0    1    1    69
05 000 00  0    0    0   0   0    1    1    71
06 000 00  0    0    0   0   0    1    1    79
07 000 00  0    0    0   0   0    1    1    81
08 000 00  0    0    0   0   0    1    1    89
09 000 00  0    0    0   0   0    1    1    91
0a 000 00  1    0    0   0   0    0    0    00
0b 000 00  1    0    0   0   0    0    0    00
0c 000 00  1    0    0   0   0    0    0    00
0d 000 00  1    0    0   0   0    0    0    00
0e 000 00  0    0    0   0   0    1    1    99
0f 000 00  0    0    0   0   0    1    1    A1
10 0FF 0F  1    1    0   1   0    1    1    A9
11 0FF 0F  1    1    0   1   0    1    1    B1
12 000 00  1    0    0   0   0    0    0    00
13 0FF 0F  1    1    0   1   0    1    1    B9
14 000 00  1    0    0   0   0    0    0    00
15 000 00  1    0    0   0   0    0    0    00
16 000 00  1    0    0   0   0    0    0    00
17 000 00  1    0    0   0   0    0    0    00
IRQ to pin mappings:
IRQ0 -> 2
IRQ1 -> 1
IRQ3 -> 3
IRQ4 -> 4
IRQ5 -> 5
IRQ6 -> 6
IRQ7 -> 7
IRQ8 -> 8
IRQ9 -> 9
IRQ10 -> 17
IRQ11 -> 16
IRQ12 -> 19
IRQ14 -> 14
IRQ15 -> 15
.................................... done.
PCI: PCI BIOS revision 2.10 entry at 0xf0730
PCI: Using configuration type 1
PCI: Probing PCI hardware
Linux NET4.0 for Linux 2.2
Based upon Swansea University Computer Society NET3.039
NET4: Unix domain sockets 1.0 for Linux NET4.0.
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP
TCP: Hash tables configured (ehash 262144 bhash 65536)
Starting kswapd v 1.5
Detected PS/2 Mouse Port.
Serial driver version 4.27 with no serial options enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
pty: 2048 Unix98 ptys configured
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
(scsi0) <Adaptec AIC-7890/1 Ultra2 SCSI host adapter> found at PCI 6/0
(scsi0) Wide Channel, SCSI ID=7, 32/255 SCBs
(scsi0) Downloading sequencer code... 385 instructions downloaded
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.21/3.2.4
       <Adaptec AIC-7890/1 Ultra2 SCSI host adapter>
scsi : 1 host.
(scsi0:0:0:0) Synchronous at 40.0 Mbyte/sec, offset 31.
  Vendor: QUANTUM Model: QM39100TD-SW Rev: N491
  Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
(scsi0:0:1:0) Synchronous at 40.0 Mbyte/sec, offset 15.
  Vendor: CMD TECH Model: CRD-5440-1 Rev: C1-9
  Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
  Vendor: NEC Model: CD-ROM DRIVE:500 Rev: 1.0
  Type: CD-ROM ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, channel 0, id 2, lun 0
(scsi0:0:3:0) Synchronous at 10.0 Mbyte/sec, offset 15.
  Vendor: SONY Model: SDT-9000 Rev: 0400
  Type: Sequential-Access ANSI SCSI revision: 02
Detected scsi tape st0 at scsi0, channel 0, id 3, lun 0
scsi : detected 1 SCSI tape 1 SCSI cdrom 2 SCSI disks total.
Uniform CDROM driver Revision: 2.56
SCSI device sda: hdwr sector= 512 bytes. Sectors= 17783250 [8683 MB] [8.7 GB]
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 35698688 [17431 MB] [17.4 GB]
eth0: Intel EtherExpress Pro 10/100 at 0xb800, 00:A0:C9:67:2E:9C, IRQ 10.
  Board assembly 667280-003, Physical connectors present: RJ45
  Primary interface chip i82555 PHY #1.
  General self-test: passed.
  Serial sub-system self-test: passed.
  Internal registers self-test: passed.
  ROM checksum self-test: passed (0x49caa8d6).
  Receiver lock-up workaround activated.
Partition check:
 sda: sda1 sda2 sda3 sda4 < sda5 sda6 >
 sdb: sdb1
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 44k freed
Adding Swap: 1542232k swap-space (priority -1)

=-=-=-=-=-=
Robert Hayden rhayden@geek.net UIN: 16570192



