SCSI/Block boot problems

From: Russell King (rmk@arm.linux.org.uk)
Date: Sat Jan 18 2003 - 11:57:02 EST


Continuing the theme of testing 2.5.59, there appears to be some fairly
bad error handling somewhere in this area at the moment. So far, my
debugging shows the following:

1. A permanent error with a _1_ SCSI target on a SCSI bus causes the
   SCSI error handling to go completely gaga and eventually ends up
   oopsing the kernel. Full kernel messages with SCSI debugging enabled
   have been forwarded to James Bottomley.

2. SCSI appears to attempt to spin up a non-present disk in removable
   SCSI drive _3_ times during boot. This is new behaviour for 2.5,
   which 2.4, 2.2 nor 2.0 used to show.

   Since each spinup takes around 2 minutes to timeout and the drive
   obvious isn't going to spin up without media present, it produces
   some very long test cycles, and is a source of continual annoyance.

3. SCSI goes completely gaga after a SCSI disk IO error. I haven't
   got much to say about this other than to supply the kernel messages
   (with some extra ones added to try to track down the problem.)

   At this point, we are trying to read the partition table on the
   aforementioned empty SCSI removable drive:

         sda:submitting buffer 0 of 1 (cc3fa580) page c026e3c0
        submission done
        prep_rq_fn: device sda ret = 1

   scsi_prep_fn() returns BLKPREP_KILL, and we end the request:

        __end_that_request_first: req c0427dc0 uptodate 0 nrbytes 4096
        end_request: I/O error, dev sda, sector 0
        end_buffer_async_read: bh cc3fa580 page c026e3c0 uptodate 0
        Buffer I/O error on device sd(8,0), logical block 0
        unlocking page: all buffers unlocked
        unlocking page c026e3c0 waitqueue c0003228: flags 00001006
        all done

   The partition code attempts to read another page:

        submitting buffer 0 of 1 (cc3fa580) page c026e3c0
        submission done
        wait_on_page_bit: task c0441040 page c026e3c0 bit 0 waitqueue c0003228
        prep_rq_fn: device sda ret = 2
        sleeping on page c026e3c0: flags 00021007
        prep_rq_fn: device sda ret = 2
        prep_rq_fn: device sda ret = 2
        prep_rq_fn: device sda ret = 2
        prep_rq_fn: device sda ret = 2
        prep_rq_fn: device sda ret = 2

   This time, scsi_prep_fn() continually returns BLKPREP_DEFER and we
   don't make any further progress. (I assume 20 minutes of waiting is
   probably long enough! 8))

--
Russell King (rmk@arm.linux.org.uk)                The developer of ARM Linux
             http://www.arm.linux.org.uk/personal/aboutme.html

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Thu Jan 23 2003 - 22:00:19 EST