Re: 2.6.30-rc1: parisc: system hangs on boot at __schedule()

From: James Bottomley
Date: Wed Apr 15 2009 - 11:06:21 EST


On Wed, 2009-04-15 at 16:49 +0400, Alexander Beregalov wrote:
> 2009/4/9 Kyle McMartin <kyle@xxxxxxxxxxx>:
> > On Thu, Apr 09, 2009 at 12:46:15PM +0400, Alexander Beregalov wrote:
> >> 2009/4/9 Kyle McMartin <kyle@xxxxxxxxxxx>:
> >> > On Wed, Apr 08, 2009 at 05:03:04PM +0400, Alexander Beregalov wrote:
> >> >> Hi
> >> >>
> >> >> Machine is HP j6000.
> >> >> CPU(s): 2 x PA8700 (PCX-W2) at 750.000000 MHz
> >> >>
> >> >> gcc version 4.3.3 (Gentoo 4.3.3-r2 p1.1, pie-10.1.5)
> >> >>
> >> >>
> >> >> Machine hangs before starting rc scripts, but SysRq and C-A-Del work.
> >> >>
> >> >> All tasks are at __schedule+0x268/0x7bc:
> >> >>
> >> >
> >> > .config?
> >>
> >> Sorry, attached.
> >
> > Thanks, I swapped disks into my j6700 and will try to reproduce.
>
> It seems the problem is the same as mentioned here:
> http://marc.info/?l=linux-kernel&m=123920746830420&w=2
>
> The patch fixes the issue.
>
> James?
> The same problem on two of my hosts: x86_64 with LSI SAS MegaRAID
> and parisc with SYM53C8XX_2

Well, I can tell you why I don't see the problem: My parisc system has
modular SCSI, so it doesn't really test out the async system that well
(it was designed more for monolithic kernels).

On the specific patch in the email, it seems reasonable, but I think it
might interfere with the sd probe async calls, so what you might end up
losing a rache where the host fully scanned, but the sd driver not
attached.

The root cause of the problem is that we now have two different async
mechanisms in SCSI: our original one for host scanning and the new one
for sd attachment.

James


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/