Re: 2.6.15-mm2

From: Reuben Farrelly
Date: Thu Jan 12 2006 - 23:49:05 EST




On 13/01/2006 9:51 a.m., Jeff Garzik wrote:
Andrew Morton wrote:
Reuben Farrelly <reuben-lkml@xxxxxxxx> wrote:

Indeed that seems to fix it. I've just booted -mm3 and it came up with no problems at all.


whew.

Still up and running with 9 1/2 hrs uptime on the clock.

What about all the other problems? The oops under ata_device_add()?

And is it still saying this?

ACK the questions... I confess I dove into this earlier today, and quickly got lost in the multiple bug reports :( Nothing against Reuben -- we need more motivated bug reporters like him!

See http://www.reub.net/files/kernel/outstanding-kernel-bugs.txt for a list and status of things which are affecting me. I'm trying to keep it up to date so that it's useful - at the moment I need to keep a list else things get muddled.

Basically in a nutshell:

1. SATA crashing out when controller not configured:

ata1: SATA max UDMA/133 cmd 0x0 ctl 0x2 bmdma 0x0 irq 0

Call Trace:
[<c0103c5d>] show_stack+0x9b/0xc0
[<c0103de4>] show_registers+0x162/0x1e7
[<c0103f8f>] die+0x126/0x231
[<c01140db>] do_page_fault+0x271/0x5b9
[<c01037df>] error_code+0x4f/0x54
[<c023cabd>] class_device_del+0xa3/0x156

This is the next serious problem after the barrier/elevator bug which is now succesfully resolved. Maybe half the time I boot up I need to reset hardware as this one triggers.
It's a good genuine Intel board with the latest bios, and only started happening recently, so I'm not yet convinced it's hardware..


2. SATA 'slow to respond'

Occasionally:

ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 19 (level, low) -> IRQ 193
ahci 0000:00:1f.2: AHCI 0001.0000 32 slots 4 ports 1.5 Gbps 0xf impl SATA mode
ahci 0000:00:1f.2: flags: 64bit ncq led slum part
ata1: SATA max UDMA/133 cmd 0xF8804D00 ctl 0x0 bmdma 0x0 irq 50
ata2: SATA max UDMA/133 cmd 0xF8804D80 ctl 0x0 bmdma 0x0 irq 50
ata3: SATA max UDMA/133 cmd 0xF8804E00 ctl 0x0 bmdma 0x0 irq 50
ata4: SATA max UDMA/133 cmd 0xF8804E80 ctl 0x0 bmdma 0x0 irq 50
ata1: SATA link up 1.5 Gbps (SStatus 113)
ata1 is slow to respond, please be patient
ata1 failed to respond (30 secs)
scsi0 : ahci
ata2: SATA link up 1.5 Gbps (SStatus 113)
ata2 is slow to respond, please be patient
ata2 failed to respond (30 secs)


3. Various other smaller problems, sky2 "Cannot find PowerManagement capability" sometimes, cannot boot with acpi=off and also cannot boot with nosmp on an SMP compiled kernel as SATA times out. Also "uhci_hcd 0000:00:1d.3: Unlink after no-IRQ? Controller is probably using the wrong IRQ."
These are lower priority problems though.

The first ATA bug is a bit of a killer, so I'd appreciate if it could be looked at out first. When testing out the barrier problem I had to do quite a good number of extra reboots because the box kept oopsing on bootup :(

reuben


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/