Re: [PATCH update] firewire: fw-ohci: work around generation bugin TI controllers (fix AV/C and more)

From: Stefan Richter
Date: Wed Apr 16 2008 - 06:22:59 EST


Jarod Wilson wrote:
> On Saturday 12 April 2008 04:31:25 pm Stefan Richter wrote:
>> https://bugzilla.redhat.com/show_bug.cgi?id=243081
...
>> @@ -2360,6 +2369,8 @@ pci_probe(struct pci_dev *dev, const str
>> ohci->old_uninorth = dev->vendor == PCI_VENDOR_ID_APPLE &&
>> dev->device == PCI_DEVICE_ID_APPLE_UNI_N_FW;
>> #endif
>> + ohci->bus_reset_packet_quirk = dev->vendor == PCI_VENDOR_ID_TI;
>> +
>
> I have a few cards with PCI_VENDOR_ID_CREATIVE with a TI TSB41AB2 chip on 'em
> (SoundBlaster Audigy w/FireWire port). I've not had any issues on any of the
> cards I've got, but do we want to add them to the work-around list just to be
> safe?

Right now TSB82AA2 is confirmed buggy by two slightly differing reports
(Mladen's and mine) and TSB43AB22 or TSB43AB22A is under suspicion.

Whether TSB41AB2 is affected or not is impossible to say without having
a hardware combination which triggers the bug. So far it seems the bug
occurs if there are more bus reset events than self ID complete events.
Note, I noticed the bug on my TSB82AA2 only by means of the bus reset
packet generation logging, not because of any subsequent malfunction
(because a bus reset forced by the bus manager code is always saving the
day on my setup).

If you could reproduce these conditions somehow and then check the debug
log for temporary generation mismatches, that would be good. Note that
the conditions for the bug to occur are almost exclusively influenced by
the PHY layer hardware of all nodes on the bus, while the bug itself is
caused by the link layer controller.

Anyway, if we get confirmation that the proposed patch fixes
TSB43AB22(A) and you don't have the means to prove that TSB41AB2 is not
buggy, then let's add the vendor ID of Creative as well. As mentioned,
the patch introduces a certain danger of
1. fw-core missing to send responses to requests which came in
immediately after bus reset (unlikely border case),
2. fw-core responding to requests which came in before the last
bus reset (even less likely border case, because that would only
happen if the self ID complete event was processed before an older
AR event, which does not happen according to my observations with
a bunch of different chips).

1. is to be compensated by the requester by retry. We have seen that
some simplistic requesters have difficulties with that. But luckily,
these requesters are AV/C targets and send requests as AV/C transaction
responses. I think the chances for this to happen are rather low.

2. is to be compensated by the requester by proper matching of
transaction labels and discarding transaction labels at bus reset events.

AFAIU, the ieee1394 stack has always been in danger to trip these border
cases because ohci1394 never looks at the bus reset packet generation.
So this speaks _for_ enabling the bus_reset_packet_quirk if a card is
merely under suspicion of having the bug and we are unable to find
someone who can disprove the suspicion for us.
--
Stefan Richter
-=====-==--- -=-- -==-=
http://arcgraph.de/sr/
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/