r8169 GigE driver problem, locks up 2.4.23 NFS subsystem

From: Marcus Blomenkamp
Date: Sat Dec 13 2003 - 07:01:29 EST


Hi all.

Since ugrading from a realtek-8139 based nic to a realtek-6139 gigabit one i
am experiencing strange network problems. Particularly it locks up the NFS
subsystem on writing to remote files.

I tried to narrow it down using several TCP/UDP traffic tools such as
'netpipe' and 'netcat' and different kernel r8169 driver versions.

I restricted the network to a single 100mbit crossover cable between a machine
with said gig-nic and a file server which has not been modified yet.

File server: 'kartoffel', RTL-8139, linux-2.4.20
Client machine: 'zwiebel', RTL-8169S, linux-2.4.23-pre9, linux-2.6.0-test11

I tried 2.4.23-pre9 and 2.6.0-test11 vanilla drivers and a special realtek
supplied version for linux-2.4. With respect to basic TCP and UDP transfer
their behaviour was identical. However 2.6 NFS subsystem was able to recover
from the network stalls, while 2.4 NFS did not release processes from 'D'
state.

My suspicion is that something related to UDP datagram to IP-over-ethernet
frame fragmentation is broken. TCP transfers in both directions work fine
while UDP transmissions over a specific datagram size stall after sending a
few k.

These objections manifest into NFS running fine over TCP and running fine over
UDP with wsize<=4096, while standard NFS mount option wsize=8192 fails.

If anyone is interested, i have dmesg output and ethereal log files handy.

Best regards, Marcus

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/