Re: strange ext3 corruption problem on 2.6.x

From: John Pearson
Date: Tue Mar 23 2004 - 02:35:31 EST


OK,

I've seen this one now, too; here's my datapoint:

First, under vanilla 2.6.3:

EXT3-fs error (device dm-0): ext3_readdir: bad entry in directory #917711: rec_len % 4 != 0 - offset=0, inode=1182746341, rec_len=16861, name_len=185
Aborting journal on device dm-0.
ext3_abort called.
EXT3-fs abort (device dm-0): ext3_journal_start: Detected aborted journal
Remounting filesystem read-only



Then, under 2.6.4+skas3:


EXT3-fs error (device dm-3): ext3_readdir: bad entry in directory #510327: directory entry across blocks - offset=0, inode=0, rec_len=5044, name_len=113
Aborting journal on device dm-3.
ext3_abort called.
EXT3-fs abort (device dm-3): ext3_journal_start: Detected aborted journal
Remounting filesystem read-only



I'm running ext3 over raid5; In both cases, fsck spotted the aborted journal and checked the FS, which came up clean.

No other issues in many days of uptime, including kernel compiles, etc., so I'm reasonably confident of the RAM and hardware generally.

I wouldn't describe either volume as seeing heavy use - there's rarely more than one reader, and almost never more than one writer.

dm-3 has had no writes since last boot (it serves images to diskless clients, including NFS roots mounted ro); dm-0 had seen a few writes (it's a read-mostly FTP server containing mirrors of debian-security and a few other things, synced about once a month).

'directory #510327' on dm-3 is a manpage directory, which shows a size of 20480 and contains 751 files; 'directory #917711' on dm-0 has a size of 8192 and contains 101 files.

The box is a UMP Athlon XP with 512Mb DDR RAM on a VIA VT8237-based
board, using on-board IDE + a Promise 20268 controller (but as the RAID layer works, I doubt it's the hardware).
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/