Re: [GIT-PULL] More Squashfs fixes for 2.6.29?

From: Phillip Lougher
Date: Wed Mar 11 2009 - 20:11:45 EST

Next message: Andrew Morton: "Re: [PATCH] NOMMU: Pages allocated to a ramfs inode's pagecache mayget wrongly discarded"
Previous message: Robert Hancock: "Re: Off-by-one in both LIBATA and IDE drivers"
In reply to: Linus Torvalds: "Re: [GIT-PULL] More Squashfs fixes for 2.6.29?"
Next in thread: Linus Torvalds: "Re: [GIT-PULL] More Squashfs fixes for 2.6.29?"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Linus Torvalds wrote:

On Wed, 11 Mar 2009, Phillip Lougher wrote:
Squashfs: Valid filesystems are flagged as bad by the corrupted fs patch

The problem arises due to an unexpected corner-case with zlib which the
corrupted filesystems patch didn't address. Very occasionally zlib
exits needing a couple of extra bytes of input (up to 6 seen bytes in the
test filesystems), but with avail_out == 0 and no more output buffer
space available. This situation was incorrectly flagged as an output buffer
overrun by the corrupted filesystems patch.

I'm almost certain that this situation is caused by a bug in how you call zlib().

You've explicitly asked for Z_NO_FLUSH, and I suspect you should be using Z_SYNC_FLUSH (of Z_FINISH if you know you have all the input data). You want as much to be inflated as possible, and you do seem to be expecting it to flush as much as possible.

Z_NO_FLUSH and Z_SYNC_FLUSH are equivalent in the zlib implementation
according to the documentation...

"In this implementation, inflate() always flushes as much output as
possible to the output buffer, and always uses the faster approach on the
first call. So the only effect of the flush parameter in this implementation
is on the return value of inflate(), as noted below, or when it returns early
because Z_BLOCK is used."

Which is why I never bothered to change it. But, I agree it should be
Z_SYNC_FLUSH for correctness.

However, I think the more direct cause is your inflate loop. The rules for running inflate() is to call it until it is done, or returns an error:

inflate() should normally be called until it returns Z_STREAM_END or an
error. However if all decompression is to be performed in a single step
(a single call of inflate), the parameter flush should be set to
Z_FINISH.

so you seem to be doing this all wrong. I think you _should_ be doing an inner loop over zlib_inflate() that just does inflates until you get a buffer error, and if you get a buffer error you then go on to the next page if avail_out is zero (all done if it's the last page), or fill the input buffer if it's empty.

I originally implemented an inner loop, something like

do {
if(avail_out == 0)
Get next output buffer

if(avail_in == 0)
Get next input buffer

do {
err = zlib_inflate( xxxx )
while (err == Z_OK);

} while (err == Z_BUF_ERROR);

But I optimised the inner loop out because of the following logic:

zlib_inflate always tries to make as much progress as possible, therefore
if it returns Z_OK it always either needs more input or more output, in which
case iterating around the outer loop will get the next buffer. If it returns
any other error code (Z_STREAM_END or Z_DATA_ERROR) we want to terminate
the loop. The inner loop iterating until Z_BUF_ERROR is received is therefore
unnecessary, it merely iterates twice, first receiving Z_OK, the second time
receiving Z_BUF_ERROR (because no progress can be made), dropping out to
the outer loop to get the next buffers.

So this should have equivalent behaviour

do {
if(avail_out == 0)
Get next output buffer

if(avail_in == 0)
Get next input buffer

err = zlib_inflate( xxxx )

} while (err == Z_OK);

I could put the inner loop back in if you prefer.

Phillip

So I really think you should fix the zlib_inflate() loop instead.

I dunno. At least that's what the docs suggest, and it's what we do in git (another heavy user of zlib, althoughb the usage patterns are rather different, and we _tend_ to

Linus

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Andrew Morton: "Re: [PATCH] NOMMU: Pages allocated to a ramfs inode's pagecache mayget wrongly discarded"
Previous message: Robert Hancock: "Re: Off-by-one in both LIBATA and IDE drivers"
In reply to: Linus Torvalds: "Re: [GIT-PULL] More Squashfs fixes for 2.6.29?"
Next in thread: Linus Torvalds: "Re: [GIT-PULL] More Squashfs fixes for 2.6.29?"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]