Re: [PATCH 3/3] mlock: avoid dirtying pages and triggering writeback

From: Theodore Tso
Date: Fri Nov 19 2010 - 08:42:14 EST


On Fri, Nov 19, 2010 at 2:23 AM, Michel Lespinasse <walken@xxxxxxxxxx> wrote:
>
> Approaching the problem the other way - would there be any objection to
> adding code to do an fallocate() equivalent at the start of mlock ?
> This would be a no-op when the file is fully allocated on disk, and would
> allow mlock to return an error if the file can't get fully allocated
> (no idea what errno should be for such case, though).

My vote would be against.   If you if you mmap a sparse file and then
try writing to it willy-nilly, bad things will happen.  This is true without
a mlock().   Where is it written that mlock() has anything to do with
improving this situation?

If userspace wants to call fallocate() before it calls mlock(), it should
do that.  And in fact, in most cases, userspace should probably be
encouraged to do that.   But having mlock() call fallocate() and
then return ENOSPC if there's no room? Isn't it confusing that mlock()
call ENOSPC? Doesn't that give you cognitive dissonance? It should
because fundamentally mlock() has nothing to do with block allocation!!
Read the API spec!

Look, it was an accident / bug of the implementation that mlock()
magically dirtied all these pages.  It might have made some situations
better, but I very much doubt applications depended upon it, and I'd
really rather not perpetuate this particular magic side effect of the
previously buggy implementation of mlock().

-- Ted
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/