Re: [RESEND PATCH V2 0/3] Allow user to request memory to be locked on page fault

From: Andrew Morton
Date: Thu Jun 11 2015 - 15:34:28 EST


On Thu, 11 Jun 2015 15:21:30 -0400 Eric B Munson <emunson@xxxxxxxxxx> wrote:

> > Ditto mlockall(MCL_ONFAULT) followed by munlock(). I'm not sure
> > that even makes sense but the behaviour should be understood and
> > tested.
>
> I have extended the kselftest for lock-on-fault to try both of these
> scenarios and they work as expected. The VMA is split and the VM
> flags are set appropriately for the resulting VMAs.

munlock() should do vma merging as well. I *think* we implemented
that. More tests for you to add ;)

How are you testing the vma merging and splitting, btw? Parsing
the procfs files?

> > What's missing here is a syscall to set VM_LOCKONFAULT on an
> > arbitrary range of memory - mlock() for lock-on-fault. It's a
> > shame that mlock() didn't take a `mode' argument. Perhaps we
> > should add such a syscall - that would make the mmap flag unneeded
> > but I suppose it should be kept for symmetry.
>
> Do you want such a system call as part of this set? I would need some
> time to make sure I had thought through all the possible corners one
> could get into with such a call, so it would delay a V3 quite a bit.
> Otherwise I can send a V3 out immediately.

I think the way to look at this is to pretend that mm/mlock.c doesn't
exist and ask "how should we design these features".

And that would be:

- mmap() takes a `flags' argument: MAP_LOCKED|MAP_LOCKONFAULT.

- mlock() takes a `flags' argument. Presently that's
MLOCK_LOCKED|MLOCK_LOCKONFAULT.

- munlock() takes a `flags' argument. MLOCK_LOCKED|MLOCK_LOCKONFAULT
to specify which flags are being cleared.

- mlockall() and munlockall() ditto.


IOW, LOCKED and LOCKONFAULT are treated identically and independently.
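
To make "identically and independently" concrete, here is a small
userspace model of the flag handling, not kernel code. Everything in it
is an assumption made for illustration: the numeric flag values, struct
range_state standing in for a VMA's flags, and the model_mlock2() and
model_munlock2() names. It also assumes unknown flag bits are rejected
with -EINVAL.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical values; only the flag names come from the proposal. */
#define MLOCK_LOCKED		0x1u
#define MLOCK_LOCKONFAULT	0x2u
#define MLOCK_VALID_FLAGS	(MLOCK_LOCKED | MLOCK_LOCKONFAULT)

/* Stand-in for the lock-related flags of one memory range. */
struct range_state {
	unsigned int lock_flags;
};

/* Model of mlock2(): set the requested flags, leave the others alone. */
int model_mlock2(struct range_state *r, unsigned int flags)
{
	assert(r != NULL);
	if (flags & ~MLOCK_VALID_FLAGS)
		return -EINVAL;
	r->lock_flags |= flags;
	return 0;
}

/* Model of munlock2(): clear only the flags the caller names. */
int model_munlock2(struct range_state *r, unsigned int flags)
{
	assert(r != NULL);
	if (flags & ~MLOCK_VALID_FLAGS)
		return -EINVAL;
	r->lock_flags &= ~flags;
	return 0;
}
```

In this model, setting both flags and then calling
model_munlock2(&r, MLOCK_LOCKED) leaves MLOCK_LOCKONFAULT set, which is
the independence being described: neither flag implies or clears the
other.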

Now, that's how we would have designed all this on day one. And I
think we can do this now, by adding new mlock2() and munlock2()
syscalls. And we may as well deprecate the old mlock() and munlock(),
not that this matters much.

*Should* we do this? I'm thinking "yes" - it's all pretty simple
boilerplate and wrappers and such, and it gets the interface correct,
and extensible.

What do others think?