Re: [PATCH 3/7] mm/gup: remove vmas parameter from get_user_pages_remote()

From: Eric W. Biederman
Date: Mon Apr 17 2023 - 14:33:44 EST


Lorenzo Stoakes <lstoakes@xxxxxxxxx> writes:

> On Mon, Apr 17, 2023 at 10:16:28AM -0300, Jason Gunthorpe wrote:
>> On Mon, Apr 17, 2023 at 02:13:39PM +0100, Lorenzo Stoakes wrote:
>> > On Mon, Apr 17, 2023 at 10:09:36AM -0300, Jason Gunthorpe wrote:
>> > > On Sat, Apr 15, 2023 at 12:27:31AM +0100, Lorenzo Stoakes wrote:
>> > > > The only instances of get_user_pages_remote() invocations which used the
>> > > > vmas parameter were for a single page which can instead simply look up the
>> > > > VMA directly. In particular:-
>> > > >
>> > > > - __update_ref_ctr() looked up the VMA but did nothing with it so we simply
>> > > > remove it.
>> > > >
>> > > > - __access_remote_vm() was already using vma_lookup() when the original
>> > > > lookup failed so by doing the lookup directly this also de-duplicates the
>> > > > code.
>> > > >
>> > > > This forms part of a broader set of patches intended to eliminate the vmas
>> > > > parameter altogether.
>> > > >
>> > > > Signed-off-by: Lorenzo Stoakes <lstoakes@xxxxxxxxx>
>> > > > ---
>> > > > arch/arm64/kernel/mte.c | 5 +++--
>> > > > arch/s390/kvm/interrupt.c | 2 +-
>> > > > fs/exec.c | 2 +-
>> > > > include/linux/mm.h | 2 +-
>> > > > kernel/events/uprobes.c | 10 +++++-----
>> > > > mm/gup.c | 12 ++++--------
>> > > > mm/memory.c | 9 +++++----
>> > > > mm/rmap.c | 2 +-
>> > > > security/tomoyo/domain.c | 2 +-
>> > > > virt/kvm/async_pf.c | 3 +--
>> > > > 10 files changed, 23 insertions(+), 26 deletions(-)
>> > > >
>> > > > diff --git a/arch/arm64/kernel/mte.c b/arch/arm64/kernel/mte.c
>> > > > index f5bcb0dc6267..74d8d4007dec 100644
>> > > > --- a/arch/arm64/kernel/mte.c
>> > > > +++ b/arch/arm64/kernel/mte.c
>> > > > @@ -437,8 +437,9 @@ static int __access_remote_tags(struct mm_struct *mm, unsigned long addr,
>> > > > struct page *page = NULL;
>> > > >
>> > > > ret = get_user_pages_remote(mm, addr, 1, gup_flags, &page,
>> > > > - &vma, NULL);
>> > > > - if (ret <= 0)
>> > > > + NULL);
>> > > > + vma = vma_lookup(mm, addr);
>> > > > + if (ret <= 0 || !vma)
>> > > > break;
>> > >
>> > > Given the slightly tricky error handling, it would make sense to turn
>> > > this pattern into a helper function:
>> > >
>> > > page = get_single_user_page_locked(mm, addr, gup_flags, &vma);
>> > > if (IS_ERR(page))
>> > > [..]
>> > >
>> > > static inline struct page *get_single_user_page_locked(struct mm_struct *mm,
>> > > unsigned long addr, int gup_flags, struct vm_area_struct **vma)
>> > > {
>> > > struct page *page;
>> > > int ret;
>> > >
>> > > ret = get_user_pages_remote(*mm, addr, 1, gup_flags, &page, NULL, NULL);
>> > > if (ret < 0)
>> > > return ERR_PTR(ret);
>> > > if (WARN_ON(ret == 0))
>> > > return ERR_PTR(-EINVAL);
>> > > *vma = vma_lookup(mm, addr);
>> > > if (WARN_ON(!*vma) {
>> > > put_user_page(page);
>> > > return ERR_PTR(-EINVAL);
>> > > }
>> > > return page;
>> > > }
>> > >
>> > > It could be its own patch so this change was just a mechanical removal
>> > > of NULL
>> > >
>> > > Jason
>> > >
>> >
>> > Agreed, I think this would work better as a follow up patch however so as
>> > not to distract too much from the core change.
>>
>> I don't think you should open code sketchy error handling in several
>> places and then clean it up later. Just do it right from the start.
>>
>
> Intent was to do smallest change possible (though through review that grew
> of course), but I see your point, in this instance this is fiddly stuff and
> probably better to abstract it to enforce correct handling.
>
> I'll respin + add something like this.

Could you include in your description why looking up the vma after
getting the page does not introduce a race?

I am probably silly and just looking at this quickly but it does not
seem immediately obvious why the vma and the page should match.

I would not be surprised if you hold the appropriate mutex over the
entire operation but it just isn't apparent from the diff.

I am concerned because it is an easy mistake to refactor something into
two steps and then discover you have introduced a race.

Eric