Re: [PATCH 1/1] page cache: fix page_cache_next/prev_miss off by one

From: Andrew Morton
Date: Fri Jun 02 2023 - 20:59:32 EST


On Fri, 2 Jun 2023 15:57:47 -0700 Mike Kravetz <mike.kravetz@xxxxxxxxxx> wrote:

> Ackerley Tng reported an issue with hugetlbfs fallocate here[1]. The
> issue showed up after the conversion of hugetlb page cache lookup code
> to use page_cache_next_miss.

So I'm assuming

Fixes: d0ce0e47b323 ("mm/hugetlb: convert hugetlb fault paths to use alloc_hugetlb_folio()")

?

> Code in hugetlb fallocate, userfaultfd
> and GUP is now using page_cache_next_miss to determine if a page is
> present the page cache. The following statement is used.
>
> present = page_cache_next_miss(mapping, index, 1) != index;
>
> There are two issues with page_cache_next_miss when used in this way.
> 1) If the passed value for index is equal to the 'wrap-around' value,
> the same index will always be returned. This wrap-around value is 0,
> so 0 will be returned even if page is present at index 0.
> 2) If there is no gap in the range passed, the last index in the range
> will be returned. When passed a range of 1 as above, the passed
> index value will be returned even if the page is present.
> The end result is the statement above will NEVER indicate a page is
> present in the cache, even if it is.
>
> As noted by Ackerley in [1], users can see this by hugetlb fallocate
> incorrectly returning EEXIST if pages are already present in the file.
> In addition, hugetlb pages will not be included in core dumps if they
> need to be brought in via GUP. userfaultfd UFFDIO_COPY also uses this
> code and will not notice pages already present in the cache. It may try
> to allocate a new page and potentially return ENOMEM as opposed to
> EEXIST.
>
> Both page_cache_next_miss and page_cache_prev_miss have similar issues.
> Fix by:
> - Check for index equal to 'wrap-around' value and do not exit early.
> - If no gap is found in range, return index outside range.
> - Update function description to say 'wrap-around' value could be
> returned if passed as index.
>
> [1] https://lore.kernel.org/linux-mm/cover.1683069252.git.ackerleytng@xxxxxxxxxx/
>