Re: [PATCH v3 0/4] Implement IOCTL to get and clear soft dirty PTE

From: Greg KH
Date: Tue Oct 18 2022 - 14:20:28 EST


On Tue, Oct 18, 2022 at 06:32:46PM +0500, Muhammad Usama Anjum wrote:
> On 10/18/22 3:36 PM, Muhammad Usama Anjum wrote:
> > > > > > > > I mean we should be able to specify for what
> > > > > > > > pages we need to get info
> > > > > > > > for. An ioctl argument can have these four fields:
> > > > > > > > * required bits (rmask & mask == mask) - all
> > > > > > > > bits from this mask have to be set.
> > > > > > > > * any of these bits (amask & mask != 0) - any of these bits is set.
> > > > > > > > * exclude masks (emask & mask == 0) = none of these bits are set.
> > > > > > > > * return mask - bits that have to be reported to user.
> > > > > The required mask (rmask) makes sense to me. At the moment, I only know
> > > > > about the practical use case for the required mask. Can you share how
> > > > > can any and exclude masks help for the CRIU?
> > > > >
> > > >
> > > > I looked at should_dump_page in the CRIU code:
> > > > https://github.com/checkpoint-restore/criu/blob/45641ab26d7bb78706a6215fdef8f9133abf8d10/criu/mem.c#L102
> > > >
> > > > When CRIU dumps file private mappings, it needs to get pages that have
> > > > PME_PRESENT or PME_SWAP but don't have PME_FILE.
> > >
> > > I would really like to see the mask discussed will be adopted. With
> > > it CRIU will
> > > be able to migrate huge sparse VMAs assuming that a single hole is
> > > processed in
> > > O(1) time.
> > >
> > > Use cases for migrating sparse VMAs are binaries sanitized with
> > > ASAN, MSAN or
> > > TSAN [1]. All of these sanitizers produce sparse mappings of shadow
> > > memory [2].
> > > Being able to migrate such binaries allows to highly reduce the
> > > amount of work
> > > needed to identify and fix post-migration crashes, which happen
> > > constantly.
> > >
> >
> > Hello all,
> >
> > I've included the masks which the CRIU developers have specified.
> > max_out_page is another new optional variable which is needed to
> > terminate the operation without visiting all the pages after finding the
> > max_out_page number of desired pages. There is no way to terminate the
> > operation without this variable.
> >
> > How does the interface looks now? Please comment.
> >
> Updated interface with only one IOCTL. If vec is defined, get operation will
> be performed. If PAGEMAP_SD_CLEAR flag is specified, soft dirty bit will be
> cleared as well. CLEAR flag can only be specified for clearing soft dirty
> bit.
>
> /* PAGEMAP IOCTL */
> #define PAGEMAP_SCAN _IOWR('f', 16, struct pagemap_sd_args)
>
> /* Bits are set in the bitmap of the page_region and masks in
> pagemap_sd_args */
> #define PAGE_IS_SD 1 << 0
> #define PAGE_IS_FILE 1 << 1
> #define PAGE_IS_PRESENT 1 << 2
> #define PAGE_IS_SWAPED 1 << 3
>
> /**
> * struct page_region - Page region with bitmap flags
> * @start: Start of the region
> * @len: Length of the region
> * bitmap: Bits sets for the region
> */
> struct page_region {
> __u64 start;
> __u64 len;
> __u64 bitmap;
> };
>
> /**
> * struct pagemap_sd_args - Soft-dirty IOCTL argument
> * @start: Starting address of the page
> * @len: Length of the region (All the pages in this length are included)
> * @vec: Output page_region struct array
> * @vec_len: Length of the page_region struct array
> * @max_out_page: Optional max output pages (It must be less than vec_len if
> specified)
> * @flags: Special flags for the IOCTL
> * @rmask: Required mask - All of these bits have to be set

Why have it at all if it always has to be all 1s?

> * @amask: Any mask - Any of these bits are set

which ones?

> * @emask: Exclude mask - None of these bits are set

Why have it, if none are ever set?

> * @rmask: Bits that have to be reported to the user in page_region

I feel like I have no idea what these bits are...

Anyway, please send a real patch, with real code, so we have a better
idea of what is happening. AFTER you have tested and made it all work
properly.

thanks,

greg k-h