Re: userspace pagecache management tool

From: bert hubert
Date: Sat Mar 03 2007 - 16:41:40 EST


On Sat, Mar 03, 2007 at 04:30:56PM -0500, Rik van Riel wrote:

> The user has been accessing the kernel tree over and over
> again, for hours on end (compile testing a patch). Along
> comes a backup program, that tells you to evict the whole
> thing from the cache.

This is arguably due to a linux misimplementation of posix_fadvise. SuS v3
clearly states:

The posix_fadvise() function shall advise the implementation on the
expected behavior of the application with respect to the data in the file
associated with the open file descriptor

Note how it refers to the *application*. This is reiterated here:

POSIX_FADV_WILLNEED
Specifies that the application expects to access the specified data in
the near future.

POSIX_FADV_DONTNEED
Specifies that the application expects that it will not access the
specified data in the near future.

POSIX_FADV_NOREUSE
Specifies that the application expects to access the specified data once
and then not reuse it thereafter.

Linux however implements posix_fadvise globally:

POSIX_FADV_DONTNEED
attempts to free cached pages associated with the specified region.

POSIX_FADV_WILLNEED and POSIX_FADV_NOREUSE
both initiate a non-blocking read of the specified region into the page
cache.

> How can you make global policy decisions based on the intent
> of one program?

By not doing so. Andrew's program is fine in principle, except that the
linux kernel treats the communication of a program's intent as a global
instruction.

Also, Andrew's description is a tad misleasing, as the size of the page
cache might be altered a lot in case content is accessed that was previously
cached!

With my userspace hat on, I'd love to have a proper way to communicate my
*program's* expectations to the kernel, without stomping other programs.

Also with the same hat on, I hope to rarely *need* to communicate my
expectations because the kernel correctly predicts many cases.

Bert

--
http://www.PowerDNS.com Open source, database driven DNS Software
http://netherlabs.nl Open and Closed source services
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/