[PATCH v6 0/3] Ignore non-LRU-based reclaim in memcg reclaim

From: Yosry Ahmed
Date: Thu Apr 13 2023 - 06:40:45 EST


Upon running some proactive reclaim tests using memory.reclaim, we
noticed some tests flaking where writing to memory.reclaim would be
successful even though we did not reclaim the requested amount fully
Looking further into it, I discovered that *sometimes* we overestimate
the number of reclaimed pages in memcg reclaim.

Reclaimed pages through other means than LRU-based reclaim are tracked
through reclaim_state in struct scan_control, which is stashed in
current task_struct. These pages are added to the number of reclaimed
pages through LRUs. For memcg reclaim, these pages generally cannot be
linked to the memcg under reclaim and can cause an overestimated count
of reclaimed pages. This short series tries to address that.

Patch 1 ignores pages reclaimed outside of LRU reclaim in memcg reclaim.
The pages are uncharged anyway, so even if we end up under-reporting
reclaimed pages we will still succeed in making progress during
charging.

Patches 2-3 are just refactoring. Patch 2 moves set_reclaim_state()
helper next to flush_reclaim_state(). Patch 3 adds a helper that wraps
updating current->reclaim_state, and renames
reclaim_state->reclaimed_slab to reclaim_state->reclaimed.

v5 -> v6:
- Re-arranged the patches:
- Pulled flush_reclaim_state() helper with the clarifyng comment to
the first patch so that the patch is clear on its own (David
Hildenbrand).
- Separated moving set_reclaim_state() to a separate patch so that we
can easily drop it if deemed unnecessary (Questioned by Peter Xu).
- Added a fixes tag (David Hildenbrand).
- Reworded comment in flush_reclaim_state() (David Hildenbrand and Tim
Chen).
- Dropped reclaim_state argument to flush_reclaim_state() and use
current->reclaim_state directly instead (Peter Xu).

v5: https://lore.kernel.org/linux-mm/20230405185427.1246289-1-yosryahmed@xxxxxxxxxx/

Yosry Ahmed (3):
mm: vmscan: ignore non-LRU-based reclaim in memcg reclaim
mm: vmscan: move set_task_reclaim_state() near flush_reclaim_state()
mm: vmscan: refactor updating current->reclaim_state

fs/inode.c | 3 +-
fs/xfs/xfs_buf.c | 3 +-
include/linux/swap.h | 17 ++++++++++-
mm/slab.c | 3 +-
mm/slob.c | 6 ++--
mm/slub.c | 5 ++-
mm/vmscan.c | 72 ++++++++++++++++++++++++++++++++------------
7 files changed, 76 insertions(+), 33 deletions(-)

--
2.40.0.577.gac1e443424-goog