[PATCH V6 00/16] Add the page size in the perf record

From: Kan Liang
Date: Mon Aug 10 2020 - 17:27:41 EST


Changes since V5
- Introduce a new universal page walker for the page size in the perf
subsystem.
- Rebased on Peter's tree.

Current perf can report both virtual addresses and physical addresses,
but not the page size. Without the page size information of the utilized
page, users cannot decide whether to promote/demote large pages to
optimize memory usage.

The patch set was submitted a year ago.
https://lkml.kernel.org/r/1549648509-12704-1-git-send-email-kan.liang@xxxxxxxxxxxxxxx
It introduced a __weak function, perf_get_page_size(), aim to retrieve
the page size via a given virtual address in the generic code, and
implemented a x86 specific version of perf_get_page_size().
However, the proposal was rejected, because it's a pure x86
implementation.
https://lkml.kernel.org/r/20190208200731.GN32511@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

At that time, it's not easy to support perf_get_page_size() universally,
because some key functions, e.g., p?d_large, are not supported by some
architectures.

Now, the generic p?d_leaf() functions are added in the latest kernel.
https://lkml.kernel.org/r/20191218162402.45610-2-steven.price@xxxxxxx
Starts from V6, a new universal perf_get_page_size() function is
implemented based on the generic p?d_leaf() functions.

On some platforms, e.g., X86, the page walker is invoked in an NMI
handler. So the page walker must be IRQ-safe and low overhead. Besides,
the page walker should work for both user and kernel virtual address.
The existing generic page walker, e.g., walk_page_range_novma(), is a
little bit complex and doesn't guarantee the IRQ-safe. The follow_page()
is only for the user-virtual address. So a simpler page walk function is
implemented here.

Kan Liang (11):
perf/core: Add PERF_SAMPLE_DATA_PAGE_SIZE
perf/x86/intel: Support PERF_SAMPLE_DATA_PAGE_SIZE
tools headers UAPI: Update tools's copy of linux/perf_event.h
perf record: Support new sample type for data page size
perf script: Use ULL for enum perf_output_field
perf script: Support data page size
perf sort: Add sort option for data page size
perf mem: Factor out a function to generate sort order
perf mem: Clean up output format
perf mem: Support data page size
perf test: Add test case for PERF_SAMPLE_DATA_PAGE_SIZE

Stephane Eranian (5):
perf/core: Add support for PERF_SAMPLE_CODE_PAGE_SIZE
perf tools: Add support for PERF_SAMPLE_CODE_PAGE_SIZE
perf script: Add support for PERF_SAMPLE_CODE_PAGE_SIZE
perf report: Add support for PERF_SAMPLE_CODE_PAGE_SIZE
perf test: Add test case for PERF_SAMPLE_CODE_PAGE_SIZE

arch/x86/events/intel/ds.c | 11 +-
include/linux/perf_event.h | 2 +
include/uapi/linux/perf_event.h | 6 +-
kernel/events/core.c | 132 ++++++++++++++++++-
tools/include/uapi/linux/perf_event.h | 6 +-
tools/perf/Documentation/perf-mem.txt | 3 +
tools/perf/Documentation/perf-record.txt | 6 +
tools/perf/Documentation/perf-report.txt | 2 +
tools/perf/Documentation/perf-script.txt | 5 +-
tools/perf/builtin-mem.c | 150 ++++++++++++----------
tools/perf/builtin-record.c | 4 +
tools/perf/builtin-script.c | 90 ++++++++-----
tools/perf/tests/sample-parsing.c | 10 +-
tools/perf/util/event.h | 5 +
tools/perf/util/evsel.c | 18 +++
tools/perf/util/hist.c | 5 +
tools/perf/util/hist.h | 2 +
tools/perf/util/machine.c | 7 +-
tools/perf/util/map_symbol.h | 1 +
tools/perf/util/perf_event_attr_fprintf.c | 2 +-
tools/perf/util/record.h | 2 +
tools/perf/util/session.c | 26 ++++
tools/perf/util/sort.c | 56 ++++++++
tools/perf/util/sort.h | 3 +
tools/perf/util/synthetic-events.c | 16 +++
25 files changed, 456 insertions(+), 114 deletions(-)

--
2.17.1