Re: [PATCH v3 3/5] mm: memcg: make stats flushing threshold per-memcg

From: kernel test robot
Date: Wed Nov 22 2023 - 08:54:53 EST




Hello,

kernel test robot noticed a -30.2% regression of will-it-scale.per_thread_ops on:


commit: c7fbfc7b4e089c4a9b292b1973a42a5761c1342f ("[PATCH v3 3/5] mm: memcg: make stats flushing threshold per-memcg")
url: https://github.com/intel-lab-lkp/linux/commits/Yosry-Ahmed/mm-memcg-change-flush_next_time-to-flush_last_time/20231116-103300
base: https://git.kernel.org/cgit/linux/kernel/git/akpm/mm.git mm-everything
patch link: https://lore.kernel.org/all/20231116022411.2250072-4-yosryahmed@xxxxxxxxxx/
patch subject: [PATCH v3 3/5] mm: memcg: make stats flushing threshold per-memcg

testcase: will-it-scale
test machine: 104 threads 2 sockets (Skylake) with 192G memory
parameters:

nr_task: 50%
mode: thread
test: fallocate2
cpufreq_governor: performance




If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@xxxxxxxxx>
| Closes: https://lore.kernel.org/oe-lkp/202311221542.973f16ad-oliver.sang@xxxxxxxxx


Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20231122/202311221542.973f16ad-oliver.sang@xxxxxxxxx

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-8.3/thread/50%/debian-11.1-x86_64-20220510.cgz/lkp-skl-fpga01/fallocate2/will-it-scale

commit:
c5caa5bb03 ("mm: memcg: move vmstats structs definition above flushing code")
c7fbfc7b4e ("mm: memcg: make stats flushing threshold per-memcg")

c5caa5bb0376e3e5 c7fbfc7b4e089c4a9b292b1973a
---------------- ---------------------------
%stddev %change %stddev
\ | \
1.84 -0.5 1.37 ± 9% mpstat.cpu.all.usr%
0.08 -25.0% 0.06 turbostat.IPC
3121 -9.2% 2835 ± 5% vmstat.system.cs
78.17 ± 12% +96.6% 153.67 ± 18% perf-c2c.DRAM.local
504.17 ± 6% +34.4% 677.50 ± 4% perf-c2c.DRAM.remote
3980762 -30.2% 2777359 will-it-scale.52.threads
76552 -30.2% 53410 will-it-scale.per_thread_ops
3980762 -30.2% 2777359 will-it-scale.workload
1.192e+09 ± 2% -30.2% 8.324e+08 ± 3% numa-numastat.node0.local_node
1.192e+09 ± 2% -30.2% 8.324e+08 ± 3% numa-numastat.node0.numa_hit
1.215e+09 ± 2% -30.3% 8.471e+08 ± 3% numa-numastat.node1.local_node
1.215e+09 ± 2% -30.3% 8.474e+08 ± 3% numa-numastat.node1.numa_hit
1.192e+09 ± 2% -30.2% 8.324e+08 ± 3% numa-vmstat.node0.numa_hit
1.192e+09 ± 2% -30.2% 8.324e+08 ± 3% numa-vmstat.node0.numa_local
1.215e+09 ± 2% -30.3% 8.474e+08 ± 3% numa-vmstat.node1.numa_hit
1.215e+09 ± 2% -30.3% 8.471e+08 ± 3% numa-vmstat.node1.numa_local
31404 -1.6% 30913 proc-vmstat.nr_slab_reclaimable
2.408e+09 -30.2% 1.68e+09 proc-vmstat.numa_hit
2.407e+09 -30.2% 1.68e+09 proc-vmstat.numa_local
2.404e+09 -30.2% 1.678e+09 proc-vmstat.pgalloc_normal
2.403e+09 -30.2% 1.678e+09 proc-vmstat.pgfree
0.05 ± 8% -27.3% 0.04 ± 4% perf-sched.wait_and_delay.avg.ms.__cond_resched.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
0.05 ± 10% -24.9% 0.04 ± 8% perf-sched.wait_and_delay.avg.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
0.05 ± 8% -27.2% 0.04 ± 5% perf-sched.wait_and_delay.avg.ms.__cond_resched.shmem_undo_range.shmem_setattr.notify_change.do_truncate
1.14 +14.1% 1.30 perf-sched.wait_and_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
198.38 ± 3% +16.5% 231.12 ± 3% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
1563 ± 5% -11.4% 1384 ± 5% perf-sched.wait_and_delay.count.__cond_resched.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
1677 ± 5% -18.7% 1364 ± 4% perf-sched.wait_and_delay.count.__cond_resched.shmem_undo_range.shmem_setattr.notify_change.do_truncate
3815 ± 2% -14.5% 3260 ± 2% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.51 ± 5% -32.3% 0.35 ± 16% perf-sched.wait_and_delay.max.ms.__cond_resched.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
0.47 ± 11% -33.3% 0.31 ± 20% perf-sched.wait_and_delay.max.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
2.37 +13.0% 2.68 ± 2% perf-sched.wait_and_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.05 ± 8% -27.3% 0.04 ± 4% perf-sched.wait_time.avg.ms.__cond_resched.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
0.05 ± 10% -24.9% 0.04 ± 8% perf-sched.wait_time.avg.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
0.05 ± 8% -27.2% 0.04 ± 5% perf-sched.wait_time.avg.ms.__cond_resched.shmem_undo_range.shmem_setattr.notify_change.do_truncate
1.14 +14.1% 1.30 perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
198.37 ± 3% +16.5% 231.11 ± 3% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.39 ± 31% -72.9% 0.11 ± 28% perf-sched.wait_time.max.ms.__cond_resched.__alloc_pages.alloc_pages_mpol.shmem_alloc_folio.shmem_alloc_and_add_folio
0.51 ± 5% -32.3% 0.35 ± 16% perf-sched.wait_time.max.ms.__cond_resched.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
0.47 ± 11% -33.3% 0.31 ± 20% perf-sched.wait_time.max.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
2.37 +13.1% 2.68 ± 2% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.82 ± 14% +174.7% 2.24 ± 30% perf-stat.i.MPKI
8.476e+09 -27.7% 6.127e+09 ± 10% perf-stat.i.branch-instructions
55486131 -28.1% 39884260 ± 6% perf-stat.i.branch-misses
14.80 ± 2% +6.2 20.96 ± 7% perf-stat.i.cache-miss-rate%
30690945 ± 3% +79.9% 55207216 ± 10% perf-stat.i.cache-misses
2.066e+08 +24.2% 2.567e+08 ± 7% perf-stat.i.cache-references
3070 -9.7% 2772 ± 5% perf-stat.i.context-switches
3.58 ± 2% +39.7% 5.00 ± 11% perf-stat.i.cpi
4688 ± 3% -47.9% 2442 ± 4% perf-stat.i.cycles-between-cache-misses
4098916 -29.7% 2879809 perf-stat.i.dTLB-load-misses
1.052e+10 -27.5% 7.63e+09 ± 10% perf-stat.i.dTLB-loads
5.845e+09 -30.7% 4.051e+09 ± 10% perf-stat.i.dTLB-stores
77.61 -6.0 71.56 perf-stat.i.iTLB-load-miss-rate%
4058819 -32.5% 2739054 ± 8% perf-stat.i.iTLB-load-misses
4.089e+10 -28.3% 2.932e+10 ± 10% perf-stat.i.instructions
0.28 -26.8% 0.21 ± 5% perf-stat.i.ipc
240.84 -27.9% 173.57 ± 10% perf-stat.i.metric.M/sec
3814721 ± 3% +72.2% 6569712 ± 10% perf-stat.i.node-load-misses
407308 ± 7% +72.0% 700502 ± 18% perf-stat.i.node-loads
1323090 ± 2% -28.1% 951590 ± 12% perf-stat.i.node-store-misses
36568 ± 2% -20.7% 29014 ± 12% perf-stat.i.node-stores
0.75 ± 3% +151.0% 1.88 perf-stat.overall.MPKI
14.85 ± 2% +6.6 21.47 ± 3% perf-stat.overall.cache-miss-rate%
3.53 +33.8% 4.72 perf-stat.overall.cpi
4704 ± 3% -46.8% 2505 perf-stat.overall.cycles-between-cache-misses
77.62 -6.2 71.39 perf-stat.overall.iTLB-load-miss-rate%
0.28 -25.3% 0.21 perf-stat.overall.ipc
3121462 +7.4% 3353425 perf-stat.overall.path-length
8.451e+09 -27.6% 6.119e+09 ± 10% perf-stat.ps.branch-instructions
55320195 -28.0% 39804925 ± 6% perf-stat.ps.branch-misses
30594557 ± 3% +80.2% 55116821 ± 9% perf-stat.ps.cache-misses
2.059e+08 +24.4% 2.561e+08 ± 6% perf-stat.ps.cache-references
3059 -9.6% 2765 ± 5% perf-stat.ps.context-switches
4085949 -29.7% 2871251 perf-stat.ps.dTLB-load-misses
1.049e+10 -27.4% 7.62e+09 ± 10% perf-stat.ps.dTLB-loads
5.828e+09 -30.6% 4.046e+09 ± 10% perf-stat.ps.dTLB-stores
4046367 -32.4% 2734227 ± 7% perf-stat.ps.iTLB-load-misses
4.077e+10 -28.2% 2.928e+10 ± 10% perf-stat.ps.instructions
3802900 ± 3% +72.5% 6559980 ± 10% perf-stat.ps.node-load-misses
406123 ± 7% +72.2% 699397 ± 17% perf-stat.ps.node-loads
1319155 ± 2% -28.0% 950261 ± 12% perf-stat.ps.node-store-misses
36542 ± 2% -20.6% 29007 ± 11% perf-stat.ps.node-stores
1.243e+13 -25.0% 9.313e+12 perf-stat.total.instructions
1.26 ± 2% -0.4 0.91 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.fallocate64
1.22 -0.3 0.88 ± 2% perf-profile.calltrace.cycles-pp.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate
0.92 ± 2% -0.3 0.62 ± 3% perf-profile.calltrace.cycles-pp.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate
1.04 -0.3 0.76 ± 2% perf-profile.calltrace.cycles-pp.alloc_pages_mpol.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
0.80 -0.2 0.58 ± 3% perf-profile.calltrace.cycles-pp.__alloc_pages.alloc_pages_mpol.shmem_alloc_folio.shmem_alloc_and_add_folio.shmem_get_folio_gfp
1.25 ± 2% -0.2 1.07 perf-profile.calltrace.cycles-pp.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range.shmem_setattr
1.25 ± 2% -0.2 1.07 perf-profile.calltrace.cycles-pp.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range.shmem_setattr.notify_change
1.23 ± 2% -0.2 1.06 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release.shmem_undo_range
1.23 ± 2% -0.2 1.06 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu.__folio_batch_release
1.23 ± 2% -0.2 1.05 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.folio_batch_move_lru.lru_add_drain_cpu
1.16 ± 2% -0.1 1.02 ± 2% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
0.68 +0.1 0.75 ± 2% perf-profile.calltrace.cycles-pp.__mem_cgroup_uncharge_list.release_pages.__folio_batch_release.shmem_undo_range.shmem_setattr
1.07 +0.1 1.18 ± 2% perf-profile.calltrace.cycles-pp.lru_add_fn.folio_batch_move_lru.folio_add_lru.shmem_alloc_and_add_folio.shmem_get_folio_gfp
2.95 +0.3 3.21 ± 2% perf-profile.calltrace.cycles-pp.truncate_inode_folio.shmem_undo_range.shmem_setattr.notify_change.do_truncate
2.60 +0.4 2.95 perf-profile.calltrace.cycles-pp.filemap_remove_folio.truncate_inode_folio.shmem_undo_range.shmem_setattr.notify_change
2.27 +0.4 2.71 ± 2% perf-profile.calltrace.cycles-pp.__filemap_remove_folio.filemap_remove_folio.truncate_inode_folio.shmem_undo_range.shmem_setattr
1.38 ± 3% +0.5 1.85 ± 5% perf-profile.calltrace.cycles-pp.get_mem_cgroup_from_mm.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
2.29 ± 2% +0.6 2.90 ± 2% perf-profile.calltrace.cycles-pp.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate
0.00 +0.6 0.63 ± 2% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.release_pages.__folio_batch_release.shmem_undo_range.shmem_setattr
0.00 +0.7 0.74 ± 3% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.lru_add_fn.folio_batch_move_lru.folio_add_lru.shmem_alloc_and_add_folio
1.30 +0.8 2.07 ± 3% perf-profile.calltrace.cycles-pp.filemap_unaccount_folio.__filemap_remove_folio.filemap_remove_folio.truncate_inode_folio.shmem_undo_range
0.73 ± 2% +0.8 1.53 ± 2% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.__mod_lruvec_page_state.filemap_unaccount_folio.__filemap_remove_folio.filemap_remove_folio
1.23 +0.8 2.04 ± 3% perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.filemap_unaccount_folio.__filemap_remove_folio.filemap_remove_folio.truncate_inode_folio
0.00 +0.8 0.82 ± 2% perf-profile.calltrace.cycles-pp.__count_memcg_events.mem_cgroup_commit_charge.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp
1.39 ± 2% +0.9 2.32 ± 2% perf-profile.calltrace.cycles-pp.__mod_lruvec_page_state.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
0.59 ± 2% +0.9 1.53 ± 2% perf-profile.calltrace.cycles-pp.__mod_memcg_lruvec_state.__mod_lruvec_page_state.shmem_add_to_page_cache.shmem_alloc_and_add_folio.shmem_get_folio_gfp
38.12 +1.0 39.16 perf-profile.calltrace.cycles-pp.vfs_fallocate.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe.fallocate64
0.62 ± 4% +1.1 1.71 ± 3% perf-profile.calltrace.cycles-pp.mem_cgroup_commit_charge.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate
37.61 +1.2 38.80 perf-profile.calltrace.cycles-pp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64.entry_SYSCALL_64_after_hwframe
36.54 +1.5 38.02 perf-profile.calltrace.cycles-pp.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate.do_syscall_64
35.97 +1.6 37.60 perf-profile.calltrace.cycles-pp.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate.__x64_sys_fallocate
2.48 ± 3% +2.3 4.80 ± 4% perf-profile.calltrace.cycles-pp.__mem_cgroup_charge.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_fallocate.vfs_fallocate
1.28 ± 2% -0.4 0.92 perf-profile.children.cycles-pp.syscall_return_via_sysret
1.23 -0.3 0.88 ± 2% perf-profile.children.cycles-pp.shmem_alloc_folio
0.95 ± 2% -0.3 0.64 ± 3% perf-profile.children.cycles-pp.shmem_inode_acct_blocks
1.07 -0.3 0.77 ± 3% perf-profile.children.cycles-pp.alloc_pages_mpol
0.86 ± 2% -0.3 0.58 ± 2% perf-profile.children.cycles-pp.xas_store
0.84 -0.2 0.61 ± 3% perf-profile.children.cycles-pp.__alloc_pages
1.26 ± 2% -0.2 1.08 perf-profile.children.cycles-pp.lru_add_drain_cpu
0.61 ± 3% -0.2 0.43 perf-profile.children.cycles-pp.__entry_text_start
0.56 ± 2% -0.2 0.40 ± 3% perf-profile.children.cycles-pp.free_unref_page_list
0.26 ± 7% -0.2 0.11 ± 5% perf-profile.children.cycles-pp.__list_add_valid_or_report
1.19 ± 2% -0.1 1.04 ± 2% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.45 ± 4% -0.1 0.31 perf-profile.children.cycles-pp.__mod_lruvec_state
0.48 ± 2% -0.1 0.35 ± 2% perf-profile.children.cycles-pp.get_page_from_freelist
0.38 ± 5% -0.1 0.27 ± 2% perf-profile.children.cycles-pp.xas_load
0.38 ± 2% -0.1 0.27 ± 2% perf-profile.children.cycles-pp._raw_spin_lock
0.33 ± 4% -0.1 0.23 ± 2% perf-profile.children.cycles-pp.__mod_node_page_state
0.42 ± 2% -0.1 0.32 ± 5% perf-profile.children.cycles-pp.find_lock_entries
0.32 ± 2% -0.1 0.23 ± 2% perf-profile.children.cycles-pp.__dquot_alloc_space
0.33 ± 2% -0.1 0.24 ± 3% perf-profile.children.cycles-pp.rmqueue
0.24 ± 3% -0.1 0.17 ± 4% perf-profile.children.cycles-pp.xas_descend
0.23 ± 3% -0.1 0.16 ± 4% perf-profile.children.cycles-pp.xas_init_marks
0.25 ± 3% -0.1 0.17 ± 2% perf-profile.children.cycles-pp.xas_clear_mark
0.23 ± 2% -0.1 0.16 ± 5% perf-profile.children.cycles-pp.__cond_resched
0.28 ± 5% -0.1 0.22 ± 2% perf-profile.children.cycles-pp.filemap_get_entry
0.24 ± 3% -0.1 0.18 ± 4% perf-profile.children.cycles-pp.truncate_cleanup_folio
0.16 ± 4% -0.1 0.10 ± 4% perf-profile.children.cycles-pp.xas_find_conflict
0.09 ± 7% -0.1 0.03 ± 70% perf-profile.children.cycles-pp.mem_cgroup_update_lru_size
0.18 -0.1 0.12 ± 6% perf-profile.children.cycles-pp.shmem_recalc_inode
0.18 ± 2% -0.1 0.12 ± 3% perf-profile.children.cycles-pp.folio_unlock
0.17 ± 4% -0.1 0.12 ± 3% perf-profile.children.cycles-pp.free_unref_page_prepare
0.16 ± 6% -0.1 0.11 ± 4% perf-profile.children.cycles-pp.security_file_permission
0.13 ± 7% -0.0 0.08 ± 13% perf-profile.children.cycles-pp.security_vm_enough_memory_mm
0.20 ± 4% -0.0 0.15 ± 2% perf-profile.children.cycles-pp.free_unref_page_commit
0.16 ± 5% -0.0 0.11 ± 3% perf-profile.children.cycles-pp.noop_dirty_folio
0.15 ± 5% -0.0 0.11 ± 4% perf-profile.children.cycles-pp.file_modified
0.12 ± 10% -0.0 0.08 perf-profile.children.cycles-pp.__percpu_counter_limited_add
0.19 ± 5% -0.0 0.14 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.11 ± 12% -0.0 0.06 ± 17% perf-profile.children.cycles-pp.cap_vm_enough_memory
0.14 ± 5% -0.0 0.10 ± 6% perf-profile.children.cycles-pp.__fget_light
0.14 ± 7% -0.0 0.10 ± 4% perf-profile.children.cycles-pp.apparmor_file_permission
0.14 ± 2% -0.0 0.10 ± 4% perf-profile.children.cycles-pp.__folio_cancel_dirty
0.12 ± 3% -0.0 0.08 ± 4% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
0.11 ± 10% -0.0 0.08 ± 16% perf-profile.children.cycles-pp.__vm_enough_memory
0.11 ± 8% -0.0 0.08 ± 4% perf-profile.children.cycles-pp.xas_start
0.11 ± 3% -0.0 0.08 ± 4% perf-profile.children.cycles-pp.__fsnotify_parent
0.18 ± 2% -0.0 0.14 ± 6% perf-profile.children.cycles-pp.__list_del_entry_valid_or_report
0.08 ± 6% -0.0 0.04 ± 45% perf-profile.children.cycles-pp.__get_file_rcu
0.12 ± 7% -0.0 0.09 ± 5% perf-profile.children.cycles-pp.inode_add_bytes
0.11 ± 4% -0.0 0.08 ± 8% perf-profile.children.cycles-pp._raw_spin_trylock
0.08 ± 6% -0.0 0.05 ± 45% perf-profile.children.cycles-pp.percpu_counter_add_batch
0.10 -0.0 0.07 ± 9% perf-profile.children.cycles-pp.inode_needs_update_time
0.09 ± 7% -0.0 0.06 ± 6% perf-profile.children.cycles-pp.get_pfnblock_flags_mask
0.07 ± 6% -0.0 0.05 ± 45% perf-profile.children.cycles-pp.shmem_is_huge
0.09 ± 7% -0.0 0.07 ± 7% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
0.08 ± 4% -0.0 0.06 ± 8% perf-profile.children.cycles-pp.policy_nodemask
0.19 ± 3% -0.0 0.17 ± 4% perf-profile.children.cycles-pp.try_charge_memcg
0.08 ± 8% -0.0 0.06 ± 8% perf-profile.children.cycles-pp.down_write
0.09 ± 7% -0.0 0.07 ± 5% perf-profile.children.cycles-pp.xas_create
0.09 ± 7% -0.0 0.07 ± 7% perf-profile.children.cycles-pp.filemap_free_folio
0.08 ± 4% -0.0 0.06 ± 6% perf-profile.children.cycles-pp.xas_find
0.07 ± 5% +0.0 0.09 ± 5% perf-profile.children.cycles-pp.propagate_protected_usage
0.25 +0.0 0.28 ± 2% perf-profile.children.cycles-pp.uncharge_folio
0.43 ± 2% +0.0 0.47 ± 2% perf-profile.children.cycles-pp.uncharge_batch
0.69 +0.1 0.75 ± 2% perf-profile.children.cycles-pp.__mem_cgroup_uncharge_list
1.10 +0.1 1.20 ± 2% perf-profile.children.cycles-pp.lru_add_fn
2.96 +0.3 3.21 perf-profile.children.cycles-pp.truncate_inode_folio
2.60 +0.4 2.96 perf-profile.children.cycles-pp.filemap_remove_folio
2.29 +0.4 2.73 ± 2% perf-profile.children.cycles-pp.__filemap_remove_folio
1.39 ± 3% +0.5 1.85 ± 5% perf-profile.children.cycles-pp.get_mem_cgroup_from_mm
2.34 ± 2% +0.6 2.93 ± 2% perf-profile.children.cycles-pp.shmem_add_to_page_cache
0.18 ± 5% +0.7 0.92 ± 2% perf-profile.children.cycles-pp.__count_memcg_events
1.32 +0.8 2.07 ± 3% perf-profile.children.cycles-pp.filemap_unaccount_folio
38.14 +1.0 39.17 perf-profile.children.cycles-pp.vfs_fallocate
0.64 ± 4% +1.1 1.72 ± 3% perf-profile.children.cycles-pp.mem_cgroup_commit_charge
37.63 +1.2 38.81 perf-profile.children.cycles-pp.shmem_fallocate
36.57 +1.5 38.05 perf-profile.children.cycles-pp.shmem_get_folio_gfp
36.04 +1.6 37.65 perf-profile.children.cycles-pp.shmem_alloc_and_add_folio
2.66 +1.7 4.38 ± 3% perf-profile.children.cycles-pp.__mod_lruvec_page_state
2.49 ± 2% +2.3 4.80 ± 4% perf-profile.children.cycles-pp.__mem_cgroup_charge
1.99 ± 2% +2.5 4.46 perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
1.28 ± 2% -0.4 0.92 perf-profile.self.cycles-pp.syscall_return_via_sysret
0.69 ± 2% -0.2 0.50 ± 2% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.54 ± 2% -0.2 0.36 ± 2% perf-profile.self.cycles-pp.release_pages
0.47 ± 2% -0.2 0.31 ± 2% perf-profile.self.cycles-pp.xas_store
0.53 ± 3% -0.2 0.37 ± 2% perf-profile.self.cycles-pp.__entry_text_start
0.36 ± 3% -0.2 0.21 ± 2% perf-profile.self.cycles-pp.shmem_add_to_page_cache
0.26 ± 8% -0.2 0.11 ± 5% perf-profile.self.cycles-pp.__list_add_valid_or_report
1.14 ± 2% -0.1 1.01 ± 2% perf-profile.self.cycles-pp.syscall_exit_to_user_mode
0.40 ± 4% -0.1 0.28 ± 4% perf-profile.self.cycles-pp.lru_add_fn
0.37 ± 2% -0.1 0.26 perf-profile.self.cycles-pp._raw_spin_lock
0.32 ± 3% -0.1 0.22 ± 2% perf-profile.self.cycles-pp.__mod_node_page_state
0.35 ± 3% -0.1 0.25 ± 2% perf-profile.self.cycles-pp.shmem_fallocate
0.50 ± 2% -0.1 0.40 ± 3% perf-profile.self.cycles-pp.folio_batch_move_lru
0.34 ± 3% -0.1 0.26 ± 5% perf-profile.self.cycles-pp.find_lock_entries
0.28 ± 2% -0.1 0.20 ± 5% perf-profile.self.cycles-pp.__alloc_pages
0.22 ± 2% -0.1 0.16 ± 3% perf-profile.self.cycles-pp.xas_clear_mark
0.21 ± 3% -0.1 0.15 ± 4% perf-profile.self.cycles-pp.shmem_alloc_and_add_folio
0.18 ± 3% -0.1 0.12 ± 5% perf-profile.self.cycles-pp.free_unref_page_list
0.22 ± 3% -0.1 0.16 ± 3% perf-profile.self.cycles-pp.xas_descend
0.20 ± 6% -0.1 0.14 ± 2% perf-profile.self.cycles-pp.__dquot_alloc_space
0.18 ± 4% -0.1 0.12 ± 6% perf-profile.self.cycles-pp.shmem_inode_acct_blocks
0.21 ± 5% -0.1 0.15 ± 5% perf-profile.self.cycles-pp.vfs_fallocate
0.18 ± 2% -0.1 0.12 ± 3% perf-profile.self.cycles-pp.__filemap_remove_folio
0.17 ± 4% -0.1 0.12 ± 3% perf-profile.self.cycles-pp.folio_unlock
0.20 ± 3% -0.0 0.14 ± 5% perf-profile.self.cycles-pp.shmem_get_folio_gfp
0.16 ± 5% -0.0 0.12 ± 6% perf-profile.self.cycles-pp.__cond_resched
0.17 ± 5% -0.0 0.12 ± 3% perf-profile.self.cycles-pp._raw_spin_lock_irq
0.14 ± 4% -0.0 0.10 ± 5% perf-profile.self.cycles-pp.xas_load
0.15 ± 2% -0.0 0.10 ± 4% perf-profile.self.cycles-pp.get_page_from_freelist
0.16 ± 4% -0.0 0.12 ± 4% perf-profile.self.cycles-pp.alloc_pages_mpol
0.12 ± 6% -0.0 0.08 ± 6% perf-profile.self.cycles-pp.rmqueue
0.13 ± 6% -0.0 0.09 ± 4% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.13 ± 8% -0.0 0.09 ± 4% perf-profile.self.cycles-pp.apparmor_file_permission
0.15 ± 3% -0.0 0.11 ± 3% perf-profile.self.cycles-pp.noop_dirty_folio
0.10 ± 10% -0.0 0.06 ± 17% perf-profile.self.cycles-pp.cap_vm_enough_memory
0.07 ± 8% -0.0 0.03 ± 70% perf-profile.self.cycles-pp.percpu_counter_add_batch
0.12 ± 10% -0.0 0.08 ± 4% perf-profile.self.cycles-pp.__percpu_counter_limited_add
0.12 ± 4% -0.0 0.08 ± 8% perf-profile.self.cycles-pp.folio_add_lru
0.10 ± 3% -0.0 0.06 ± 7% perf-profile.self.cycles-pp.xas_init_marks
0.10 ± 9% -0.0 0.07 perf-profile.self.cycles-pp.xas_start
0.13 ± 3% -0.0 0.10 ± 5% perf-profile.self.cycles-pp.filemap_remove_folio
0.11 ± 6% -0.0 0.08 ± 6% perf-profile.self.cycles-pp.__fsnotify_parent
0.16 ± 4% -0.0 0.12 ± 4% perf-profile.self.cycles-pp.free_unref_page_commit
0.12 ± 4% -0.0 0.08 ± 4% perf-profile.self.cycles-pp.__mod_lruvec_state
0.11 ± 3% -0.0 0.08 ± 7% perf-profile.self.cycles-pp.fallocate64
0.08 ± 6% -0.0 0.04 ± 45% perf-profile.self.cycles-pp.__get_file_rcu
0.11 ± 4% -0.0 0.08 ± 8% perf-profile.self.cycles-pp.__folio_cancel_dirty
0.11 ± 3% -0.0 0.08 ± 6% perf-profile.self.cycles-pp._raw_spin_trylock
0.11 ± 6% -0.0 0.08 ± 6% perf-profile.self.cycles-pp.truncate_cleanup_folio
0.06 ± 7% -0.0 0.03 ± 70% perf-profile.self.cycles-pp.__fget_light
0.17 ± 3% -0.0 0.14 ± 5% perf-profile.self.cycles-pp.__list_del_entry_valid_or_report
0.09 ± 5% -0.0 0.06 ± 7% perf-profile.self.cycles-pp.filemap_get_entry
0.22 ± 6% -0.0 0.19 ± 3% perf-profile.self.cycles-pp.page_counter_uncharge
0.09 ± 7% -0.0 0.06 ± 6% perf-profile.self.cycles-pp.get_pfnblock_flags_mask
0.08 ± 5% -0.0 0.06 ± 8% perf-profile.self.cycles-pp.free_unref_page_prepare
0.07 ± 11% -0.0 0.04 ± 44% perf-profile.self.cycles-pp.shmem_is_huge
0.08 ± 6% -0.0 0.05 ± 7% perf-profile.self.cycles-pp.__x64_sys_fallocate
0.08 ± 7% -0.0 0.06 ± 6% perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
0.09 ± 7% -0.0 0.07 ± 7% perf-profile.self.cycles-pp.filemap_free_folio
0.08 ± 8% -0.0 0.06 perf-profile.self.cycles-pp.shmem_alloc_folio
0.12 ± 6% -0.0 0.10 ± 6% perf-profile.self.cycles-pp.try_charge_memcg
0.07 ± 5% +0.0 0.09 ± 5% perf-profile.self.cycles-pp.propagate_protected_usage
0.24 +0.0 0.27 ± 3% perf-profile.self.cycles-pp.uncharge_folio
0.46 ± 4% +0.4 0.86 ± 5% perf-profile.self.cycles-pp.mem_cgroup_commit_charge
1.38 ± 3% +0.5 1.84 ± 5% perf-profile.self.cycles-pp.get_mem_cgroup_from_mm
0.16 ± 3% +0.7 0.90 ± 2% perf-profile.self.cycles-pp.__count_memcg_events
0.28 ± 3% +0.8 1.06 ± 5% perf-profile.self.cycles-pp.__mem_cgroup_charge
1.86 ± 2% +2.5 4.36 perf-profile.self.cycles-pp.__mod_memcg_lruvec_state




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki