[linus:master] [mm/mmap] 6852c46c78: stress-ng.spawn.ops_per_sec 2.9% improvement

From: kernel test robot
Date: Thu Sep 21 2023 - 15:04:17 EST




Hello,

kernel test robot noticed a 2.9% improvement of stress-ng.spawn.ops_per_sec on:


commit: 6852c46c783d20a4c0153d14d2990040e5e6e47e ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	class: exec_spawn
	test: spawn
	cpufreq_governor: performance


In addition, the commit also has a significant impact on the following tests:

+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.score 6.4% improvement                                                        |
| test machine     | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory           |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | nr_task=100%                                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=shell8                                                                                        |
+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | vm-scalability: vm-scalability.throughput 23.5% improvement                                        |
| test machine     | 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=small-allocs                                                                                  |
+------------------+----------------------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20230921/202309211530.ca9c00a1-oliver.sang@xxxxxxxxx

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
exec_spawn/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/spawn/stress-ng/60s
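
The two lines above pair up position by position: the first names the test axes and the second gives this job's value for each axis. A minimal Python sketch of that pairing follows. The helper name `parse_axes` is hypothetical and not part of any LKP tooling, and the sketch assumes that no value contains a `/`.

```python
def parse_axes(keys_line: str, values_line: str) -> dict[str, str]:
    """Pair slash-separated axis names with their values (hypothetical helper)."""
    keys = keys_line.strip().rstrip(":").split("/")
    values = values_line.strip().split("/")
    if len(keys) != len(values):
        # Assumption: no value contains '/', so both lines split into the same count.
        raise ValueError(f"{len(keys)} keys but {len(values)} values")
    return dict(zip(keys, values))


# Both strings are copied verbatim from the report.
keys_line = (
    "class/compiler/cpufreq_governor/disk/kconfig/nr_threads/"
    "rootfs/tbox_group/test/testcase/testtime:"
)
values_line = (
    "exec_spawn/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/"
    "debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/spawn/stress-ng/60s"
)

params = parse_axes(keys_line, values_line)
for name, value in params.items():
    print(f"{name:17} {value}")
```

Read this way, `tbox_group` is the test machine (`lkp-icl-2sp8`) and `kconfig` is the kernel configuration (`x86_64-rhel-8.3`).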

commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
10609 ± 11% -19.9% 8493 ± 9% numa-meminfo.node0.PageTables
2666 ± 11% -22.4% 2069 ± 9% numa-vmstat.node0.nr_page_table_pages
73.76 -1.5% 72.67 iostat.cpu.system
11.69 +3.3% 12.07 iostat.cpu.user
9706 ± 5% +35.6% 13159 ± 22% turbostat.POLL
86.46 +0.9% 87.20 turbostat.RAMWatt
301.24 ± 14% -16.3% 252.23 ± 5% sched_debug.cfs_rq:/.runnable_avg.stddev
-60827 +81.7% -110508 sched_debug.cfs_rq:/.spread0.avg
1216 ± 13% +33.8% 1627 ± 12% sched_debug.cpu.nr_uninterruptible.max
1282527 +2.9% 1319688 stress-ng.spawn.ops
21375 +2.9% 21994 stress-ng.spawn.ops_per_sec
593917 +46.2% 868090 stress-ng.time.major_page_faults
2.721e+08 +2.9% 2.799e+08 stress-ng.time.minor_page_faults
2943 -1.5% 2899 stress-ng.time.system_time
434.40 +4.3% 453.01 stress-ng.time.user_time
602081 ± 2% +4.2% 627620 proc-vmstat.nr_anon_pages
823789 ± 2% +4.2% 858619 proc-vmstat.nr_inactive_anon
219961 +3.6% 227832 proc-vmstat.nr_mapped
346359 +2.1% 353585 proc-vmstat.nr_shmem
823789 ± 2% +4.2% 858619 proc-vmstat.nr_zone_inactive_anon
1.826e+08 +3.0% 1.881e+08 proc-vmstat.numa_hit
1.825e+08 +3.0% 1.88e+08 proc-vmstat.numa_local
425017 +2.2% 434312 proc-vmstat.pgactivate
1.874e+08 +3.0% 1.931e+08 proc-vmstat.pgalloc_normal
2.767e+08 +3.0% 2.85e+08 proc-vmstat.pgfault
1.857e+08 +3.0% 1.913e+08 proc-vmstat.pgfree
5030 +2.6% 5159 proc-vmstat.thp_split_pmd
0.01 ± 9% +30.2% 0.01 ± 17% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_alloc.alloc_bprm.do_execveat_common
0.54 ± 22% -46.9% 0.29 ± 52% perf-sched.sch_delay.max.ms.__cond_resched.filemap_read.__kernel_read.search_binary_handler.exec_binprm
0.15 ± 80% -67.6% 0.05 ± 41% perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc.__anon_vma_prepare.do_cow_fault.do_fault
0.63 ± 12% -32.5% 0.42 ± 12% perf-sched.sch_delay.max.ms.__cond_resched.remove_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
0.03 ± 2% +15.9% 0.03 ± 4% perf-sched.wait_and_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
4292 ± 5% -14.2% 3683 ± 4% perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
3397 ± 4% +16.8% 3966 ± 4% perf-sched.wait_and_delay.count.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
5346 ± 4% +14.1% 6100 ± 2% perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_exc_page_fault
27421 ± 2% +49.2% 40919 perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
18122 ± 5% -13.9% 15598 ± 3% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
78347 ± 5% -14.6% 66894 ± 3% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.unlink_file_vma
2.02 ± 27% -46.2% 1.09 ± 14% perf-sched.wait_and_delay.max.ms.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
0.01 ± 23% +109.1% 0.02 ± 63% perf-sched.wait_time.avg.ms.__cond_resched.__alloc_pages.pte_alloc_one.__do_fault.do_cow_fault
0.02 ± 6% +68.4% 0.03 ± 29% perf-sched.wait_time.avg.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
0.02 ± 21% +49.0% 0.03 ± 26% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.getname_flags.part.0
0.02 ± 6% +19.5% 0.03 ± 10% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.mprotect_fixup
0.01 ± 3% +20.3% 0.01 ± 7% perf-sched.wait_time.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.04 ± 54% +138.4% 0.10 ± 68% perf-sched.wait_time.max.ms.__cond_resched.down_read_killable.iterate_dir.__x64_sys_getdents64.do_syscall_64
0.08 ± 52% +290.1% 0.33 ± 61% perf-sched.wait_time.max.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
0.03 ± 70% +101.1% 0.06 ± 16% perf-sched.wait_time.max.ms.__cond_resched.down_write.exit_mmap.__mmput.exit_mm
0.54 ± 7% -48.8% 0.27 ± 69% perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.vm_mmap_pgoff.elf_map.load_elf_interp
0.54 ± 3% +18.9% 0.64 ± 16% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open
2.30 ± 18% -22.0% 1.79 ± 8% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
13.82 +1.5% 14.03 perf-stat.i.MPKI
1.93 +0.0 1.96 perf-stat.i.branch-miss-rate%
4.182e+08 +2.3% 4.28e+08 perf-stat.i.branch-misses
2.549e+08 +1.9% 2.598e+08 perf-stat.i.cache-misses
1.545e+09 +2.5% 1.584e+09 perf-stat.i.cache-references
1.86 -2.0% 1.82 perf-stat.i.cpi
945.76 ± 3% -5.3% 895.17 ± 2% perf-stat.i.cycles-between-cache-misses
1.249e+10 +2.7% 1.283e+10 perf-stat.i.dTLB-stores
0.55 +3.0% 0.56 perf-stat.i.ipc
9557 +46.8% 14033 perf-stat.i.major-faults
2025 +2.4% 2074 perf-stat.i.metric.K/sec
996.43 +1.3% 1009 perf-stat.i.metric.M/sec
4413113 +2.7% 4533786 perf-stat.i.minor-faults
12764008 ± 2% +3.7% 13236279 perf-stat.i.node-loads
47197681 +2.6% 48424472 perf-stat.i.node-stores
4422671 +2.8% 4547820 perf-stat.i.page-faults
14.16 +1.6% 14.39 perf-stat.overall.MPKI
1.92 +0.0 1.95 perf-stat.overall.branch-miss-rate%
1.87 -1.8% 1.84 perf-stat.overall.cpi
810.88 -2.8% 788.33 perf-stat.overall.cycles-between-cache-misses
0.53 +1.8% 0.54 perf-stat.overall.ipc
4.094e+08 +2.3% 4.19e+08 perf-stat.ps.branch-misses
1.511e+09 +2.5% 1.549e+09 perf-stat.ps.cache-references
17556 ± 2% +3.9% 18239 perf-stat.ps.cpu-migrations
1.226e+10 +2.7% 1.26e+10 perf-stat.ps.dTLB-stores
9359 +46.0% 13666 perf-stat.ps.major-faults
4325409 +2.8% 4445394 perf-stat.ps.minor-faults
12743809 ± 2% +4.1% 13262041 perf-stat.ps.node-loads
46090814 +2.7% 47343915 perf-stat.ps.node-stores
4334769 +2.9% 4459060 perf-stat.ps.page-faults
17.36 -1.2 16.11 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables
14.04 -1.1 12.96 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
69.96 -0.9 69.02 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
70.01 -0.9 69.08 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
12.28 -0.9 11.37 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma
9.72 -0.8 8.90 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.exit_mmap
10.90 ± 2% -0.8 10.10 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
9.96 -0.8 9.16 perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.exit_mmap.__mmput
7.46 ± 2% -0.8 6.69 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
7.33 ± 2% -0.8 6.56 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
10.95 -0.8 10.20 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.exit_mmap.__mmput.exit_mm
12.30 -0.7 11.60 perf-profile.calltrace.cycles-pp.free_pgtables.exit_mmap.__mmput.exit_mm.do_exit
8.52 ± 2% -0.5 8.03 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.do_vmi_align_munmap
8.72 ± 2% -0.5 8.24 perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
7.36 -0.4 6.92 perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
7.95 -0.4 7.52 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.unmap_region
5.72 ± 2% -0.4 5.30 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
3.96 ± 2% -0.4 3.55 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.mprotect_fixup
8.40 -0.4 7.98 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap
8.12 -0.4 7.71 perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap
4.10 ± 2% -0.4 3.70 perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
8.68 -0.4 8.28 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
5.16 -0.3 4.84 perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
9.61 -0.3 9.30 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
7.33 -0.2 7.14 perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
2.84 -0.2 2.66 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
2.18 -0.2 2.02 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
1.18 -0.1 1.11 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
0.84 ± 3% -0.1 0.77 ± 3% perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.elf_map.load_elf_interp.load_elf_binary.search_binary_handler
0.79 ± 3% -0.1 0.73 ± 3% perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.elf_map.load_elf_interp
0.82 -0.0 0.78 perf-profile.calltrace.cycles-pp.mas_store_prealloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.56 +0.0 0.58 perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap
0.60 +0.0 0.63 perf-profile.calltrace.cycles-pp.__sysconf
0.57 ± 2% +0.0 0.60 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.wait4
0.55 ± 2% +0.0 0.58 perf-profile.calltrace.cycles-pp.kernel_wait4.__do_sys_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
0.57 ± 2% +0.0 0.60 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
0.67 +0.0 0.70 perf-profile.calltrace.cycles-pp.sync_regs.asm_exc_page_fault
0.55 +0.0 0.58 ± 2% perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
0.61 +0.0 0.64 perf-profile.calltrace.cycles-pp.copy_process.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.04 +0.0 1.08 perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.08 +0.0 1.12 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.56 +0.0 0.60 ± 3% perf-profile.calltrace.cycles-pp.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.64 +0.0 0.68 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
0.54 +0.0 0.58 perf-profile.calltrace.cycles-pp.do_set_pte.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
0.54 ± 2% +0.0 0.58 perf-profile.calltrace.cycles-pp.finish_fault.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
0.93 +0.0 0.98 perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
0.94 +0.0 0.99 perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.63 ± 2% +0.1 0.68 perf-profile.calltrace.cycles-pp._compound_head.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
1.06 +0.1 1.12 perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.00 +0.1 1.06 perf-profile.calltrace.cycles-pp._dl_addr
1.07 +0.1 1.12 perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.58 ± 2% +0.1 0.63 ± 2% perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
0.93 +0.1 0.99 perf-profile.calltrace.cycles-pp.kernfs_fop_readdir.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.03 +0.1 1.09 perf-profile.calltrace.cycles-pp.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
1.03 +0.1 1.09 perf-profile.calltrace.cycles-pp.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
0.97 +0.1 1.04 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64
0.97 +0.1 1.04 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
0.96 +0.1 1.03 perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
0.98 +0.1 1.04 perf-profile.calltrace.cycles-pp.getdents64
0.97 +0.1 1.03 perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
1.37 +0.1 1.44 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__clone
1.33 ± 2% +0.1 1.40 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
0.96 +0.1 1.04 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap
1.50 +0.1 1.58 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
1.30 +0.1 1.38 perf-profile.calltrace.cycles-pp.wp_page_copy.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
1.51 +0.1 1.60 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
1.51 +0.1 1.60 perf-profile.calltrace.cycles-pp.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
1.52 +0.1 1.61 perf-profile.calltrace.cycles-pp.execve.__clone
1.52 +0.1 1.60 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.execve.__clone
1.60 +0.1 1.70 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
1.26 ± 2% +0.1 1.36 perf-profile.calltrace.cycles-pp.page_remove_rmap.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
1.20 +0.1 1.30 perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
1.52 +0.1 1.62 perf-profile.calltrace.cycles-pp.next_uptodate_page.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
0.43 ± 44% +0.1 0.55 ± 3% perf-profile.calltrace.cycles-pp.mm_init.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64
1.86 ± 2% +0.2 2.03 perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput
3.24 +0.2 3.42 perf-profile.calltrace.cycles-pp.__clone
2.30 +0.2 2.50 perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput.exit_mm
2.33 +0.2 2.52 perf-profile.calltrace.cycles-pp.tlb_finish_mmu.exit_mmap.__mmput.exit_mm.do_exit
2.64 +0.2 2.84 perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
2.83 +0.2 3.03 perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
3.13 +0.2 3.34 perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap
3.34 +0.2 3.55 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap.__mmput
3.41 +0.2 3.63 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.__mmput.exit_mm
2.02 +0.2 2.24 perf-profile.calltrace.cycles-pp.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
3.62 +0.2 3.86 perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.__mmput.exit_mm.do_exit
4.88 +0.4 5.31 perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
0.00 +0.5 0.51 ± 2% perf-profile.calltrace.cycles-pp.copy_page.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
7.28 +0.6 7.86 perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
7.68 +0.6 8.29 perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
8.90 +0.7 9.59 perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
8.97 +0.7 9.67 perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
9.85 +0.7 10.58 perf-profile.calltrace.cycles-pp.asm_exc_page_fault
39.11 -3.0 36.10 perf-profile.children.cycles-pp.rwsem_optimistic_spin
39.79 -3.0 36.79 perf-profile.children.cycles-pp.rwsem_down_write_slowpath
40.99 -2.9 38.11 perf-profile.children.cycles-pp.down_write
31.42 -2.5 28.90 perf-profile.children.cycles-pp.osq_lock
19.60 -1.2 18.41 perf-profile.children.cycles-pp.unlink_file_vma
21.27 -1.1 20.14 perf-profile.children.cycles-pp.free_pgtables
40.67 -1.0 39.70 perf-profile.children.cycles-pp.mmap_region
41.06 -1.0 40.10 perf-profile.children.cycles-pp.do_mmap
41.28 -0.9 40.34 perf-profile.children.cycles-pp.vm_mmap_pgoff
79.55 -0.8 78.73 perf-profile.children.cycles-pp.do_syscall_64
79.62 -0.8 78.82 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
17.54 -0.7 16.80 perf-profile.children.cycles-pp.vma_prepare
6.88 -0.5 6.38 perf-profile.children.cycles-pp.rwsem_spin_on_owner
24.95 -0.5 24.48 perf-profile.children.cycles-pp.do_vmi_align_munmap
25.10 -0.5 24.64 perf-profile.children.cycles-pp.do_vmi_munmap
21.02 -0.4 20.64 perf-profile.children.cycles-pp.__split_vma
10.25 -0.3 9.94 perf-profile.children.cycles-pp.unmap_region
2.79 ± 4% -0.2 2.54 ± 3% perf-profile.children.cycles-pp.elf_map
0.39 -0.1 0.30 perf-profile.children.cycles-pp.mas_wr_walk
1.50 -0.0 1.46 perf-profile.children.cycles-pp.mas_wr_store_entry
0.11 +0.0 0.12 ± 3% perf-profile.children.cycles-pp.security_file_alloc
0.23 ± 2% +0.0 0.24 ± 2% perf-profile.children.cycles-pp.__percpu_counter_sum
0.16 ± 2% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.init_file
0.16 ± 3% +0.0 0.17 perf-profile.children.cycles-pp.unmap_single_vma
0.25 +0.0 0.27 perf-profile.children.cycles-pp.__memcpy
0.24 ± 3% +0.0 0.26 perf-profile.children.cycles-pp.native_flush_tlb_one_user
0.21 ± 2% +0.0 0.23 perf-profile.children.cycles-pp.mas_push_data
0.16 +0.0 0.18 ± 2% perf-profile.children.cycles-pp.free_unref_page_list
0.14 ± 3% +0.0 0.15 ± 3% perf-profile.children.cycles-pp.padzero
0.38 +0.0 0.40 perf-profile.children.cycles-pp.d_path
0.30 +0.0 0.31 perf-profile.children.cycles-pp.__vm_area_free
0.29 +0.0 0.31 perf-profile.children.cycles-pp.mas_find
0.45 +0.0 0.47 perf-profile.children.cycles-pp.percpu_counter_add_batch
0.34 ± 2% +0.0 0.35 ± 2% perf-profile.children.cycles-pp.find_idlest_group
0.30 +0.0 0.32 perf-profile.children.cycles-pp.update_sg_wakeup_stats
0.36 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.find_idlest_cpu
0.29 ± 2% +0.0 0.31 ± 2% perf-profile.children.cycles-pp.__filemap_get_folio
0.34 +0.0 0.36 perf-profile.children.cycles-pp.link_path_walk
0.44 ± 2% +0.0 0.46 perf-profile.children.cycles-pp.__mem_cgroup_charge
0.41 +0.0 0.43 perf-profile.children.cycles-pp.unlink_anon_vmas
0.31 ± 2% +0.0 0.33 perf-profile.children.cycles-pp.alloc_empty_file
0.25 ± 2% +0.0 0.27 perf-profile.children.cycles-pp.wake_up_new_task
0.41 ± 2% +0.0 0.43 ± 2% perf-profile.children.cycles-pp.__mmdrop
0.35 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.mas_next_slot
0.35 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.flush_tlb_mm_range
0.33 +0.0 0.35 perf-profile.children.cycles-pp.find_vma
0.45 ± 2% +0.0 0.48 ± 2% perf-profile.children.cycles-pp.__rb_erase_color
0.28 ± 2% +0.0 0.30 ± 2% perf-profile.children.cycles-pp.filemap_get_entry
0.27 +0.0 0.29 ± 3% perf-profile.children.cycles-pp.up_read
0.51 +0.0 0.53 perf-profile.children.cycles-pp.mas_store_gfp
0.41 ± 2% +0.0 0.43 perf-profile.children.cycles-pp.free_swap_cache
0.44 +0.0 0.47 perf-profile.children.cycles-pp.___perf_sw_event
0.05 ± 13% +0.0 0.08 ± 14% perf-profile.children.cycles-pp.get_any_partial
0.35 +0.0 0.38 ± 2% perf-profile.children.cycles-pp.mt_find
0.37 ± 2% +0.0 0.39 ± 2% perf-profile.children.cycles-pp.vfs_read
0.33 ± 2% +0.0 0.35 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_augment_rotate
0.50 ± 2% +0.0 0.52 ± 2% perf-profile.children.cycles-pp.copy_page
0.33 ± 2% +0.0 0.36 ± 2% perf-profile.children.cycles-pp.__percpu_counter_init
0.24 ± 5% +0.0 0.26 ± 3% perf-profile.children.cycles-pp.down_read
0.31 +0.0 0.33 ± 2% perf-profile.children.cycles-pp.lru_add_fn
0.37 +0.0 0.40 perf-profile.children.cycles-pp.__open64_nocancel
0.39 ± 2% +0.0 0.42 ± 2% perf-profile.children.cycles-pp.ksys_read
0.37 ± 2% +0.0 0.40 perf-profile.children.cycles-pp.__mod_lruvec_page_state
0.53 +0.0 0.56 perf-profile.children.cycles-pp.__perf_sw_event
0.43 ± 2% +0.0 0.46 perf-profile.children.cycles-pp.free_pages_and_swap_cache
0.66 +0.0 0.68 perf-profile.children.cycles-pp.get_page_from_freelist
0.62 +0.0 0.65 perf-profile.children.cycles-pp.__sysconf
0.55 ± 2% +0.0 0.58 perf-profile.children.cycles-pp.kernel_wait4
0.59 ± 2% +0.0 0.62 perf-profile.children.cycles-pp.wait4
0.47 ± 2% +0.0 0.50 ± 3% perf-profile.children.cycles-pp.try_to_wake_up
0.53 +0.0 0.56 perf-profile.children.cycles-pp.mas_split
0.30 +0.0 0.33 ± 2% perf-profile.children.cycles-pp.folio_add_lru_vma
0.06 ± 11% +0.0 0.09 ± 6% perf-profile.children.cycles-pp.io_schedule
0.51 +0.0 0.54 perf-profile.children.cycles-pp.lock_mm_and_find_vma
0.47 +0.0 0.50 ± 2% perf-profile.children.cycles-pp.__x64_sys_munmap
0.41 +0.0 0.44 ± 3% perf-profile.children.cycles-pp.pcpu_alloc
0.60 +0.0 0.64 perf-profile.children.cycles-pp.mas_wr_bnode
0.06 ± 11% +0.0 0.10 ± 5% perf-profile.children.cycles-pp.folio_wait_bit_common
0.61 ± 2% +0.0 0.65 perf-profile.children.cycles-pp.copy_process
0.48 +0.0 0.51 perf-profile.children.cycles-pp.page_add_file_rmap
0.75 +0.0 0.79 perf-profile.children.cycles-pp.native_irq_return_iret
0.69 +0.0 0.73 perf-profile.children.cycles-pp.vm_area_alloc
0.56 +0.0 0.60 ± 2% perf-profile.children.cycles-pp._raw_spin_lock
0.54 +0.0 0.58 perf-profile.children.cycles-pp.lock_vma_under_rcu
0.46 ± 2% +0.0 0.50 ± 2% perf-profile.children.cycles-pp.__pte_offset_map_lock
0.51 +0.0 0.55 ± 2% perf-profile.children.cycles-pp.mm_init
0.15 ± 2% +0.0 0.19 ± 3% perf-profile.children.cycles-pp.__wake_up_common
0.61 +0.0 0.65 perf-profile.children.cycles-pp.mas_walk
0.67 +0.0 0.71 perf-profile.children.cycles-pp.vma_alloc_folio
0.88 +0.0 0.92 perf-profile.children.cycles-pp.sync_regs
0.83 +0.0 0.87 perf-profile.children.cycles-pp.perf_event_mmap_output
0.56 +0.0 0.60 ± 2% perf-profile.children.cycles-pp.alloc_bprm
0.83 +0.0 0.87 perf-profile.children.cycles-pp.mas_wr_node_store
0.58 +0.0 0.62 perf-profile.children.cycles-pp.__rb_insert_augmented
0.79 +0.0 0.83 ± 2% perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
0.06 ± 8% +0.0 0.10 perf-profile.children.cycles-pp.folio_wake_bit
0.92 +0.0 0.97 perf-profile.children.cycles-pp.__alloc_pages
0.75 +0.0 0.80 perf-profile.children.cycles-pp.mtree_range_walk
1.04 +0.0 1.09 perf-profile.children.cycles-pp.__slab_free
0.77 +0.0 0.82 perf-profile.children.cycles-pp.vm_area_dup
0.55 ± 2% +0.0 0.60 perf-profile.children.cycles-pp.finish_fault
1.01 +0.1 1.06 perf-profile.children.cycles-pp._dl_addr
1.06 +0.1 1.12 perf-profile.children.cycles-pp.perf_iterate_sb
0.75 ± 2% +0.1 0.81 perf-profile.children.cycles-pp._compound_head
1.03 +0.1 1.09 perf-profile.children.cycles-pp.__do_sys_clone
1.03 +0.1 1.09 perf-profile.children.cycles-pp.kernel_clone
0.96 +0.1 1.03 perf-profile.children.cycles-pp.iterate_dir
0.94 +0.1 1.00 perf-profile.children.cycles-pp.kernfs_fop_readdir
0.98 +0.1 1.04 perf-profile.children.cycles-pp.getdents64
0.97 +0.1 1.03 perf-profile.children.cycles-pp.__x64_sys_getdents64
0.36 ± 4% +0.1 0.42 ± 2% perf-profile.children.cycles-pp.__unfreeze_partials
0.01 ±223% +0.1 0.08 ± 6% perf-profile.children.cycles-pp.wake_page_function
0.36 +0.1 0.43 ± 2% perf-profile.children.cycles-pp.filemap_fault
1.81 +0.1 1.89 perf-profile.children.cycles-pp.perf_event_mmap_event
1.42 +0.1 1.49 perf-profile.children.cycles-pp.do_sys_openat2
0.56 +0.1 0.63 perf-profile.children.cycles-pp.__do_fault
1.87 +0.1 1.95 perf-profile.children.cycles-pp.perf_event_mmap
0.76 ± 2% +0.1 0.84 perf-profile.children.cycles-pp.kmem_cache_free_bulk
1.42 +0.1 1.50 perf-profile.children.cycles-pp.__x64_sys_openat
0.32 ± 3% +0.1 0.40 ± 5% perf-profile.children.cycles-pp.get_partial_node
1.40 +0.1 1.48 perf-profile.children.cycles-pp.path_openat
1.42 +0.1 1.50 perf-profile.children.cycles-pp.do_filp_open
0.70 +0.1 0.79 ± 2% perf-profile.children.cycles-pp.lru_add_drain_cpu
0.71 +0.1 0.79 perf-profile.children.cycles-pp.lru_add_drain
1.39 +0.1 1.48 perf-profile.children.cycles-pp.wp_page_copy
1.43 +0.1 1.53 perf-profile.children.cycles-pp.do_set_pte
3.10 +0.1 3.20 perf-profile.children.cycles-pp.mas_store_prealloc
2.00 +0.1 2.11 perf-profile.children.cycles-pp.kmem_cache_alloc
1.35 ± 2% +0.1 1.46 perf-profile.children.cycles-pp.page_remove_rmap
0.94 +0.1 1.04 perf-profile.children.cycles-pp.folio_batch_move_lru
0.76 ± 2% +0.1 0.87 ± 3% perf-profile.children.cycles-pp.___slab_alloc
1.22 +0.1 1.33 perf-profile.children.cycles-pp.mas_destroy
0.98 +0.1 1.11 ± 2% perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
1.11 +0.1 1.24 ± 2% perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
0.96 ± 3% +0.1 1.09 perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
1.38 +0.1 1.52 perf-profile.children.cycles-pp.mas_preallocate
1.43 +0.1 1.58 perf-profile.children.cycles-pp.mas_alloc_nodes
2.02 +0.2 2.19 perf-profile.children.cycles-pp.release_pages
3.25 +0.2 3.43 perf-profile.children.cycles-pp.__clone
3.39 +0.2 3.58 perf-profile.children.cycles-pp.vma_complete
2.40 +0.2 2.60 perf-profile.children.cycles-pp.tlb_batch_pages_flush
4.42 +0.2 4.62 perf-profile.children.cycles-pp.vma_interval_tree_insert
2.82 +0.2 3.02 perf-profile.children.cycles-pp.next_uptodate_page
2.65 +0.2 2.86 perf-profile.children.cycles-pp.tlb_finish_mmu
3.50 +0.2 3.73 perf-profile.children.cycles-pp.zap_pte_range
2.10 +0.2 2.33 perf-profile.children.cycles-pp.do_cow_fault
3.59 +0.2 3.82 perf-profile.children.cycles-pp.zap_pmd_range
3.73 +0.2 3.97 perf-profile.children.cycles-pp.unmap_page_range
1.61 ± 2% +0.3 1.87 ± 2% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
3.99 +0.3 4.26 perf-profile.children.cycles-pp.unmap_vmas
1.33 ± 2% +0.3 1.60 ± 2% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
5.00 +0.3 5.33 perf-profile.children.cycles-pp.filemap_map_pages
5.37 +0.4 5.72 perf-profile.children.cycles-pp.do_read_fault
7.51 +0.6 8.09 perf-profile.children.cycles-pp.do_fault
10.64 +0.8 11.41 perf-profile.children.cycles-pp.__handle_mm_fault
11.19 +0.8 12.00 perf-profile.children.cycles-pp.handle_mm_fault
12.75 +0.9 13.66 perf-profile.children.cycles-pp.do_user_addr_fault
12.82 +0.9 13.73 perf-profile.children.cycles-pp.exc_page_fault
14.29 +1.0 15.27 perf-profile.children.cycles-pp.asm_exc_page_fault
30.81 -2.5 28.33 perf-profile.self.cycles-pp.osq_lock
6.72 -0.5 6.23 perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.34 ± 2% -0.1 0.26 perf-profile.self.cycles-pp.mas_wr_walk
0.09 +0.0 0.10 perf-profile.self.cycles-pp.access_error
0.20 +0.0 0.21 perf-profile.self.cycles-pp.__memcpy
0.11 ± 3% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.__unfreeze_partials
0.38 +0.0 0.40 perf-profile.self.cycles-pp.___perf_sw_event
0.37 +0.0 0.39 perf-profile.self.cycles-pp.free_swap_cache
0.26 +0.0 0.28 ± 4% perf-profile.self.cycles-pp.up_read
0.47 +0.0 0.49 ± 2% perf-profile.self.cycles-pp.kmem_cache_free_bulk
0.32 ± 2% +0.0 0.34 ± 2% perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
0.22 ± 4% +0.0 0.24 ± 3% perf-profile.self.cycles-pp.down_read
0.49 +0.0 0.51 ± 2% perf-profile.self.cycles-pp.copy_page
0.36 +0.0 0.38 perf-profile.self.cycles-pp.do_set_pte
0.37 +0.0 0.40 perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
0.41 +0.0 0.44 ± 3% perf-profile.self.cycles-pp.___slab_alloc
0.06 ± 7% +0.0 0.09 perf-profile.self.cycles-pp.try_to_wake_up
0.49 +0.0 0.52 perf-profile.self.cycles-pp.memcg_slab_post_alloc_hook
0.50 +0.0 0.52 ± 2% perf-profile.self.cycles-pp._raw_spin_lock
0.41 +0.0 0.44 perf-profile.self.cycles-pp.page_add_file_rmap
0.58 ± 2% +0.0 0.62 perf-profile.self.cycles-pp.kmem_cache_alloc
0.75 +0.0 0.79 perf-profile.self.cycles-pp.native_irq_return_iret
0.50 +0.0 0.53 perf-profile.self.cycles-pp.rwsem_down_write_slowpath
0.55 +0.0 0.59 perf-profile.self.cycles-pp.__rb_insert_augmented
0.77 +0.0 0.81 perf-profile.self.cycles-pp.filemap_map_pages
0.87 +0.0 0.91 perf-profile.self.cycles-pp.sync_regs
0.72 +0.0 0.76 perf-profile.self.cycles-pp.mtree_range_walk
1.03 +0.0 1.07 ± 2% perf-profile.self.cycles-pp.__slab_free
0.89 +0.0 0.94 perf-profile.self.cycles-pp._dl_addr
1.10 +0.1 1.15 perf-profile.self.cycles-pp.zap_pte_range
0.69 ± 2% +0.1 0.74 perf-profile.self.cycles-pp._compound_head
1.08 +0.1 1.16 perf-profile.self.cycles-pp.release_pages
1.14 ± 2% +0.1 1.23 perf-profile.self.cycles-pp.page_remove_rmap
1.10 ± 2% +0.1 1.20 perf-profile.self.cycles-pp.down_write
4.32 +0.2 4.52 perf-profile.self.cycles-pp.vma_interval_tree_insert
2.72 +0.2 2.92 perf-profile.self.cycles-pp.next_uptodate_page
1.33 ± 2% +0.3 1.60 ± 2% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
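
The %change column in the table above appears to follow two conventions. For raw counters it is the relative change between the two commits. For metrics that are already percentages (the perf-profile `cycles-pp` rows and `branch-miss-rate%`) it is the plain difference in percentage points. A small Python sketch using values copied from the table follows. The helper names are illustrative only, and because the report prints rounded values, recomputed figures can differ in the last digit.

```python
def relative_change(base: float, new: float) -> float:
    """Relative change in percent, as used for raw counters."""
    return (new - base) / base * 100.0


def point_change(base: float, new: float) -> float:
    """Absolute difference, as used for metrics that are already percentages."""
    return new - base


# (metric, value on 73e791d738, value on 6852c46c78), copied from the table.
counters = [
    ("stress-ng.spawn.ops_per_sec", 21375, 21994),          # report shows +2.9%
    ("stress-ng.time.major_page_faults", 593917, 868090),   # report shows +46.2%
]
percent_metrics = [
    ("perf-profile.children.cycles-pp.rwsem_optimistic_spin", 39.11, 36.10),  # report shows -3.0
    ("perf-profile.children.cycles-pp.osq_lock", 31.42, 28.90),               # report shows -2.5
]

for name, base, new in counters:
    print(f"{name}: {relative_change(base, new):+.1f}%")
for name, base, new in percent_metrics:
    print(f"{name}: {point_change(base, new):+.1f} points")
```

Keeping the two conventions apart matters when reading the lock rows: the drop in `osq_lock` is about 2.5 points of all sampled cycles, not a 2.5% relative reduction.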


***************************************************************************************************
lkp-spr-2sp3: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/300s/lkp-spr-2sp3/shell8/unixbench

commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
128621 ± 44% -35.5% 83021 ± 18% sched_debug.cfs_rq:/.spread0.stddev
68.67 -2.7% 66.83 turbostat.PkgTmp
30.36 +1.2% 30.74 turbostat.RAMWatt
23470 ± 2% +6.4% 24970 ± 2% unixbench.score
799587 ± 3% +11.6% 892414 ± 3% unixbench.time.involuntary_context_switches
83795 ± 8% +39.5% 116914 ± 5% unixbench.time.major_page_faults
1.003e+08 ± 2% +6.5% 1.068e+08 ± 2% unixbench.time.minor_page_faults
328.24 ± 2% +6.1% 348.20 unixbench.time.user_time
894180 ± 2% +6.7% 953859 ± 2% unixbench.workload
70334849 ± 2% +6.3% 74748722 ± 2% proc-vmstat.numa_hit
70084753 ± 2% +6.3% 74504737 ± 2% proc-vmstat.numa_local
54187 ± 2% +6.3% 57597 ± 2% proc-vmstat.pgactivate
74859673 ± 2% +6.3% 79582401 ± 2% proc-vmstat.pgalloc_normal
1.024e+08 ± 2% +6.4% 1.09e+08 ± 2% proc-vmstat.pgfault
74344826 ± 2% +6.4% 79067399 ± 2% proc-vmstat.pgfree
4212400 ± 2% +6.2% 4474364 ± 2% proc-vmstat.pgreuse
3869 ± 2% +7.1% 4145 ± 3% proc-vmstat.thp_fault_alloc
1612101 ± 2% +6.3% 1713519 ± 2% proc-vmstat.unevictable_pgs_culled
5.69 ± 2% +5.1% 5.98 ± 2% perf-stat.i.MPKI
0.94 ± 2% +0.0 0.98 perf-stat.i.branch-miss-rate%
2.934e+08 ± 2% +5.8% 3.105e+08 ± 2% perf-stat.i.branch-misses
2.844e+08 ± 2% +6.1% 3.016e+08 ± 2% perf-stat.i.cache-misses
9.76e+08 ± 2% +5.8% 1.033e+09 ± 2% perf-stat.i.cache-references
2.99 -1.3% 2.95 perf-stat.i.cpi
5.226e+11 -0.9% 5.179e+11 perf-stat.i.cpu-cycles
33882826 ± 2% +7.2% 36317869 ± 2% perf-stat.i.dTLB-store-misses
9.478e+09 ± 2% +5.9% 1.004e+10 ± 2% perf-stat.i.dTLB-stores
0.36 +2.1% 0.36 perf-stat.i.ipc
1312 ± 8% +39.6% 1831 ± 5% perf-stat.i.major-faults
2.33 -0.9% 2.31 perf-stat.i.metric.GHz
513.19 ± 2% +6.0% 544.03 ± 2% perf-stat.i.metric.K/sec
1545804 ± 2% +6.5% 1646269 ± 2% perf-stat.i.minor-faults
60502067 ± 2% +5.4% 63798162 ± 2% perf-stat.i.node-load-misses
14871403 ± 2% +6.2% 15796551 ± 2% perf-stat.i.node-loads
1547117 ± 2% +6.5% 1648101 ± 2% perf-stat.i.page-faults
5.78 ± 2% +5.4% 6.10 ± 2% perf-stat.overall.MPKI
0.87 ± 2% +0.0 0.92 ± 2% perf-stat.overall.branch-miss-rate%
3.10 -1.3% 3.06 perf-stat.overall.cpi
1839 ± 3% -6.6% 1718 ± 2% perf-stat.overall.cycles-between-cache-misses
12132105 ± 2% -5.5% 11462241 ± 2% perf-stat.overall.path-length
2.901e+08 ± 2% +5.8% 3.07e+08 ± 2% perf-stat.ps.branch-misses
2.815e+08 ± 2% +6.1% 2.986e+08 ± 2% perf-stat.ps.cache-misses
9.661e+08 ± 2% +5.8% 1.022e+09 ± 2% perf-stat.ps.cache-references
5.174e+11 -0.9% 5.128e+11 perf-stat.ps.cpu-cycles
33544567 ± 2% +7.2% 35957170 ± 2% perf-stat.ps.dTLB-store-misses
9.376e+09 ± 2% +5.9% 9.933e+09 ± 2% perf-stat.ps.dTLB-stores
1299 ± 8% +39.6% 1813 ± 5% perf-stat.ps.major-faults
1529961 ± 2% +6.5% 1629550 ± 2% perf-stat.ps.minor-faults
59908353 ± 2% +5.4% 63173333 ± 2% perf-stat.ps.node-load-misses
14730891 ± 2% +6.2% 15646714 ± 2% perf-stat.ps.node-loads
1531260 ± 2% +6.5% 1631363 ± 2% perf-stat.ps.page-faults
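
For readers checking the table above: the %change column can be reproduced from the two value columns as a relative difference against the parent commit (73e791d738). Rows whose change has no "%" sign (for example branch-miss-rate%) look like absolute differences in percentage points rather than relative changes. Below is a minimal sketch, not part of the robot's tooling; the helper name pct_change is hypothetical, and since the report prints rounded averages over several runs, the last digit may occasionally differ.

```python
# Hypothetical helper: relative change of the patched commit vs. the parent.
def pct_change(base: float, new: float) -> float:
    return (new - base) / base * 100.0

# (parent 73e791d738, patched 6852c46c78) values copied from the table above.
rows = {
    "unixbench.score": (23470, 24970),
    "unixbench.workload": (894180, 953859),
    "unixbench.time.major_page_faults": (83795, 116914),
}

for name, (base, new) in rows.items():
    print(f"{name}: {pct_change(base, new):+.1f}%")
```

The printed values line up with the +6.4%, +6.7% and +39.5% entries reported for these three rows.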



***************************************************************************************************
lkp-cpl-4sp2: 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/300s/lkp-cpl-4sp2/small-allocs/vm-scalability

commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
42167 +23.5% 52085 vm-scalability.median
55.63 +4.8 60.48 ± 5% vm-scalability.stddev%
9445724 +23.5% 11667563 vm-scalability.throughput
380.73 +5.2% 400.43 vm-scalability.time.elapsed_time
380.73 +5.2% 400.43 vm-scalability.time.elapsed_time.max
106067 +51.4% 160541 vm-scalability.time.involuntary_context_switches
6.307e+08 +23.6% 7.795e+08 vm-scalability.time.minor_page_faults
78306 +4.6% 81916 vm-scalability.time.system_time
777.26 +22.8% 954.12 vm-scalability.time.user_time
2562168 ± 2% -19.3% 2068789 ± 3% vm-scalability.time.voluntary_context_switches
2.838e+09 +23.6% 3.507e+09 vm-scalability.workload
5.815e+09 +10.4% 6.421e+09 ± 3% cpuidle..time
6.66 +0.3 6.99 ± 3% turbostat.C1%
17067 ± 2% -16.7% 14220 ± 2% vmstat.system.cs
0.05 -0.0 0.04 ± 3% mpstat.cpu.all.soft%
0.96 +0.1 1.11 mpstat.cpu.all.usr%
2261 ± 5% +22.8% 2776 ± 6% perf-c2c.HITM.local
1329 ± 8% +20.8% 1605 ± 4% perf-c2c.HITM.remote
18499960 +19.0% 22006663 meminfo.Memused
2990186 +26.8% 3790943 meminfo.PageTables
10350121 +26.1% 13050399 meminfo.SUnreclaim
10515083 +25.7% 13215478 meminfo.Slab
18692470 +19.1% 22260889 meminfo.max_used_kB
407798 ± 17% +67.2% 682010 ± 20% sched_debug.cfs_rq:/.load.max
869895 ± 16% -34.9% 566399 ± 18% sched_debug.cfs_rq:/.spread0.avg
-743993 +53.2% -1139615 sched_debug.cfs_rq:/.spread0.min
17559 ± 2% -12.8% 15312 ± 2% sched_debug.cpu.nr_switches.avg
13015 ± 2% -10.6% 11629 ± 2% sched_debug.cpu.nr_switches.min
4612 -18.2% 3772 ± 10% sched_debug.cpu.nr_switches.stddev
223.38 ± 8% -15.7% 188.21 ± 12% sched_debug.cpu.nr_uninterruptible.max
1169323 +13.4% 1326004 ± 4% numa-numastat.node0.local_node
1239338 +13.9% 1411769 ± 2% numa-numastat.node0.numa_hit
1115777 ± 2% +16.7% 1302136 ± 3% numa-numastat.node1.local_node
1181029 ± 2% +19.6% 1412041 ± 3% numa-numastat.node1.numa_hit
65251 ± 31% +68.4% 109907 ± 7% numa-numastat.node1.other_node
1111909 ± 5% +15.5% 1284183 ± 2% numa-numastat.node2.local_node
1218212 ± 5% +11.5% 1358029 numa-numastat.node2.numa_hit
1285007 ± 2% +25.7% 1615828 ± 2% numa-numastat.node3.local_node
1391326 ± 2% +21.8% 1694543 ± 2% numa-numastat.node3.numa_hit
137025 ± 2% +6.1% 145442 proc-vmstat.nr_active_anon
95233 -1.5% 93787 proc-vmstat.nr_anon_pages
4427600 -2.0% 4339914 proc-vmstat.nr_dirty_background_threshold
8866026 -2.0% 8690440 proc-vmstat.nr_dirty_threshold
44521360 -2.0% 43643224 proc-vmstat.nr_free_pages
747466 +26.8% 948001 proc-vmstat.nr_page_table_pages
168941 ± 2% +4.9% 177164 proc-vmstat.nr_shmem
2587999 +26.1% 3264111 proc-vmstat.nr_slab_unreclaimable
137025 ± 2% +6.1% 145442 proc-vmstat.nr_zone_active_anon
5032565 +16.8% 5879446 proc-vmstat.numa_hit
4684671 +18.1% 5530835 proc-vmstat.numa_local
7168214 +18.3% 8479220 proc-vmstat.pgalloc_normal
6.322e+08 +23.5% 7.81e+08 proc-vmstat.pgfault
6881231 +18.9% 8181469 proc-vmstat.pgfree
3627776 +4.5% 3791232 proc-vmstat.unevictable_pgs_scanned
3935473 ± 5% +34.6% 5297695 ± 16% numa-meminfo.node0.MemUsed
748076 +26.7% 947801 numa-meminfo.node0.PageTables
2597779 +26.1% 3275316 numa-meminfo.node0.SUnreclaim
2625903 +26.1% 3310684 numa-meminfo.node0.Slab
747754 +26.9% 948714 numa-meminfo.node1.PageTables
2580884 +26.5% 3264576 numa-meminfo.node1.SUnreclaim
2652930 +25.8% 3336480 numa-meminfo.node1.Slab
141326 ± 10% -21.1% 111532 ± 19% numa-meminfo.node2.Inactive(anon)
3886953 ± 2% +20.1% 4666698 numa-meminfo.node2.MemUsed
751265 +26.3% 949188 numa-meminfo.node2.PageTables
2591514 +25.6% 3255894 numa-meminfo.node2.SUnreclaim
2622178 +24.9% 3274703 numa-meminfo.node2.Slab
327366 ± 10% +39.0% 454988 ± 20% numa-meminfo.node3.Active
327339 ± 10% +39.0% 454931 ± 20% numa-meminfo.node3.Active(anon)
17238 ± 7% +16.9% 20158 ± 11% numa-meminfo.node3.Mapped
4217474 +22.9% 5184418 ± 3% numa-meminfo.node3.MemUsed
745042 +27.0% 946381 numa-meminfo.node3.PageTables
2586773 +26.0% 3259792 numa-meminfo.node3.SUnreclaim
379407 ± 18% +30.5% 495065 ± 18% numa-meminfo.node3.Shmem
2620922 +25.9% 3298794 numa-meminfo.node3.Slab
0.88 ± 71% -80.8% 0.17 ± 52% perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.13 ± 53% +311.7% 0.53 ± 55% perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
0.20 ± 13% +32.6% 0.26 ± 15% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
0.10 ± 9% +110.7% 0.21 ± 8% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.06 ± 28% +138.4% 0.13 ± 29% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.01 +14036.1% 1.70 ±221% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
9.50 ± 34% +476.1% 54.73 ± 84% perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
2.31 ± 39% -78.8% 0.49 ±216% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
138.30 ± 14% +48.7% 205.63 ± 23% perf-sched.sch_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
13.72 ± 12% +100.6% 27.53 ± 35% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
9.63 ± 18% +114.0% 20.60 ± 10% perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.26 ± 6% +17.9% 2.66 ± 3% perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
213.13 ± 2% -11.2% 189.35 ± 2% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
1074 ± 4% +65.8% 1781 ± 4% perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
39318 ± 7% -17.4% 32472 ± 5% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
171.51 ± 29% +87.8% 322.16 ± 43% perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
2.06 ± 6% +16.5% 2.40 ± 3% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
213.03 ± 2% -11.2% 189.14 ± 2% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
10.68 ± 19% +53.1% 16.34 ± 21% perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
0.02 ± 29% -42.3% 0.01 ± 6% perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
27.23 ± 10% +56.4% 42.58 ± 17% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
72.93 ± 46% +126.7% 165.37 ± 43% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
186850 +26.8% 236835 numa-vmstat.node0.nr_page_table_pages
648927 +26.1% 818461 numa-vmstat.node0.nr_slab_unreclaimable
1239327 +13.9% 1411687 ± 2% numa-vmstat.node0.numa_hit
1169311 +13.4% 1325922 ± 4% numa-vmstat.node0.numa_local
186769 +26.9% 237070 numa-vmstat.node1.nr_page_table_pages
644727 +26.5% 815732 numa-vmstat.node1.nr_slab_unreclaimable
1180822 ± 2% +19.6% 1411883 ± 3% numa-vmstat.node1.numa_hit
1115570 ± 2% +16.7% 1301978 ± 3% numa-vmstat.node1.numa_local
65251 ± 31% +68.4% 109907 ± 7% numa-vmstat.node1.numa_other
35329 ± 10% -21.1% 27874 ± 19% numa-vmstat.node2.nr_inactive_anon
187645 +26.4% 237184 numa-vmstat.node2.nr_page_table_pages
647359 +25.7% 813590 numa-vmstat.node2.nr_slab_unreclaimable
35330 ± 10% -21.1% 27874 ± 19% numa-vmstat.node2.nr_zone_inactive_anon
1218056 ± 5% +11.5% 1357863 numa-vmstat.node2.numa_hit
1111753 ± 5% +15.5% 1284018 ± 2% numa-vmstat.node2.numa_local
81831 ± 10% +38.9% 113699 ± 20% numa-vmstat.node3.nr_active_anon
4376 ± 8% +17.1% 5125 ± 10% numa-vmstat.node3.nr_mapped
186088 +27.1% 236505 numa-vmstat.node3.nr_page_table_pages
94854 ± 18% +30.5% 123744 ± 18% numa-vmstat.node3.nr_shmem
646172 +26.1% 814570 numa-vmstat.node3.nr_slab_unreclaimable
81831 ± 10% +38.9% 113699 ± 20% numa-vmstat.node3.nr_zone_active_anon
1391229 ± 2% +21.8% 1694455 ± 2% numa-vmstat.node3.numa_hit
1284909 ± 2% +25.7% 1615740 ± 2% numa-vmstat.node3.numa_local
2.182e+10 +5.7% 2.307e+10 perf-stat.i.branch-instructions
67.44 -1.4 66.01 perf-stat.i.cache-miss-rate%
1.084e+08 +5.0% 1.138e+08 ± 3% perf-stat.i.cache-references
17046 ± 2% -17.1% 14132 ± 2% perf-stat.i.context-switches
8.89 -2.0% 8.71 perf-stat.i.cpi
488.32 +5.3% 514.19 perf-stat.i.cpu-migrations
0.03 +0.0 0.04 ± 2% perf-stat.i.dTLB-load-miss-rate%
9099499 +14.7% 10433460 ± 2% perf-stat.i.dTLB-load-misses
2.527e+10 +4.3% 2.636e+10 perf-stat.i.dTLB-loads
1732945 +15.5% 2001701 perf-stat.i.dTLB-store-misses
2.005e+09 +12.5% 2.256e+09 perf-stat.i.dTLB-stores
93.66 -3.6 90.02 perf-stat.i.iTLB-load-miss-rate%
431003 ± 5% +42.8% 615338 perf-stat.i.iTLB-loads
9.349e+10 +4.1% 9.729e+10 perf-stat.i.instructions
17597 +4.2% 18333 ± 2% perf-stat.i.instructions-per-iTLB-miss
0.12 +4.5% 0.12 perf-stat.i.ipc
218.73 +5.1% 229.81 perf-stat.i.metric.M/sec
1645267 +16.2% 1911350 perf-stat.i.minor-faults
80.15 +1.7 81.87 perf-stat.i.node-load-miss-rate%
22474764 ± 2% +3.5% 23258402 perf-stat.i.node-load-misses
6215674 ± 3% -6.3% 5824416 perf-stat.i.node-loads
5207478 +17.4% 6113199 perf-stat.i.node-store-misses
831077 +17.9% 979621 perf-stat.i.node-stores
1645267 +16.2% 1911350 perf-stat.i.page-faults
68.37 -2.1 66.31 perf-stat.overall.cache-miss-rate%
8.49 -4.5% 8.11 perf-stat.overall.cpi
0.04 +0.0 0.04 ± 2% perf-stat.overall.dTLB-load-miss-rate%
0.09 +0.0 0.09 perf-stat.overall.dTLB-store-miss-rate%
93.41 -2.5 90.92 perf-stat.overall.iTLB-load-miss-rate%
15175 +3.4% 15687 perf-stat.overall.instructions-per-iTLB-miss
0.12 +4.7% 0.12 perf-stat.overall.ipc
78.20 +1.6 79.79 perf-stat.overall.node-load-miss-rate%
12543 -11.2% 11140 perf-stat.overall.path-length
2.183e+10 +6.1% 2.316e+10 perf-stat.ps.branch-instructions
1.102e+08 +5.6% 1.164e+08 ± 2% perf-stat.ps.cache-references
17070 ± 2% -16.9% 14191 ± 2% perf-stat.ps.context-switches
486.54 +5.7% 514.33 perf-stat.ps.cpu-migrations
9294872 +15.8% 10767066 ± 2% perf-stat.ps.dTLB-load-misses
2.525e+10 +4.5% 2.639e+10 perf-stat.ps.dTLB-loads
1747511 +16.6% 2038370 perf-stat.ps.dTLB-store-misses
2.024e+09 +13.5% 2.298e+09 perf-stat.ps.dTLB-stores
434045 ± 5% +42.8% 619721 perf-stat.ps.iTLB-loads
9.339e+10 +4.3% 9.739e+10 perf-stat.ps.instructions
1658111 +17.4% 1946167 perf-stat.ps.minor-faults
22728635 ± 2% +4.0% 23636858 perf-stat.ps.node-load-misses
6338224 ± 3% -5.5% 5988664 perf-stat.ps.node-loads
5139882 +17.1% 6017185 perf-stat.ps.node-store-misses
840111 +18.9% 998845 perf-stat.ps.node-stores
1658111 +17.4% 1946167 perf-stat.ps.page-faults
3.56e+13 +9.8% 3.907e+13 perf-stat.total.instructions
94.07 -1.6 92.46 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
93.61 -1.6 92.01 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
94.14 -1.6 92.57 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
94.18 -1.6 92.62 perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
94.87 -1.3 93.61 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
94.94 -1.2 93.69 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
94.95 -1.2 93.70 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
94.99 -1.2 93.76 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
96.35 -0.9 95.49 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
96.35 -0.9 95.49 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
96.39 -0.9 95.53 perf-profile.calltrace.cycles-pp.__mmap
0.77 +0.2 0.98 perf-profile.calltrace.cycles-pp.do_rw_once
0.55 +0.3 0.81 ± 2% perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.do_access
0.56 +0.3 0.82 ± 2% perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.do_access
0.80 +0.3 1.12 perf-profile.calltrace.cycles-pp.asm_exc_page_fault.do_access
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
1.33 +0.4 1.70 ± 5% perf-profile.calltrace.cycles-pp.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
1.16 +0.4 1.53 ± 6% perf-profile.calltrace.cycles-pp.vma_migratable.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare
1.07 +0.4 1.48 perf-profile.calltrace.cycles-pp.do_access
94.07 -1.6 92.46 perf-profile.children.cycles-pp.rwsem_optimistic_spin
93.63 -1.6 92.03 perf-profile.children.cycles-pp.osq_lock
94.14 -1.6 92.57 perf-profile.children.cycles-pp.rwsem_down_write_slowpath
94.18 -1.6 92.62 perf-profile.children.cycles-pp.down_write
94.88 -1.3 93.61 perf-profile.children.cycles-pp.mmap_region
94.94 -1.2 93.69 perf-profile.children.cycles-pp.do_mmap
94.95 -1.2 93.70 perf-profile.children.cycles-pp.vm_mmap_pgoff
94.99 -1.2 93.76 perf-profile.children.cycles-pp.ksys_mmap_pgoff
96.38 -0.9 95.52 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
96.38 -0.9 95.52 perf-profile.children.cycles-pp.do_syscall_64
96.39 -0.9 95.54 perf-profile.children.cycles-pp.__mmap
0.38 -0.0 0.35 perf-profile.children.cycles-pp.rwsem_spin_on_owner
0.05 +0.0 0.06 perf-profile.children.cycles-pp.mas_walk
0.05 +0.0 0.06 ± 6% perf-profile.children.cycles-pp.vm_unmapped_area
0.05 ± 8% +0.0 0.07 ± 7% perf-profile.children.cycles-pp.__perf_sw_event
0.06 ± 8% +0.0 0.07 perf-profile.children.cycles-pp.lock_mm_and_find_vma
0.06 ± 7% +0.0 0.08 perf-profile.children.cycles-pp.sync_regs
0.08 +0.0 0.10 ± 3% perf-profile.children.cycles-pp.mtree_range_walk
0.10 +0.0 0.13 ± 3% perf-profile.children.cycles-pp.native_irq_return_iret
0.06 ± 7% +0.0 0.09 ± 7% perf-profile.children.cycles-pp.__unfreeze_partials
0.17 ± 2% +0.0 0.21 ± 8% perf-profile.children.cycles-pp.update_cfs_group
0.12 ± 4% +0.0 0.16 ± 3% perf-profile.children.cycles-pp.lock_vma_under_rcu
0.06 +0.0 0.10 ± 10% perf-profile.children.cycles-pp.kmem_cache_free_bulk
0.28 ± 2% +0.0 0.33 ± 8% perf-profile.children.cycles-pp.task_tick_fair
0.07 ± 7% +0.0 0.11 ± 8% perf-profile.children.cycles-pp.___slab_alloc
0.13 ± 3% +0.0 0.18 ± 6% perf-profile.children.cycles-pp.mas_store_prealloc
0.00 +0.1 0.05 perf-profile.children.cycles-pp.try_to_wake_up
0.00 +0.1 0.05 perf-profile.children.cycles-pp.wake_up_q
0.00 +0.1 0.05 perf-profile.children.cycles-pp.perf_event_mmap
0.00 +0.1 0.05 perf-profile.children.cycles-pp.perf_event_mmap_event
0.07 ± 7% +0.1 0.12 ± 7% perf-profile.children.cycles-pp.mas_destroy
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.___perf_sw_event
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.mt_find
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.mas_empty_area_rev
0.00 +0.1 0.05 ± 8% perf-profile.children.cycles-pp.find_vma
0.10 +0.1 0.16 ± 6% perf-profile.children.cycles-pp.mas_preallocate
0.10 +0.1 0.16 ± 6% perf-profile.children.cycles-pp.mas_alloc_nodes
0.09 +0.1 0.14 ± 6% perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
0.09 +0.1 0.15 ± 7% perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
0.10 ± 4% +0.1 0.16 ± 4% perf-profile.children.cycles-pp.rwsem_wake
0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.00 +0.1 0.07 ± 11% perf-profile.children.cycles-pp.get_partial_node
0.15 ± 5% +0.1 0.22 ± 3% perf-profile.children.cycles-pp.up_write
0.17 ± 2% +0.1 0.24 perf-profile.children.cycles-pp.vma_interval_tree_insert
0.14 ± 6% +0.1 0.25 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.08 +0.1 0.20 ± 6% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
0.28 ± 2% +0.2 0.45 ± 3% perf-profile.children.cycles-pp.__handle_mm_fault
0.34 ± 2% +0.2 0.51 ± 2% perf-profile.children.cycles-pp.handle_mm_fault
2.24 ± 3% +0.2 2.47 ± 5% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
0.90 +0.2 1.15 perf-profile.children.cycles-pp.do_rw_once
0.56 +0.3 0.81 perf-profile.children.cycles-pp.do_user_addr_fault
0.57 +0.3 0.82 perf-profile.children.cycles-pp.exc_page_fault
0.83 +0.3 1.16 perf-profile.children.cycles-pp.asm_exc_page_fault
1.36 +0.4 1.73 ± 5% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
1.39 +0.4 1.78 ± 5% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
1.38 +0.4 1.77 ± 5% perf-profile.children.cycles-pp.exit_to_user_mode_loop
1.38 +0.4 1.77 ± 5% perf-profile.children.cycles-pp.task_work_run
1.36 +0.4 1.75 ± 5% perf-profile.children.cycles-pp.task_numa_work
1.18 +0.4 1.58 ± 5% perf-profile.children.cycles-pp.vma_migratable
1.46 +0.5 1.96 perf-profile.children.cycles-pp.do_access
92.91 -1.6 91.28 perf-profile.self.cycles-pp.osq_lock
0.38 -0.0 0.35 perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.06 ± 7% +0.0 0.08 perf-profile.self.cycles-pp.sync_regs
0.10 +0.0 0.12 ± 4% perf-profile.self.cycles-pp.native_irq_return_iret
0.07 ± 6% +0.0 0.10 ± 3% perf-profile.self.cycles-pp.mtree_range_walk
0.06 +0.0 0.09 ± 5% perf-profile.self.cycles-pp.lock_vma_under_rcu
0.08 ± 5% +0.0 0.11 ± 6% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.17 ± 2% +0.0 0.21 ± 8% perf-profile.self.cycles-pp.update_cfs_group
0.02 ±141% +0.0 0.06 ± 7% perf-profile.self.cycles-pp.rwsem_optimistic_spin
0.00 +0.1 0.05 perf-profile.self.cycles-pp.___perf_sw_event
0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.down_write
0.17 ± 2% +0.1 0.24 perf-profile.self.cycles-pp.vma_interval_tree_insert
0.08 +0.1 0.20 ± 6% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.55 +0.1 0.70 perf-profile.self.cycles-pp.do_access
0.24 ± 3% +0.2 0.39 ± 3% perf-profile.self.cycles-pp.__handle_mm_fault
0.68 +0.2 0.86 perf-profile.self.cycles-pp.do_rw_once
1.16 +0.4 1.56 ± 6% perf-profile.self.cycles-pp.vma_migratable
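
A note on perf-stat.overall.path-length in the vm-scalability table above: the numbers are consistent with path-length being perf-stat.total.instructions divided by vm-scalability.workload, i.e. instructions retired per unit of work. The report does not state this definition, so treat it as an inference from the data. A minimal sketch under that assumption (the function name path_length is hypothetical):

```python
# Assumption: path-length = total instructions / workload (inferred, not documented here).
def path_length(total_instructions: float, workload: float) -> float:
    return total_instructions / workload

# Rounded values copied from the table: (parent, patched).
instructions = (3.56e13, 3.907e13)   # perf-stat.total.instructions
workload = (2.838e9, 3.507e9)        # vm-scalability.workload

base = path_length(instructions[0], workload[0])
new = path_length(instructions[1], workload[1])
change = (new - base) / base * 100.0

print(f"parent:  {base:.0f}")
print(f"patched: {new:.0f}")
print(f"change:  {change:+.1f}%")
```

With the rounded inputs this gives roughly 12544 and 11141, close to the reported 12543 and 11140, and a change of about -11.2%. Read this way, the patched kernel executes fewer instructions per unit of work, which fits the lower share of cycles spent in osq_lock.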





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki