[linus:master] [vfs] 93faf426e3: stress-ng.dynlib.ops_per_sec 7.2% improvement

From: kernel test robot
Date: Mon Jan 29 2024 - 03:06:03 EST




Hello,

kernel test robot noticed a 7.2% improvement of stress-ng.dynlib.ops_per_sec on:


commit: 93faf426e3cc000c95f1a5d3510b77ce99adac52 ("vfs: shave work on failed file open")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

nr_threads: 10%
disk: 1HDD
testtime: 60s
fs: ext4
class: os
test: dynlib
cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240129/202401291500.8546fbc3-oliver.sang@xxxxxxxxx

=========================================================================================
class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
os/gcc-12/performance/1HDD/ext4/x86_64-rhel-8.3/10%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp7/dynlib/stress-ng/60s

commit:
6036c5f131 ("fs: simplify misleading code to remove ambiguity regarding ihold()/iput()")
93faf426e3 ("vfs: shave work on failed file open")

6036c5f131752689 93faf426e3cc000c95f1a5d3510
---------------- ---------------------------
%stddev %change %stddev
\ | \
8.49 -0.9% 8.41 iostat.cpu.system
0.61 -0.2 0.39 mpstat.cpu.all.soft%
55.50 ± 11% -44.7% 30.67 ± 21% perf-c2c.DRAM.local
165542 ± 2% -9.7% 149471 numa-meminfo.node1.Active
161615 -9.0% 147100 ± 2% numa-meminfo.node1.Active(anon)
40405 -9.0% 36775 ± 2% numa-vmstat.node1.nr_active_anon
40405 -9.0% 36775 ± 2% numa-vmstat.node1.nr_zone_active_anon
51245 ± 35% -42.9% 29277 ± 7% sched_debug.cfs_rq:/.avg_vruntime.max
9910 ± 54% -48.0% 5156 ± 8% sched_debug.cfs_rq:/.avg_vruntime.stddev
51247 ± 35% -42.9% 29277 ± 7% sched_debug.cfs_rq:/.min_vruntime.max
9911 ± 54% -48.0% 5156 ± 8% sched_debug.cfs_rq:/.min_vruntime.stddev
249.51 -6.7% 232.83 stress-ng.dynlib.nanosecs_per_dlsym_lookup
287979 +7.2% 308763 stress-ng.dynlib.ops
4799 +7.2% 5146 stress-ng.dynlib.ops_per_sec
2343 ± 5% -39.9% 1409 ± 5% stress-ng.time.involuntary_context_switches
20176141 +7.4% 21671738 stress-ng.time.minor_page_faults
535.17 +2.6% 548.83 stress-ng.time.percent_of_cpu_this_job_got
291.91 +2.2% 298.23 stress-ng.time.system_time
40770 -7.3% 37774 ± 2% proc-vmstat.nr_active_anon
3098 ± 2% +3.4% 3204 proc-vmstat.nr_inactive_file
34934 -5.1% 33139 proc-vmstat.nr_mapped
70157 -6.1% 65866 proc-vmstat.nr_shmem
42286 -3.3% 40888 proc-vmstat.nr_slab_unreclaimable
40770 -7.3% 37774 ± 2% proc-vmstat.nr_zone_active_anon
3098 ± 2% +3.4% 3204 proc-vmstat.nr_zone_inactive_file
15514950 -3.9% 14905053 proc-vmstat.numa_hit
15448751 -3.9% 14838822 proc-vmstat.numa_local
124008 -5.2% 117617 proc-vmstat.pgactivate
18884464 -11.2% 16772814 proc-vmstat.pgalloc_normal
20593501 +7.2% 22078068 proc-vmstat.pgfault
18735781 -11.2% 16638429 proc-vmstat.pgfree
0.36 ±179% -94.5% 0.02 ± 44% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
165.10 ±215% -97.9% 3.40 ± 37% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.11 ± 40% -100.0% 0.00 perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.13 ± 25% -39.4% 0.08 ± 21% perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
23.83 ± 16% -100.0% 0.00 perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
545.33 ± 4% -14.0% 468.83 ± 7% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
1.19 ± 41% -100.0% 0.00 perf-sched.wait_and_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.46 ±124% -99.3% 0.00 ±146% perf-sched.wait_time.avg.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open
0.09 ± 63% -92.8% 0.01 ± 77% perf-sched.wait_time.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.11 ± 23% -37.4% 0.07 ± 20% perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
2.02 ± 6% -29.8% 1.42 ± 44% perf-sched.wait_time.avg.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.56 ±104% -99.3% 0.00 ±156% perf-sched.wait_time.max.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open
0.44 ± 81% -98.5% 0.01 ± 76% perf-sched.wait_time.max.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
4.02 ± 6% -12.3% 3.53 ± 8% perf-sched.wait_time.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
1.18 ± 42% -63.7% 0.43 ± 83% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
3.24 ± 22% -44.2% 1.81 ± 19% perf-sched.wait_time.max.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
4.05 ± 6% -29.8% 2.84 ± 44% perf-sched.wait_time.max.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.58 -12.8% 0.51 ± 3% perf-stat.i.MPKI
9185336 ± 2% -11.2% 8152378 ± 3% perf-stat.i.cache-misses
51941695 -13.4% 44958941 perf-stat.i.cache-references
1.29 -2.8% 1.26 perf-stat.i.cpi
2.039e+10 -0.9% 2.022e+10 perf-stat.i.cpu-cycles
107.66 ± 3% -8.6% 98.35 ± 5% perf-stat.i.cpu-migrations
2258 +11.1% 2508 ± 3% perf-stat.i.cycles-between-cache-misses
0.05 ± 2% +0.0 0.06 ± 2% perf-stat.i.dTLB-load-miss-rate%
2224314 ± 2% +5.6% 2349507 ± 2% perf-stat.i.dTLB-load-misses
0.08 +0.0 0.08 perf-stat.i.dTLB-store-miss-rate%
1755339 +7.6% 1889357 perf-stat.i.dTLB-store-misses
0.78 +2.8% 0.80 perf-stat.i.ipc
0.32 -0.8% 0.32 perf-stat.i.metric.GHz
965.70 -11.8% 851.51 perf-stat.i.metric.K/sec
326896 +7.3% 350610 perf-stat.i.minor-faults
89.04 +3.5 92.53 perf-stat.i.node-load-miss-rate%
617886 ± 7% -34.0% 407524 ± 12% perf-stat.i.node-loads
49.79 ± 3% +6.2 56.00 perf-stat.i.node-store-miss-rate%
1915596 ± 3% -9.6% 1731427 ± 4% perf-stat.i.node-store-misses
2010814 ± 4% -29.8% 1410941 ± 2% perf-stat.i.node-stores
327425 +7.3% 351195 perf-stat.i.page-faults
0.58 -13.1% 0.51 ± 3% perf-stat.overall.MPKI
1.30 -2.9% 1.26 perf-stat.overall.cpi
2219 +11.8% 2482 ± 3% perf-stat.overall.cycles-between-cache-misses
0.05 ± 2% +0.0 0.06 ± 2% perf-stat.overall.dTLB-load-miss-rate%
0.08 +0.0 0.08 perf-stat.overall.dTLB-store-miss-rate%
0.77 +3.0% 0.79 perf-stat.overall.ipc
87.69 +5.5 93.21 ± 3% perf-stat.overall.node-load-miss-rate%
48.82 ± 3% +6.3 55.14 ± 2% perf-stat.overall.node-store-miss-rate%
9052155 ± 2% -11.3% 8027092 ± 3% perf-stat.ps.cache-misses
51114488 -13.4% 44240106 perf-stat.ps.cache-references
2.008e+10 -0.9% 1.991e+10 perf-stat.ps.cpu-cycles
106.26 ± 3% -8.7% 97.05 ± 5% perf-stat.ps.cpu-migrations
2190693 ± 2% +5.6% 2313598 ± 2% perf-stat.ps.dTLB-load-misses
1728462 +7.6% 1860472 perf-stat.ps.dTLB-store-misses
321900 +7.3% 345241 perf-stat.ps.minor-faults
621192 ± 7% -34.3% 407816 ± 12% perf-stat.ps.node-loads
1885823 ± 3% -9.6% 1703871 ± 4% perf-stat.ps.node-store-misses
1978205 ± 4% -30.0% 1385178 ± 2% perf-stat.ps.node-stores
322420 +7.3% 345817 perf-stat.ps.page-faults
6.48 -4.9 1.63 ± 4% perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.66 -4.8 1.81 ± 4% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.41 -4.8 1.57 ± 4% perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.06 -4.5 1.53 ± 4% perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
5.03 -3.6 1.41 ± 5% perf-profile.calltrace.cycles-pp.__fput.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
9.04 ± 2% -2.3 6.78 perf-profile.calltrace.cycles-pp.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
6.84 ± 2% -1.6 5.24 perf-profile.calltrace.cycles-pp.init_file.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2
72.38 -0.9 71.44 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
72.16 -0.9 71.21 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.99 ± 3% -0.7 1.31 ± 3% perf-profile.calltrace.cycles-pp.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2
0.65 ± 6% -0.4 0.27 ±100% perf-profile.calltrace.cycles-pp.__do_softirq.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state
0.69 ± 5% -0.3 0.38 ± 70% perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter
1.37 ± 4% -0.1 1.22 ± 6% perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
1.45 ± 4% -0.1 1.32 ± 6% perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
1.69 ± 3% -0.1 1.57 ± 6% perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.74 ± 5% -0.1 0.63 ± 9% perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
0.67 ± 3% +0.1 0.72 perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
0.94 ± 3% +0.1 0.99 ± 2% perf-profile.calltrace.cycles-pp.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
0.94 ± 3% +0.1 1.01 perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
0.75 ± 4% +0.1 0.83 ± 2% perf-profile.calltrace.cycles-pp.rcu_segcblist_enqueue.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
0.89 ± 3% +0.1 0.97 ± 2% perf-profile.calltrace.cycles-pp.up_write.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
1.06 ± 4% +0.1 1.14 ± 3% perf-profile.calltrace.cycles-pp.vma_expand.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.10 ± 2% +0.1 1.20 ± 2% perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap
1.06 ± 4% +0.1 1.16 ± 2% perf-profile.calltrace.cycles-pp.link_path_walk.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
1.74 +0.1 1.88 perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap
2.44 +0.2 2.58 ± 2% perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
2.43 ± 2% +0.2 2.60 perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap
2.79 +0.2 2.96 ± 3% perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
2.84 +0.2 3.01 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe._dl_catch_exception
2.78 +0.2 2.95 ± 3% perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
2.17 ± 2% +0.2 2.34 perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region
2.82 +0.2 3.00 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
2.91 ± 2% +0.2 3.09 ± 3% perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.58 ± 6% +0.2 1.76 ± 6% perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
2.22 ± 2% +0.2 2.40 perf-profile.calltrace.cycles-pp.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap
0.26 ±100% +0.3 0.54 ± 3% perf-profile.calltrace.cycles-pp.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
1.86 ± 4% +0.3 2.14 ± 4% perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
3.32 ± 2% +0.3 3.60 ± 3% perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
2.44 ± 3% +0.3 2.73 ± 4% perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
4.31 +0.3 4.62 perf-profile.calltrace.cycles-pp.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
5.40 +0.3 5.73 ± 3% perf-profile.calltrace.cycles-pp.next_uptodate_folio.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
8.33 ± 3% +0.4 8.68 ± 4% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
8.40 ± 3% +0.4 8.75 ± 4% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
7.06 +0.4 7.47 ± 3% perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
7.11 +0.4 7.53 ± 3% perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
8.33 +0.4 8.78 ± 2% perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
1.94 ± 12% +0.5 2.40 ± 3% perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
9.37 +0.5 9.84 ± 2% perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
15.46 +0.5 15.94 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
9.70 +0.5 10.20 perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
10.75 +0.5 11.30 perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
10.79 +0.5 11.34 perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
7.83 +0.6 8.43 perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
11.55 +0.6 12.16 perf-profile.calltrace.cycles-pp.asm_exc_page_fault
20.74 +0.7 21.44 perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
20.88 +0.7 21.58 perf-profile.calltrace.cycles-pp.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
20.95 +0.7 21.67 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
20.95 +0.7 21.67 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.00 +0.7 0.74 ± 5% perf-profile.calltrace.cycles-pp.kmem_cache_free.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
3.49 ± 2% +0.8 4.31 perf-profile.calltrace.cycles-pp.apparmor_file_alloc_security.security_file_alloc.init_file.alloc_empty_file.path_openat
3.96 +0.8 4.78 perf-profile.calltrace.cycles-pp.security_file_alloc.init_file.alloc_empty_file.path_openat.do_filp_open
11.55 +0.9 12.50 ± 2% perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff
11.78 +1.0 12.74 ± 2% perf-profile.calltrace.cycles-pp.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
23.77 +1.3 25.09 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
21.46 +1.3 22.79 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
22.25 +1.3 23.58 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
23.91 +1.3 25.25 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
14.68 +1.5 16.19 perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
14.99 +1.6 16.54 perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
17.95 +1.7 19.64 perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
18.05 +1.7 19.74 perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.00 +2.5 2.54 ± 5% perf-profile.calltrace.cycles-pp.apparmor_file_free_security.security_file_free.release_empty_file.path_openat.do_filp_open
0.00 +2.6 2.56 ± 5% perf-profile.calltrace.cycles-pp.security_file_free.release_empty_file.path_openat.do_filp_open.do_sys_openat2
0.00 +3.2 3.21 ± 4% perf-profile.calltrace.cycles-pp.release_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
6.53 -4.9 1.67 ± 4% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
6.44 -4.9 1.58 ± 4% perf-profile.children.cycles-pp.exit_to_user_mode_loop
6.72 -4.8 1.88 ± 3% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
6.10 -4.6 1.54 ± 4% perf-profile.children.cycles-pp.task_work_run
5.05 -3.6 1.42 ± 5% perf-profile.children.cycles-pp.__fput
7.18 ± 2% -2.6 4.60 ± 4% perf-profile.children.cycles-pp.__do_softirq
6.77 ± 2% -2.6 4.20 ± 4% perf-profile.children.cycles-pp.rcu_core
6.72 ± 2% -2.6 4.17 ± 4% perf-profile.children.cycles-pp.rcu_do_batch
7.08 ± 2% -2.5 4.59 ± 3% perf-profile.children.cycles-pp.__irq_exit_rcu
9.07 ± 2% -2.3 6.81 perf-profile.children.cycles-pp.alloc_empty_file
10.42 -2.1 8.34 ± 4% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
10.17 -2.1 8.09 ± 4% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
6.86 ± 2% -1.6 5.26 perf-profile.children.cycles-pp.init_file
3.66 ± 4% -0.9 2.80 ± 4% perf-profile.children.cycles-pp.apparmor_file_free_security
3.70 ± 4% -0.9 2.84 ± 5% perf-profile.children.cycles-pp.security_file_free
1.21 ± 3% -0.7 0.50 ± 7% perf-profile.children.cycles-pp.rcu_cblist_dequeue
1.55 ± 2% -0.6 0.94 ± 4% perf-profile.children.cycles-pp.___slab_alloc
4.97 ± 2% -0.6 4.38 ± 2% perf-profile.children.cycles-pp.kmem_cache_alloc
2.62 ± 2% -0.6 2.05 ± 4% perf-profile.children.cycles-pp.__slab_free
2.26 -0.5 1.71 perf-profile.children.cycles-pp.__call_rcu_common
0.58 ± 6% -0.5 0.10 ± 19% perf-profile.children.cycles-pp.file_free_rcu
0.63 ± 2% -0.4 0.18 ± 4% perf-profile.children.cycles-pp.fput
0.38 ± 4% -0.3 0.08 ± 12% perf-profile.children.cycles-pp.task_work_add
0.66 ± 2% -0.3 0.37 ± 7% perf-profile.children.cycles-pp.allocate_slab
0.46 ± 3% -0.2 0.27 ± 8% perf-profile.children.cycles-pp.shuffle_freelist
0.29 ± 5% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.inc_slabs_node
0.18 ± 5% -0.1 0.05 ± 47% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.17 ± 10% -0.1 0.07 ± 48% perf-profile.children.cycles-pp.smpboot_thread_fn
0.15 ± 10% -0.1 0.06 ± 51% perf-profile.children.cycles-pp.run_ksoftirqd
0.20 ± 3% -0.1 0.12 ± 9% perf-profile.children.cycles-pp.rcu_nocb_try_bypass
0.20 ± 7% -0.1 0.13 ± 5% perf-profile.children.cycles-pp.__unfreeze_partials
1.16 ± 2% -0.1 1.10 perf-profile.children.cycles-pp.rcu_segcblist_enqueue
0.29 ± 5% -0.1 0.24 ± 5% perf-profile.children.cycles-pp.refill_obj_stock
0.22 ± 4% -0.0 0.17 ± 11% perf-profile.children.cycles-pp.get_page_from_freelist
0.14 ± 10% -0.0 0.09 ± 9% perf-profile.children.cycles-pp.setup_object
0.18 ± 7% -0.0 0.15 ± 8% perf-profile.children.cycles-pp.__kmem_cache_alloc_node
0.09 ± 14% -0.0 0.06 ± 11% perf-profile.children.cycles-pp.free_unref_page
0.09 ± 7% +0.0 0.11 ± 6% perf-profile.children.cycles-pp.cp_new_stat
0.20 ± 9% +0.0 0.24 ± 9% perf-profile.children.cycles-pp.generic_file_mmap
0.53 ± 4% +0.0 0.57 ± 2% perf-profile.children.cycles-pp.lock_vma_under_rcu
0.27 ± 5% +0.0 0.31 ± 6% perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
0.16 ± 12% +0.0 0.21 ± 7% perf-profile.children.cycles-pp.put_unused_fd
0.24 ± 6% +0.1 0.29 ± 7% perf-profile.children.cycles-pp.path_init
0.26 ± 7% +0.1 0.31 ± 10% perf-profile.children.cycles-pp.inode_permission
0.01 ±223% +0.1 0.07 ± 11% perf-profile.children.cycles-pp.__irqentry_text_end
0.62 ± 3% +0.1 0.67 ± 3% perf-profile.children.cycles-pp.__rb_insert_augmented
0.11 ± 31% +0.1 0.18 ± 17% perf-profile.children.cycles-pp.tick_sched_do_timer
1.06 ± 4% +0.1 1.15 ± 3% perf-profile.children.cycles-pp.vma_expand
0.12 ± 22% +0.1 0.21 ± 28% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
1.14 ± 2% +0.1 1.24 ± 2% perf-profile.children.cycles-pp.release_pages
1.09 ± 4% +0.1 1.19 ± 2% perf-profile.children.cycles-pp.link_path_walk
2.12 ± 2% +0.1 2.24 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_remove
0.51 ± 5% +0.1 0.64 ± 4% perf-profile.children.cycles-pp.percpu_counter_add_batch
1.75 +0.1 1.90 perf-profile.children.cycles-pp.tlb_batch_pages_flush
3.61 ± 2% +0.2 3.77 ± 2% perf-profile.children.cycles-pp._dl_catch_exception
2.66 ± 2% +0.2 2.83 perf-profile.children.cycles-pp.flush_tlb_mm_range
2.91 ± 2% +0.2 3.09 ± 3% perf-profile.children.cycles-pp.mprotect_fixup
3.36 +0.2 3.54 perf-profile.children.cycles-pp.vma_complete
2.28 +0.2 2.46 perf-profile.children.cycles-pp.kmem_cache_free
2.32 ± 2% +0.2 2.51 perf-profile.children.cycles-pp.native_flush_tlb_one_user
3.38 +0.2 3.58 ± 2% perf-profile.children.cycles-pp.__x64_sys_mprotect
3.38 +0.2 3.58 ± 2% perf-profile.children.cycles-pp.do_mprotect_pkey
2.41 ± 2% +0.2 2.62 perf-profile.children.cycles-pp.flush_tlb_func
3.09 ± 2% +0.2 3.33 ± 2% perf-profile.children.cycles-pp.up_write
3.12 ± 2% +0.3 3.37 ± 3% perf-profile.children.cycles-pp.vma_interval_tree_insert
4.52 +0.3 4.83 perf-profile.children.cycles-pp.tlb_finish_mmu
0.21 ± 62% +0.4 0.57 ± 60% perf-profile.children.cycles-pp.tick_irq_enter
5.76 +0.4 6.13 ± 3% perf-profile.children.cycles-pp.next_uptodate_folio
4.70 ± 2% +0.5 5.15 ± 3% perf-profile.children.cycles-pp.vma_prepare
7.52 +0.5 7.98 ± 2% perf-profile.children.cycles-pp.filemap_map_pages
7.56 +0.5 8.03 ± 2% perf-profile.children.cycles-pp.do_read_fault
8.78 +0.5 9.27 ± 2% perf-profile.children.cycles-pp.do_fault
9.84 +0.5 10.37 perf-profile.children.cycles-pp.__handle_mm_fault
10.18 +0.6 10.74 perf-profile.children.cycles-pp.handle_mm_fault
11.30 +0.6 11.90 perf-profile.children.cycles-pp.exc_page_fault
11.27 +0.6 11.87 perf-profile.children.cycles-pp.do_user_addr_fault
12.10 +0.7 12.76 perf-profile.children.cycles-pp.asm_exc_page_fault
20.95 +0.7 21.67 perf-profile.children.cycles-pp.__vm_munmap
20.95 +0.7 21.67 perf-profile.children.cycles-pp.__x64_sys_munmap
10.11 +0.7 10.85 perf-profile.children.cycles-pp.__split_vma
18.12 +0.8 18.88 perf-profile.children.cycles-pp.unmap_region
3.51 ± 2% +0.8 4.34 perf-profile.children.cycles-pp.apparmor_file_alloc_security
3.98 +0.8 4.81 perf-profile.children.cycles-pp.security_file_alloc
24.22 +1.3 25.56 perf-profile.children.cycles-pp.vm_mmap_pgoff
22.69 +1.3 24.04 perf-profile.children.cycles-pp.do_mmap
21.92 +1.4 23.27 perf-profile.children.cycles-pp.mmap_region
23.92 +1.4 25.27 perf-profile.children.cycles-pp.ksys_mmap_pgoff
14.74 +1.5 16.27 perf-profile.children.cycles-pp.path_openat
15.02 +1.6 16.58 perf-profile.children.cycles-pp.do_filp_open
32.60 +1.7 34.26 perf-profile.children.cycles-pp.do_vmi_align_munmap
32.93 +1.7 34.60 perf-profile.children.cycles-pp.do_vmi_munmap
18.02 +1.7 19.71 perf-profile.children.cycles-pp.do_sys_openat2
18.07 +1.7 19.77 perf-profile.children.cycles-pp.__x64_sys_openat
0.00 +3.2 3.22 ± 4% perf-profile.children.cycles-pp.release_empty_file
2.63 ± 4% -2.2 0.41 ± 6% perf-profile.self.cycles-pp.init_file
3.44 ± 4% -0.8 2.68 ± 4% perf-profile.self.cycles-pp.apparmor_file_free_security
1.20 ± 3% -0.7 0.49 ± 7% perf-profile.self.cycles-pp.rcu_cblist_dequeue
2.54 ± 2% -0.5 2.01 ± 4% perf-profile.self.cycles-pp.__slab_free
0.57 ± 6% -0.5 0.09 ± 13% perf-profile.self.cycles-pp.file_free_rcu
0.78 ± 3% -0.3 0.44 ± 4% perf-profile.self.cycles-pp.__call_rcu_common
0.30 ± 6% -0.2 0.07 ± 14% perf-profile.self.cycles-pp.task_work_add
0.42 ± 6% -0.1 0.28 ± 3% perf-profile.self.cycles-pp.___slab_alloc
0.28 ± 5% -0.1 0.14 ± 4% perf-profile.self.cycles-pp.inc_slabs_node
0.34 ± 6% -0.1 0.20 ± 7% perf-profile.self.cycles-pp.shuffle_freelist
0.23 ± 3% -0.1 0.10 ± 12% perf-profile.self.cycles-pp.fput
0.16 ± 5% -0.1 0.04 ± 71% perf-profile.self.cycles-pp._raw_spin_lock_irq
0.14 ± 4% -0.1 0.08 ± 14% perf-profile.self.cycles-pp.rcu_nocb_try_bypass
1.13 ± 2% -0.1 1.08 perf-profile.self.cycles-pp.rcu_segcblist_enqueue
0.27 ± 4% -0.0 0.22 ± 6% perf-profile.self.cycles-pp.refill_obj_stock
0.10 ± 8% -0.0 0.06 ± 6% perf-profile.self.cycles-pp.__unfreeze_partials
0.11 ± 8% -0.0 0.08 ± 10% perf-profile.self.cycles-pp.rcu_do_batch
0.11 ± 4% +0.0 0.14 ± 10% perf-profile.self.cycles-pp.mas_walk
0.07 ± 16% +0.0 0.10 ± 6% perf-profile.self.cycles-pp.atime_needs_update
0.21 ± 5% +0.0 0.25 ± 6% perf-profile.self.cycles-pp.vm_area_dup
0.89 +0.0 0.93 perf-profile.self.cycles-pp.zap_pte_range
0.40 ± 5% +0.0 0.44 ± 2% perf-profile.self.cycles-pp.__split_vma
0.44 ± 3% +0.0 0.48 ± 4% perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
0.15 ± 8% +0.0 0.20 ± 7% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.17 ± 11% +0.0 0.22 ± 7% perf-profile.self.cycles-pp.path_init
0.52 ± 3% +0.0 0.57 ± 3% perf-profile.self.cycles-pp.__rb_insert_augmented
0.20 ± 8% +0.1 0.26 ± 4% perf-profile.self.cycles-pp.get_obj_cgroup_from_current
0.01 ±223% +0.1 0.06 ± 14% perf-profile.self.cycles-pp.__irqentry_text_end
0.49 ± 4% +0.1 0.55 ± 4% perf-profile.self.cycles-pp.free_swap_cache
0.60 ± 5% +0.1 0.66 ± 3% perf-profile.self.cycles-pp.mmap_region
0.74 ± 3% +0.1 0.81 perf-profile.self.cycles-pp._raw_spin_lock
0.08 ± 53% +0.1 0.15 ± 20% perf-profile.self.cycles-pp.tick_sched_do_timer
0.11 ± 24% +0.1 0.19 ± 29% perf-profile.self.cycles-pp.ktime_get_update_offsets_now
0.35 ± 11% +0.1 0.44 ± 5% perf-profile.self.cycles-pp.rwsem_down_write_slowpath
0.44 ± 7% +0.1 0.58 ± 3% perf-profile.self.cycles-pp.percpu_counter_add_batch
1.33 ± 2% +0.2 1.48 ± 3% perf-profile.self.cycles-pp.down_write
1.92 ± 2% +0.2 2.09 ± 2% perf-profile.self.cycles-pp.vma_interval_tree_remove
2.29 ± 2% +0.2 2.48 perf-profile.self.cycles-pp.native_flush_tlb_one_user
1.34 +0.2 1.58 ± 2% perf-profile.self.cycles-pp.kmem_cache_free
2.90 ± 2% +0.2 3.14 ± 2% perf-profile.self.cycles-pp.up_write
0.00 +0.3 0.26 ± 3% perf-profile.self.cycles-pp.release_empty_file
2.89 +0.3 3.20 ± 3% perf-profile.self.cycles-pp.vma_interval_tree_insert
2.54 ± 3% +0.3 2.85 ± 4% perf-profile.self.cycles-pp.rwsem_spin_on_owner
5.28 +0.4 5.73 ± 3% perf-profile.self.cycles-pp.next_uptodate_folio
3.22 +0.9 4.07 perf-profile.self.cycles-pp.apparmor_file_alloc_security




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki