[linus:master] [apparmor] 90c436a64a: stress-ng.kill.ops_per_sec -8.0% regression

From: kernel test robot
Date: Wed Nov 15 2023 - 23:52:01 EST



hi, John Johansen,

we reported
"[linux-next:master] [apparmor] 90c436a64a: stress-ng.kcmp.ops_per_sec 383.0% improvement"
in
https://lore.kernel.org/all/202310311037.173ebf2b-oliver.sang@xxxxxxxxx/
when this commit is in linux-next/master.

now we noticed this commit is in mainline, however, we catpured a small
regression by another stress-ng test. FYI.


Hello,

kernel test robot noticed a -8.0% regression of stress-ng.kill.ops_per_sec on:


commit: 90c436a64a6e20482a9a613c47eb4af2e8a5328e ("apparmor: pass cred through to audit info.")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

nr_threads: 100%
disk: 1HDD
testtime: 60s
class: interrupt
test: kill
cpufreq_governor: performance



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@xxxxxxxxx>
| Closes: https://lore.kernel.org/oe-lkp/202311161149.4d39cc37-oliver.sang@xxxxxxxxx


Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20231116/202311161149.4d39cc37-oliver.sang@xxxxxxxxx

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
interrupt/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/kill/stress-ng/60s

commit:
d20f5a1a6e ("apparmor: rename audit_data->label to audit_data->subj_label")
90c436a64a ("apparmor: pass cred through to audit info.")

d20f5a1a6e792d22 90c436a64a6e20482a9a613c47e
---------------- ---------------------------
%stddev %change %stddev
\ | \
112.01 +21438.4% 24124 ±200% meminfo.Active(file)
62.80 +4.5% 65.65 turbostat.RAMWatt
15.04 ±100% +4879.0% 748.88 ±114% vmstat.io.bo
82957 ± 2% -7.4% 76795 ± 3% vmstat.system.cs
28.00 +21403.0% 6021 ±200% proc-vmstat.nr_active_file
28.00 +21403.0% 6021 ±200% proc-vmstat.nr_zone_active_file
129482 -4.4% 123826 proc-vmstat.pgactivate
1032 ±100% +4641.1% 48928 ±113% proc-vmstat.pgpgout
16.90 ± 14% +25.5% 21.21 ± 20% sched_debug.cfs_rq:/.removed.runnable_avg.avg
16.89 ± 14% +25.5% 21.21 ± 20% sched_debug.cfs_rq:/.removed.util_avg.avg
247.17 ± 6% +14.0% 281.82 ± 8% sched_debug.cfs_rq:/.util_est_enqueued.avg
34402 ± 2% -21.9% 26867 ± 6% sched_debug.cpu.nr_switches.min
181.33 ± 7% +129.4% 416.00 ± 2% perf-c2c.DRAM.local
1766 ± 4% +140.5% 4249 ± 4% perf-c2c.DRAM.remote
3189 ± 5% +119.0% 6984 perf-c2c.HITM.local
538.00 ± 10% +273.7% 2010 ± 7% perf-c2c.HITM.remote
3727 ± 6% +141.3% 8995 ± 2% perf-c2c.HITM.total
738.58 -8.1% 678.88 stress-ng.kill.kill_calls_per_sec
912564 -8.0% 839201 stress-ng.kill.ops
15208 -8.0% 13985 stress-ng.kill.ops_per_sec
2741058 -7.8% 2527111 stress-ng.time.involuntary_context_switches
2721296 -7.9% 2505879 stress-ng.time.voluntary_context_switches
0.58 ± 7% +35.9% 0.79 ± 6% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork
21.33 ± 18% -50.0% 10.67 ± 32% perf-sched.wait_and_delay.count.__cond_resched.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
0.01 ± 26% +54.5% 0.01 ± 25% perf-sched.wait_time.avg.ms.__cond_resched.__kmem_cache_alloc_node.__kmalloc_node.proc_sys_call_handler.vfs_read
1.68 ±214% -99.5% 0.01 ± 64% perf-sched.wait_time.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
0.01 ± 36% +66.0% 0.01 ± 28% perf-sched.wait_time.max.ms.__cond_resched.__kmem_cache_alloc_node.__kmalloc_node.proc_sys_call_handler.vfs_read
8.10 ± 43% -29.7% 5.69 ± 12% perf-sched.wait_time.max.ms.__cond_resched.dput.__fput.task_work_run.exit_to_user_mode_loop
1.64 ± 2% +152.0% 4.13 ± 3% perf-stat.i.MPKI
11.98 +6.8 18.75 ± 2% perf-stat.i.cache-miss-rate%
13072329 ± 2% +145.9% 32141491 ± 4% perf-stat.i.cache-misses
1.082e+08 ± 3% +53.1% 1.657e+08 ± 4% perf-stat.i.cache-references
86654 ± 2% -8.4% 79368 ± 4% perf-stat.i.context-switches
15915 -59.2% 6487 perf-stat.i.cycles-between-cache-misses
0.01 ± 15% -0.0 0.01 ± 5% perf-stat.i.dTLB-load-miss-rate%
338311 ± 15% -21.7% 264729 ± 8% perf-stat.i.dTLB-load-misses
149.20 ± 5% +115.1% 320.97 ± 2% perf-stat.i.metric.K/sec
86.96 +3.9 90.83 perf-stat.i.node-load-miss-rate%
2659602 ± 3% +205.8% 8133328 ± 4% perf-stat.i.node-load-misses
398586 ± 5% +69.2% 674468 ± 5% perf-stat.i.node-loads
4272013 ± 2% +102.4% 8646615 ± 5% perf-stat.i.node-store-misses
444659 ± 3% +117.1% 965253 ± 5% perf-stat.i.node-stores
1.63 ± 2% +149.4% 4.06 ± 2% perf-stat.overall.MPKI
12.08 +7.3 19.41 perf-stat.overall.cache-miss-rate%
16313 -59.8% 6554 perf-stat.overall.cycles-between-cache-misses
0.02 ± 15% -0.0 0.01 ± 7% perf-stat.overall.dTLB-load-miss-rate%
86.84 +5.5 92.33 perf-stat.overall.node-load-miss-rate%
12940552 ± 2% +146.4% 31886577 ± 3% perf-stat.ps.cache-misses
1.072e+08 ± 2% +53.3% 1.643e+08 ± 3% perf-stat.ps.cache-references
85805 ± 2% -8.3% 78664 ± 3% perf-stat.ps.context-switches
341154 ± 16% -21.3% 268598 ± 9% perf-stat.ps.dTLB-load-misses
2632824 ± 2% +206.4% 8067907 ± 4% perf-stat.ps.node-load-misses
398939 ± 5% +68.2% 670849 ± 5% perf-stat.ps.node-loads
4233610 ± 2% +102.6% 8578379 ± 4% perf-stat.ps.node-store-misses
434918 ± 3% +119.3% 953796 ± 4% perf-stat.ps.node-stores
38.66 -38.7 0.00 perf-profile.calltrace.cycles-pp.aa_get_task_label.apparmor_task_kill.security_task_kill.kill_something_info.__x64_sys_kill
9.04 ± 6% -2.4 6.67 perf-profile.calltrace.cycles-pp.aa_may_signal.apparmor_task_kill.security_task_kill.kill_something_info.__x64_sys_kill
0.74 -0.0 0.68 perf-profile.calltrace.cycles-pp.profile_signal_perm.aa_may_signal.apparmor_task_kill.security_task_kill.kill_something_info
0.76 -0.0 0.72 perf-profile.calltrace.cycles-pp.kill_pid_info.kill_something_info.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe
98.37 +0.1 98.46 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.kill
98.38 +0.1 98.47 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.kill
98.40 +0.1 98.49 perf-profile.calltrace.cycles-pp.kill
98.24 +0.1 98.36 perf-profile.calltrace.cycles-pp.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe.kill
98.20 +0.1 98.32 perf-profile.calltrace.cycles-pp.kill_something_info.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe.kill
0.00 +0.7 0.68 perf-profile.calltrace.cycles-pp.check_kill_permission.kill_something_info.__x64_sys_kill.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.00 +0.7 0.74 perf-profile.calltrace.cycles-pp.get_task_cred.apparmor_task_kill.security_task_kill.kill_something_info.__x64_sys_kill
38.96 -39.0 0.00 perf-profile.children.cycles-pp.aa_get_task_label
9.14 ± 6% -2.4 6.74 perf-profile.children.cycles-pp.aa_may_signal
0.47 ± 4% -0.1 0.41 ± 2% perf-profile.children.cycles-pp.__x64_sys_openat
0.47 ± 3% -0.1 0.41 ± 2% perf-profile.children.cycles-pp.do_sys_openat2
0.46 ± 3% -0.1 0.41 perf-profile.children.cycles-pp.open64
0.76 -0.1 0.71 perf-profile.children.cycles-pp.profile_signal_perm
0.45 ± 4% -0.1 0.40 perf-profile.children.cycles-pp.do_filp_open
0.45 ± 4% -0.1 0.40 ± 2% perf-profile.children.cycles-pp.path_openat
0.20 ± 2% -0.0 0.15 ± 3% perf-profile.children.cycles-pp.__schedule
0.20 ± 3% -0.0 0.16 ± 3% perf-profile.children.cycles-pp.schedule
0.32 ± 6% -0.0 0.28 ± 4% perf-profile.children.cycles-pp._raw_spin_lock
0.76 -0.0 0.72 perf-profile.children.cycles-pp.kill_pid_info
0.07 ± 7% -0.0 0.03 ± 70% perf-profile.children.cycles-pp.complete_signal
0.10 ± 5% -0.0 0.07 perf-profile.children.cycles-pp.do_send_sig_info
0.09 -0.0 0.06 ± 7% perf-profile.children.cycles-pp.__send_signal_locked
0.19 ± 3% -0.0 0.17 ± 3% perf-profile.children.cycles-pp.apparmor_capable
0.07 ± 5% -0.0 0.05 ± 7% perf-profile.children.cycles-pp.switch_mm_irqs_off
0.07 ± 5% -0.0 0.05 ± 7% perf-profile.children.cycles-pp.try_to_wake_up
0.19 ± 3% -0.0 0.17 ± 2% perf-profile.children.cycles-pp.ns_capable
0.19 ± 3% -0.0 0.17 ± 2% perf-profile.children.cycles-pp.security_capable
0.09 -0.0 0.07 ± 5% perf-profile.children.cycles-pp.__close
0.07 ± 7% -0.0 0.05 perf-profile.children.cycles-pp.__x64_sys_pause
0.07 -0.0 0.06 ± 8% perf-profile.children.cycles-pp.task_work_run
0.09 ± 5% -0.0 0.07 ± 6% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.09 ± 8% -0.0 0.07 ± 6% perf-profile.children.cycles-pp.lookup_fast
0.07 ± 5% -0.0 0.06 perf-profile.children.cycles-pp.proc_sys_call_handler
98.24 +0.1 98.36 perf-profile.children.cycles-pp.__x64_sys_kill
98.23 +0.1 98.35 perf-profile.children.cycles-pp.kill_something_info
0.05 +0.1 0.17 ± 2% perf-profile.children.cycles-pp.audit_signal_info
0.07 ± 7% +0.2 0.27 ± 2% perf-profile.children.cycles-pp.audit_signal_info_syscall
0.42 +0.3 0.70 perf-profile.children.cycles-pp.check_kill_permission
0.00 +0.8 0.76 perf-profile.children.cycles-pp.get_task_cred
38.83 -38.8 0.00 perf-profile.self.cycles-pp.aa_get_task_label
8.36 ± 6% -2.3 6.03 perf-profile.self.cycles-pp.aa_may_signal
0.74 -0.1 0.68 perf-profile.self.cycles-pp.profile_signal_perm
0.10 ± 4% -0.0 0.08 ± 4% perf-profile.self.cycles-pp.security_task_kill
0.09 ± 4% -0.0 0.07 ± 5% perf-profile.self.cycles-pp._raw_spin_lock
0.07 ± 5% -0.0 0.05 perf-profile.self.cycles-pp.switch_mm_irqs_off
0.12 ± 4% -0.0 0.11 ± 5% perf-profile.self.cycles-pp.__kill_pgrp_info
0.09 ± 5% -0.0 0.08 perf-profile.self.cycles-pp.kill_something_info
0.11 ± 4% -0.0 0.10 perf-profile.self.cycles-pp.check_kill_permission
0.00 +0.2 0.16 ± 2% perf-profile.self.cycles-pp.audit_signal_info
0.06 ± 8% +0.2 0.26 ± 2% perf-profile.self.cycles-pp.audit_signal_info_syscall
0.00 +0.8 0.75 perf-profile.self.cycles-pp.get_task_cred
46.37 +40.2 86.58 perf-profile.self.cycles-pp.apparmor_task_kill



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki