[PATCH v2 0/3] perf report: Implement visual marker for macro fusion in annotate

From: Jin Yao
Date: Sun Jun 18 2017 - 22:53:26 EST


Macro fusion merges two instructions to a single micro-op. Intel
core platform performs this hardware optimization under limited
circumstances. For example, CMP + JCC can be "fused" and executed
/retired together. While with sampling this can result in the
sample sometimes being on the JCC and sometimes on the CMP.
So for the fused instruction pair, they could be considered
together.

In general, the fused instruction pairs are:

cmp/test/add/sub/and/inc/dec + jcc.

This patch series marks the case clearly by joining the fused
instruction pair in the arrow of the jump.

For example:

â âââcmpl $0x0,argp_program_version_hook
81.93 â âââje 20
â â lock cmpxchg %esi,0x38a9a4(%rip)
â ââ jne 29
â ââ jmp 43
11.47 â20:âââcmpxch %esi,0x38a999(%rip)

Change-log:
-----------
v2: According to Arnaldo's comments, remove the weak function and
use an arch-specific function instead to check fused instruction
pair.

v1: Inital post

Jin Yao (3):
perf util: Return arch from symbol__disassemble and save it in browser
perf util: Check for fused instruction
perf report: Implement visual marker for macro fusion in annotate

tools/perf/arch/x86/annotate/instructions.c | 18 ++++++++++++++++
tools/perf/builtin-top.c | 2 +-
tools/perf/ui/browser.c | 27 ++++++++++++++++++++++++
tools/perf/ui/browser.h | 2 ++
tools/perf/ui/browsers/annotate.c | 32 ++++++++++++++++++++++++++++-
tools/perf/ui/gtk/annotate.c | 3 ++-
tools/perf/util/annotate.c | 25 ++++++++++++++++++++--
tools/perf/util/annotate.h | 6 +++++-
8 files changed, 109 insertions(+), 6 deletions(-)

--
2.7.4