Re: [PATCH v6 17/18] arm64/mm: __always_inline to improve fork() perf

From: Catalin Marinas
Date: Fri Feb 16 2024 - 07:34:34 EST


On Thu, Feb 15, 2024 at 10:32:04AM +0000, Ryan Roberts wrote:
> As set_ptes() and wrprotect_ptes() become a bit more complex, the
> compiler may choose not to inline them. But this is critical for fork()
> performance. So mark the functions, along with contpte_try_unfold()
> which is called by them, as __always_inline. This is worth ~1% on the
> fork() microbenchmark with order-0 folios (the common case).
>
> Acked-by: Mark Rutland <mark.rutland@xxxxxxx>
> Signed-off-by: Ryan Roberts <ryan.roberts@xxxxxxx>

Acked-by: Catalin Marinas <catalin.marinas@xxxxxxx>