Re: [PATCH v12 1/5] asm-generic: Improve csum_fold

From: Arnd Bergmann
Date: Wed Dec 13 2023 - 02:52:28 EST


On Wed, Dec 13, 2023, at 02:18, Charlie Jenkins wrote:
> This csum_fold implementation introduced into arch/arc by Vineet Gupta
> is better than the default implementation on at least arc, x86, and
> riscv. Using GCC trunk and compiling non-inlined version, this
> implementation has 41.6667%, 25% fewer instructions on riscv64, x86-64
> respectively with -O3 optimization. Most implmentations override this
> default in asm, but this should be more performant than all of those
> other implementations except for arm which has barrel shifting and
> sparc32 which has a carry flag.
>
> Signed-off-by: Charlie Jenkins <charlie@xxxxxxxxxxxx>

Acked-by: Arnd Bergmann <arnd@xxxxxxxx>