[PATCH v2 0/2] powerpc32: optimisation of csum_partial_copy_generic()

From: Christophe Leroy
Date: Tue Jun 23 2015 - 01:38:37 EST


This patch optimises csum_partial_copy_generic() by making use of cache
instructions (dcbt/dcbz) just like copy_tofrom_user() does

On a TCP benchmark using socklib on the loopback interface on which checksum
offload and scatter/gather have been deactivated, we get about 20% performance
increase.

v2 is just a new issue with format-patch -M -C, other changes.

Christophe Leroy (2):
powerpc32: checksum_wrappers_64 becomes checksum_wrappers
powerpc32: rewrite of csum_partial_copy_generic based of copy_tofrom_user

arch/powerpc/include/asm/checksum.h | 9 -
arch/powerpc/lib/Makefile | 3 +-
arch/powerpc/lib/checksum_32.S | 320 ++++++++++++++-------
...{checksum_wrappers_64.c => checksum_wrappers.c} | 0
4 files changed, 210 insertions(+), 122 deletions(-)
rename arch/powerpc/lib/{checksum_wrappers_64.c => checksum_wrappers.c} (100%)

--
2.1.0

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
Please read the FAQ at http://www.tux.org/lkml/