RE: [PATCH 2/3] riscv: optimized memmove

From: David Laight
Date: Tue Jan 30 2024 - 06:51:45 EST

Next message: Jisheng Zhang: "Re: [PATCH net v2] net: stmmac: protect updates of 64-bit statistics counters"
Previous message: Will Deacon: "Re: [PATCH] arm64/signal: Don't assume that TIF_SVE means we saved SVE state"
In reply to: Jisheng Zhang: "Re: [PATCH 2/3] riscv: optimized memmove"
Next in thread: Nick Kossifidis: "Re: [PATCH 2/3] riscv: optimized memmove"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Jisheng Zhang
> Sent: 30 January 2024 11:31
>
> On Sun, Jan 28, 2024 at 12:47:00PM +0000, David Laight wrote:
> > From: Jisheng Zhang
> > > Sent: 28 January 2024 11:10
> > >
> > > When the destination buffer is before the source one, or when the
> > > buffers doesn't overlap, it's safe to use memcpy() instead, which is
> > > optimized to use a bigger data size possible.
> > >
> > ...
> > > + * Simply check if the buffer overlaps an call memcpy() in case,
> > > + * otherwise do a simple one byte at time backward copy.
> >
> > I'd at least do a 64bit copy loop if the addresses are aligned.
> >
> > Thinks a bit more....
> >
> > Put the copy 64 bytes code (the body of the memcpy() loop)
> > into it an inline function and call it with increasing addresses
> > in memcpy() are decrementing addresses in memmove.
>
> Hi David,
>
> Besides the 64 bytes copy, there's another optimization in __memcpy:
> word-by-word copy even if s and d are not aligned.
> So if we make the two optimizd copy as inline functions and call them
> in memmove(), we almost duplicate the __memcpy code, so I think
> directly calling __memcpy is a bit better.

If a forwards copy is valid call memcpy() - which I think you do.
If not you can still use the same 'copy 8 register' code
that memcpy() uses - just with a decrementing block address.

David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)

Next message: Jisheng Zhang: "Re: [PATCH net v2] net: stmmac: protect updates of 64-bit statistics counters"
Previous message: Will Deacon: "Re: [PATCH] arm64/signal: Don't assume that TIF_SVE means we saved SVE state"
In reply to: Jisheng Zhang: "Re: [PATCH 2/3] riscv: optimized memmove"
Next in thread: Nick Kossifidis: "Re: [PATCH 2/3] riscv: optimized memmove"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]