arch/x86: AVX RAID5 xor checksumming v1

From: Jim Kukunas
Date: Wed Apr 18 2012 - 18:58:24 EST

Hi Folks,

The following patch adds an AVX implementation of the RAID5 xor checksumming.

This version differs from version 0 in that it:
0) checks whether assembler supports AVX
1) replaces __attribute__((aligned(32))) with __aligned(32)
2) adds do/while(0); to macro blocks
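
For anyone who has not run into items 1 and 2 before, here is a minimal
userspace sketch of the two idioms. It is not code from the patch: XOR_WORD,
xor_block, demo_p0 and demo_p1 are hypothetical names made up for
illustration, and __aligned is defined locally as a stand-in for the kernel
shorthand, assuming GCC or Clang.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Userspace stand-in for the kernel's __aligned() shorthand (GCC/Clang). */
#ifndef __aligned
#define __aligned(x) __attribute__((aligned(x)))
#endif

/*
 * Hypothetical multi-statement macro. The do { } while (0) wrapper makes it
 * expand to a single statement, so it stays correct as the unbraced body of
 * an if, else, or loop. The caller supplies the trailing semicolon.
 */
#define XOR_WORD(dst, src, i)		\
do {					\
	(dst)[(i)] ^= (src)[(i)];	\
	(i)++;				\
} while (0)

/* XOR src into dst; both point to `words` 64-bit words, 32-byte aligned. */
static void xor_block(uint64_t *dst, const uint64_t *src, size_t words)
{
	size_t i = 0;

	/* Aligned 256-bit AVX loads/stores need 32-byte aligned addresses. */
	assert((uintptr_t)dst % 32 == 0);
	assert((uintptr_t)src % 32 == 0);

	while (i < words)
		XOR_WORD(dst, src, i);	/* unbraced body is safe here */
}

/* Example buffers carrying the 32-byte alignment from item 1. */
static uint64_t demo_p0[4] __aligned(32) = { 1, 2, 3, 4 };
static uint64_t demo_p1[4] __aligned(32) = { 1, 0, 3, 0 };
```

Calling xor_block(demo_p0, demo_p1, 4) leaves the XOR of the two buffers in
demo_p0. Without the do/while(0) wrapper, only the first statement of the
macro would belong to the while loop above.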

Based on xor_speed, the AVX implementation appears to be ~32% faster than the
SSE implementation on my i7 2600:

generic_sse: 15088.000 MB/sec
avx: 19936.000 MB/sec
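
The ~32% figure follows directly from the two throughput numbers above. As a
small sketch of the arithmetic (speedup_percent is a hypothetical helper, not
part of the patch or of xor_speed):

```c
#include <assert.h>

/* Relative speedup of candidate over baseline, in percent. */
static double speedup_percent(double baseline_mbs, double candidate_mbs)
{
	assert(baseline_mbs > 0.0);
	return (candidate_mbs / baseline_mbs - 1.0) * 100.0;
}
```

speedup_percent(15088.0, 19936.0) comes out at roughly 32.1.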


