lib/raid6/altivec: Add vpermxor implementation for raid6 Q syndrome
This patch uses the vpermxor instruction to optimise the raid6 Q syndrome. This instruction was made available with POWER8, ISA version 2.07. It allows for both vperm and vxor instructions to be done in a single instruction. This has been tested for correctness on a ppc64le vm with a basic RAID6 setup containing 5 drives. The performance benchmarks are from the raid6test in the /lib/raid6/test directory. These results are from an IBM Firestone machine with ppc64le architecture. The benchmark results show a 35% speed increase over the best existing algorithm for powerpc (altivec). The raid6test has also been run on a big-endian ppc64 vm to ensure it also works for big-endian architectures. Performance benchmarks: raid6: altivecx4 gen() 18773 MB/s raid6: altivecx8 gen() 19438 MB/s raid6: vpermxor4 gen() 25112 MB/s raid6: vpermxor8 gen() 26279 MB/s Signed-off-by:Matt Brown <matthew.brown.dev@gmail.com> Reviewed-by:
Daniel Axtens <dja@axtens.net> [mpe: Add VPERMXOR macro so we can build with old binutils] Signed-off-by:
Michael Ellerman <mpe@ellerman.id.au>
Showing
- arch/powerpc/include/asm/ppc-opcode.h 6 additions, 0 deletionsarch/powerpc/include/asm/ppc-opcode.h
- include/linux/raid/pq.h 4 additions, 0 deletionsinclude/linux/raid/pq.h
- lib/raid6/.gitignore 1 addition, 0 deletionslib/raid6/.gitignore
- lib/raid6/Makefile 26 additions, 1 deletionlib/raid6/Makefile
- lib/raid6/algos.c 4 additions, 0 deletionslib/raid6/algos.c
- lib/raid6/test/Makefile 15 additions, 2 deletionslib/raid6/test/Makefile
- lib/raid6/vpermxor.uc 105 additions, 0 deletionslib/raid6/vpermxor.uc
Loading
Please register or sign in to comment