finufft
[DRAFT] Chunksize for spreader
Processing multiple nonuniform (NU) points at the same time allows further optimization of the Horner evaluation. The suggestions from @mreineck in https://github.com/flatironinstitute/finufft/discussions/461 could have a large impact, since they make it possible to exploit additional SIMD vectorization.
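
To illustrate the idea, here is a minimal sketch of a chunked Horner evaluation. It is not the code in this PR and is not taken from the linked discussion. The names `horner_chunk`, `kChunk`, `kWidth` and `kDegree`, the sizes, and the coefficient layout are all hypothetical. The point is only that once the innermost loop runs over the points of a chunk, it has a fixed trip count and no dependency between iterations, which is the kind of loop a compiler can usually vectorize.

```cpp
#include <array>
#include <cstddef>

// Hypothetical sizes, chosen for illustration only (not FINUFFT's values).
constexpr std::size_t kChunk = 4;   // NU points processed together
constexpr std::size_t kWidth = 3;   // number of kernel pieces
constexpr std::size_t kDegree = 2;  // polynomial degree of each piece

// coeffs[d][j] is the coefficient of z^d for kernel piece j (assumed layout).
using Coeffs = std::array<std::array<double, kWidth>, kDegree + 1>;
using Chunk = std::array<double, kChunk>;
using KernelVals = std::array<Chunk, kWidth>;

// Evaluate every kernel piece at all points of one chunk with Horner's rule.
// out[j][p] holds piece j evaluated at z[p].
KernelVals horner_chunk(const Coeffs& c, const Chunk& z) {
  KernelVals out{};
  for (std::size_t j = 0; j < kWidth; ++j) {
    Chunk acc;
    acc.fill(c[kDegree][j]);  // start from the leading coefficient
    for (std::size_t d = kDegree; d-- > 0;) {
      // Innermost loop over the chunk: independent iterations, fixed length.
      for (std::size_t p = 0; p < kChunk; ++p) {
        acc[p] = acc[p] * z[p] + c[d][j];
      }
    }
    out[j] = acc;
  }
  return out;
}
```

With a single point per call, the only loop left to vectorize is the one over kernel pieces, whose length is tied to the kernel width. Adding the chunk dimension gives a second, independent axis whose length can be chosen to match the SIMD register width.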