flint
flint copied to clipboard
fft_small tuning for arm64
- Rewrite addmod / addmod_limited to use vector intrinsics
- Tune cutoffs for functions that call fft_small on arm64 (currently all such cutoffs are based on the AVX2 version on a Zen3 laptop)
The second point was fixed a week ago or so for Apple M1.