pythran
pythran copied to clipboard
Use faster scalar version for trigo function
Second try.
@JohanMabille what's your take on this?
If it turns out to be faster than the scalar (std) version, we can implement the same algorithms for scalars as the ones operating on batches (so we save the transfer to and from simd registers).
the speedup may come from the usage of sse instruction though. This needs more investigation. But yes, this is faster for the flagged functions, at least on my laptop, but this is probably hardware-dependent.