packed_simd
packed_simd copied to clipboard
The performance multipliers for binomial put in the readme.md file for options_pricing are incorrect.
It looks like the speedup multipliers for par_simd, ispc and ispc+tasks are incorrect given the execution time.
par_simd should be 7.37x
ispc should be 2.55x
ispc+tasks should be 5.09x.
And par_simd is 1.45x faster than ispc+tasks.
Also, have you since repeated these benchmarks with newer versions of the ispc compiler? Currently the setup_benchmarks.sh is pointing to 1.9.2 on LLVM 5 which was released almost 7 years ago. We are now at 1.23.0 on LLVM 16. Performance is quite a bit better as we've made improvements. These days ispc is often beating intrinsics implementations.
I'd be curious to see the results with a new compiler binary.