Matthew R. Becker
sorry yes. I meant CCD
@aphearin @manodeep I have been banging my head against this for a week. Here is my most minimal example of something I don't understand. Hopefully, it is my code!
@manodeep yes I can reproduce the bug in both of those cases.
Thanks for bumping me. We can debug here. The first thing to do is to figure out if the bug happens when the package is built outside of conda build...
> Does Conda-forge only target CPUs with AVX2?

No. It actually disables most SIMD optimizations by default.
The patch in this PR solves this I hope: https://github.com/conda-forge/corrfunc-feedstock/pull/20
Yes, this is the correct effect. conda-forge does not support high-level SIMD optimizations unless you ship a fat binary that detects the available instructions at runtime. There is an env var...
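To illustrate what "detects the instructions at runtime" means, here is a minimal sketch in Python. It is not how Corrfunc or conda-forge does it. It assumes a Linux x86 machine that exposes CPU feature flags in `/proc/cpuinfo`, and the kernel names (`kernel_avx512`, `kernel_avx2`, `kernel_sse42`, `kernel_fallback`) are hypothetical placeholders for the code paths a fat binary would ship.

```python
def parse_cpu_flags(cpuinfo_text):
    """Return the set of CPU feature flags found in /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    # No "flags" line (for example a non-x86 or non-Linux system).
    return set()


# Hypothetical kernels, ordered from most to least demanding instruction set.
KERNELS = [
    ("avx512f", "kernel_avx512"),
    ("avx2", "kernel_avx2"),
    ("sse4_2", "kernel_sse42"),
]


def pick_kernel(flags):
    """Pick the most capable kernel whose required flag is present."""
    for flag, name in KERNELS:
        if flag in flags:
            return name
    return "kernel_fallback"


def detect_kernel(path="/proc/cpuinfo"):
    """Read CPU flags from `path` and choose a kernel; fall back if unreadable."""
    try:
        with open(path) as f:
            text = f.read()
    except OSError:
        text = ""
    return pick_kernel(parse_cpu_flags(text))
```

The point is that the choice happens on the user's machine when the code runs, not on the build machine, so one package can work on CPUs without AVX2 and still use it where available.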
See this issue: https://github.com/conda-forge/corrfunc-feedstock/issues/2
Sure sounds great! We can redo the patching as needed to make this work.
For cross-reference, when we first did this in numba, we found a big bottleneck that appears to have been fixed since. xref: https://github.com/numba/numba/issues/4647