Ash Vardanian
Ash Vardanian
I think #642 might be related to this, as I use capitalized verbs.
Hi @ChillFish8! Which version of SimSIMD are you using? AVX2 for `float32` is practically the only SIMD+datatype combo we don't implement, as that's the only one that compilers vectorize well...
The SimSIMD repository contains Rust benchmarks against native implementations. Maybe they are poorly implemented... Can you try cloning the SimSIMD repository and then running the benchmarks, as described in the...
Is that all still on the same Ryzen CPU, @ChillFish8? I was just refreshing the [ParallelReductionsBenchmark](https://github.com/ashvardanian/ParallelReductionsBenchmark) and added a loop-unrolled variant with scalar code in the C++ layer. It still...
I believe this is related to #148 and can be improved with the next PR 🤗
Hey, @ChillFish8! Are you observing the same performance issues with the most recent 5.0.1 release as well?
Which machine are [these](https://github.com/ashvardanian/SimSIMD/issues/107#issuecomment-2299830834) numbers coming from? Is that an Arm machine? Is there SVE available?
In some cases, on older AMD CPUs, the latency of some instructions was too high and the compilers preferred using serial code. I think for now we can close this...
Great reference, @corani! Never seen it! I can try it later, but it's not a high priority right now. Feel free to open a PR if you ever have time...
Thanks, @amirzia! I think `include/simsimd/probability.h` is a good place for those 🤗