Scott Myron

Results 21 comments of Scott Myron

> This one is interesting, as it doesn't require any of the annoying feature detection SIMD impose. > > But if we end up going with SIMD anyway, might as...

> > It's nice to have a pure C implementation that doesn't require any special instructions. > > True. I guess my only real reservation with this PR (and also...

I think this is fine to close. I need to figure out how to build Ruby on Windows with Visual Studio so I can ensure the SIMD code build with...

Comparison between `master` and this branch in real world benchmarks. This is for the lookup table implementation. ``` == Encoding activitypub.json (52595 bytes) ruby 3.4.1 (2024-12-25 revision 48d4efcb85) +PRISM [arm64-darwin24]...

Apologies for going dark for a while. I've been trying to make incremental improvements on a different branch (found [here](https://github.com/ruby/json/compare/master...samyron:json:arm-neon-simd-v2-clz)). My hope was using a [move mask](https://community.arm.com/arm-community-blogs/b/servers-and-cloud-computing-blog/posts/porting-x86-vector-bitmask-optimizations-to-arm-neon) would be faster...

@byroot if you have a few minutes, would you be able to checkout [this branch](https://github.com/samyron/json/tree/arm-neon-simd-v2-optimized-scalar) and benchmark it against master. You'll have to tweak your compare script a bit to...

From a co-worker with an M4 Pro: ``` == Encoding activitypub.json (52595 bytes) ruby 3.2.6 (2024-10-30 revision 63aeb018eb) [arm64-darwin24] Warming up -------------------------------------- after 2.876k i/100ms Calculating ------------------------------------- after 28.251k (±...

From another co-worker with an M1 Pro: ``` == Encoding activitypub.json (52595 bytes) ruby 3.4.2 (2025-02-15 revision d2930f8e7a) +PRISM [arm64-darwin24] Warming up -------------------------------------- after 2.166k i/100ms Calculating ------------------------------------- after 21.521k...

Hi @radiospiel, I'll take a look at #769. I originally started working on https://github.com/ruby/json/pull/730 which supports Neon, SSE 4.2 and AVX2 with runtime detection support. The PR got a bit...

> This is magnitudes more than the numbers posted here. I have seen a 48% posted above (on the `activitypub` testcase), so is this a typo `x` → `%`? The...