kitty
kitty
hello,I encountered the same problem. Is there any progress on this issue?
急需这个功能
如果是手机端呢。。。
@urlyy I wrote a rust version, the neon performance improvement is about twice as good, we can optimize it together  my "Standard" is your normal
@urlyy I'm optimizing SIMD code for SSE2 and AVX
@urlyy neon I referred to part of your implementation, thank you