Alexander Karpinsky
Alexander Karpinsky
Also back ported to upstream: https://github.com/woltapp/blurhash/pull/256
I have updated branch to match the latest version from https://github.com/woltapp/blurhash/pull/256 @lautat Are you interested in this improvements?
~~I've also implemented SSE and NEON optimizations in [separate branch](https://github.com/homm/blurhash/blob/optimize-simd/C/encode.c).~~ The last optimization with unrolling loop in `multiplyBasisFunction` is actually works better since it allows any compiler effectively autovectorize the...
@DagAgren Are you interested in this improvements?
I also improved decoder performance about 14 times using the same techniques: caching cos values, linearTosRGB values and unrolling loops. This improves performance of decoding from 6 Mpx/s to 86...
@DagAgren How can I earn your attention?
> However, this code is written intentionally to be simple rather than performant Does this mean you’re rejecting any performance improvements entirely, or only the more radical ones (like 4×...
> split 10-bit 3-channel images to 3 10-bit 1-channel images in advance, then use Pillow to handle the LUT Again, this wouldn't 3D LUT, this will be three 1D LUTs.
@python-pillow/pillow-team What do you think?
@radarhere Ok, added, but it's just some random image from stocks)