aqrit
aqrit
should be fixed by #632 ?
`_mm_alignr_epi8()` would map to `wasm_i8x16_shuffle()`. If for some reason you need a ~~dynamic~~ variable shift by bytes then that can be emulated using `wasm_i8x16_swizzle()`.
at the least v8/turbofan should lower a `wasm_i8x16_shuffle` in this instance to 1 instruction on both SSSE3 and NEON: https://github.com/zeux/wasm-simd/blob/master/Shuffles.md#concats However, I'm having trouble getting the disassembly to prove this...
On a related note: amalgamate.py has to be edited to work if one just grabs a tarball of the source.
> it sometimes gets incremented multiple times because of macro syntax That is weird. From [here](https://gcc.gnu.org/onlinedocs/gcc/Typeof.html), I'd say try replacing `typeof` with `__auto_type` and see if that works. However, that...
Could you add some clarity? The PR is for `sse_convert_latin1_to_utf8`. But you're discussing a performance regression with `sse_convert_utf16_to_latin1` ? This PR : I suspect the branch for 'ASCII fast path'...
A `Nehalem` would not have a problem with the shuffles. However, a `Haswell` or `Skylake` might. On what CPU are you performing the benchmark? The terms would be [register renaming](https://en.wikipedia.org/wiki/Register_renaming)...
> cache miss If we're willing to do 4 lookups per 16 bytes of input, then we'd only use 2 cache lines for tables. https://gist.github.com/aqrit/5c914da98006874d0401983eb687e30e Note: I haven't actually studied...
I highly suspect... the menu issues with d1 are cause by the same issue. unfortunately, I was not able to get this project working quite right so I stopped development....
Is `__attribute__((aligned(x)))` being used incorrectly? "Cannot decrease the alignment below the natural alignment of the type." "For a variable that is not in a structure, the minimum alignment is the...