easyaspi314 comments

Results 132 comments of


                                            easyaspi314

i686 gcc 12: regression at -O1 or -O2

> Ah, I'll try `__attribute__((optimize("no-tree-vectorize")))` > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106322#c14 attribute optimize does work temporarily but it does disable inlining which emits a warn/error. ```c /* GCC 12.1-12.2.0 emit garbage if vector.umulh...

i686 gcc 12: regression at -O1 or -O2

That backwards loop is off by one.

Remove use of native MMX intrinsics

So primarily looking at neon, what if we had ```c typedef struct { ... } simde_uint8x8_t; typedef struct { ... #ifdef SIMDE_X86_SSE2_NATIVE __m128i m128i; #endif } simde_private_uint8x8_t; simde_private_uint8x8_t simde_uint8x8_t_to_private(simde_uint8x8_t x)...

Remove use of native MMX intrinsics

Yes, but also I say no MMX allowed, period. It isn't something that is transparently handled by the compiler (unlike, say, `vzeroupper`) which is inappropriate for a library that attempts...

Remove use of native MMX intrinsics

[Example of MMX breaking things](https://tio.run/##XVBha4MwEP2eX3F0DGJ7TdU6V2g72PfBfkAZEo26gCZF7WhX/OtzF7t1o5Dkcu/uvbxcNs8qacphuNMmqw4qh01da9M12oj3J/aHtp3S1kEfVisorOUMIEnqOIJpk7dEyDqQSGBlTQnKHtIq/1dKmcfOV8oRtpDUddLm3Wfe2KTVccS99Q37RF2@cGhhG@BkC7SD1hQ2EFGYzTxwqnBVlEole70M@RFB7vTbKAokNdtCSrlLe9py578R5ejy9HI/rVnPmHulltrwH7/PlS6NbPnKA1eSO9d6hgBDXGKEDxjjI676W@/pb5/AUOBSYCSgv3zFcj5OYepJhHQ0uFhAEEPoQ@ALn/I9zb8r@ORegVsvxQRHx@4M6HSOidgPw1dWVLJsh/lr@A0) Messing with the optimization levels can result in differing values, even if you put `_mm_empty()` after each intrinsic (which would force the vector to memory...