intel-intrinsics
intel-intrinsics copied to clipboard
Deprecate loadUnaligned and storeUnaligned from public API
Concrete context always has a type and size information, so there isn't such a need for those in the public API. You can always use _mm_loadu_ps
and friends.
- [x] Remove non-LDC use of loadUnaligned
- [x] Remove non-LDC use of storeUnaligned
- [x] Remove emulated loadUnaligned and storeUnaligned
Reminder for myself: When making new code, do not use loadUnaligned or storeUnaligned unless for LDC code paths.
The issues with those is that it needs type-specific overloads, that always correspond to one intrinsic. loadUnaligned overloads ends up being untested and unoptimized, while the corresponding intrinsics is tested and optimized and already ported for all compilers, so it's really duplication.
Related to #102 , using __vector directly is often faster
Done, shufflevector is the same and should go though