nsimd
nsimd copied to clipboard
Add loads/stores of BFloat16
BFloat16 are truncated standard float32, therefore
- loads involves unpacks and
- stores involves unzip
This is OK for all supported architectures.
Reference: https://en.wikipedia.org/wiki/Bfloat16_floating-point_format.