Sayantan Chakraborty
This transition has already been done in a different PR, so I am closing this.
So I am starting the `VEX` variants.
No problems yet, just that `simd_fabs` doesn't accept an `f16` argument, so I will just use a bitwise AND instead. I am actively avoiding doing f16 operations in Rust, but that's...
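A minimal sketch of the bitwise-AND workaround, not the code from this PR: it models each `f16` lane as its raw `u16` bit pattern so that no `f16` arithmetic is needed, and clears the IEEE 754 binary16 sign bit (bit 15). The helper names `f16_abs_bits` and `f16x8_abs_bits` and the eight-lane width are assumptions made for illustration.

```rust
/// Mask that clears the sign bit (bit 15) of an IEEE 754 binary16 value.
const F16_ABS_MASK: u16 = 0x7FFF;

/// Hypothetical helper: absolute value of one f16 lane, given as raw bits.
/// Only the sign bit changes; exponent and mantissa are left untouched.
fn f16_abs_bits(bits: u16) -> u16 {
    bits & F16_ABS_MASK
}

/// Hypothetical helper: lane-wise absolute value over eight f16 lanes,
/// mirroring what a single vector AND would do.
fn f16x8_abs_bits(lanes: [u16; 8]) -> [u16; 8] {
    lanes.map(f16_abs_bits)
}
```

For example, `f16_abs_bits(0xBC00)` (the bit pattern of -1.0) yields `0x3C00` (+1.0), and -0.0 (`0x8000`) becomes +0.0 (`0x0000`). NaN payloads are preserved because only the sign bit is cleared.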
cc @tgross35 @beetrees
I have changed the detection logic to be name-based. This approach can't have false negatives, but it has nontrivial behaviour with functions that export themselves as LLVM intrinsics, with `unadjusted`...
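As a rough illustration of what name-based detection could look like, assuming the check is simply whether the link name carries the `llvm.` prefix that LLVM reserves for intrinsics. The function `is_llvm_intrinsic_name` is hypothetical and is not taken from the compiler; the actual implementation may differ.

```rust
/// Hypothetical check: classify a foreign item as an LLVM intrinsic purely
/// from its link name. Every LLVM intrinsic name starts with "llvm.", so a
/// prefix test cannot miss one (no false negatives). Any other function that
/// exports itself under such a name is matched too, which is where the
/// nontrivial behaviour comes from.
fn is_llvm_intrinsic_name(link_name: &str) -> bool {
    link_name.starts_with("llvm.")
}
```

With this sketch, `is_llvm_intrinsic_name("llvm.fabs.f32")` is `true`, while an ordinary symbol such as `"memcpy"` is not matched.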
I managed to resolve the false positive (which resulted in #140822). This can now be extended to more use cases, e.g. using `bf16` vectors from Rust.
That can be used to improve performance, but I am not really focusing on performance in this PR. For now, I want to emphasize the correctness of the codegen.
Oh wait, I probably misunderstood your comment. You meant using the LLVM declaration by itself. Yeah, that would be better, thanks for the info. I will update the impl when...
Added functionality to use the appropriate AMX-specific casts instead of the generic `bitcast`. Discovered a bug: if the intrinsic is passed as a function pointer, it doesn't work. The...
@nikic nice, one less thing to worry about ❤️