triton
triton copied to clipboard
[AMD][gfx12] Support WMMAv2 dot instruction generation
- Added intrinsic generation according to the operands type, cache them to avoid repetitive calculations
- Fixed parameters dependent on the version in the main logic of WMMA operation generator
- Added a lit test to verify number of llvm instructions