triton
[WIP][gfx11] Support tied wmma instructions
- Generate intrinsics for wmma calculations.
- Generate tied instructions along the M axis if possible.
Results for FA benchmark (from here) for gfx11 (W7900) target:
Thanks @jfactory07 for the results above.