cutlass
cutlass copied to clipboard
feat: allow print_latex(TiledMMA) to colorize sliced thread and add print_latex(ThrMMA)
Allow colorize only one thread of print_latex, to make mma pattern obvious and reduce eye strain.
For example,
#include "cute/tensor.hpp"
using namespace cute;
int main() {
auto tiled_mma = make_tiled_mma(SM80_16x8x8_F32F16F16F32_TN{});
auto thr_mma = tiled_mma.get_thread_slice(11);
// print_latex(tiled_mma, 11);
print_latex(thr_mma); // the same as above
return 0;
}
produces
This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.
This PR has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates.