Luca Wehrstedt
Results
41
issues of
Luca Wehrstedt
Summary: By telling CUTLASS to output in column-major (somehow it's faster) and transposing the inputs so that the end result is the same. Here are the benchmark results for the...
fb-exported
cla signed