Luca Wehrstedt

Results 41 issues of Luca Wehrstedt

Summary: By telling CUTLASS to output in column-major (somehow it's faster) and transposing the inputs so that the end result is the same. Here are the benchmark results for the...

fb-exported
cla signed
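
The layout trick in the summary above rests on the identity (A·B)^T = B^T·A^T: a column-major write of the transposed product leaves the same bytes in memory as a row-major write of the original product. The following is a minimal pure-Python sketch of that identity only, not the CUTLASS code from the change; all helper names (`matmul`, `transpose`, `flatten_row_major`, `flatten_col_major`) are hypothetical and chosen for illustration.

```python
def matmul(a, b):
    """Plain matrix product of two lists of lists (hypothetical helper)."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def transpose(m):
    """Swap rows and columns."""
    return [list(row) for row in zip(*m)]


def flatten_row_major(m):
    """Memory order when rows are stored contiguously."""
    return [x for row in m for x in row]


def flatten_col_major(m):
    """Memory order when columns are stored contiguously."""
    return flatten_row_major(transpose(m))


a = [[1, 2, 3], [4, 5, 6]]        # 2x3
b = [[7, 8], [9, 10], [11, 12]]   # 3x2

# Direct product, stored row-major.
c = matmul(a, b)

# Swapped, transposed inputs: this computes C^T = B^T * A^T.
c_t = matmul(transpose(b), transpose(a))

# Writing C^T in column-major order yields the same flat buffer
# as writing C in row-major order, so the end result is unchanged.
assert flatten_col_major(c_t) == flatten_row_major(c)
```

Whether the column-major write is actually faster depends on the kernel and hardware; the sketch only shows why the result stays the same.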