wang-y-z
Results
2
comments of
wang-y-z
DNNDK didn't make the pruning open source yet.
Do you have more profile data by nsight-compute? Which can be a good guide for perf debugging. BTW, have you done any autotune your layer norm triton kernel?