Jesse Cai
@pytorchbot rebase
@pytorchbot merge
@pytorchbot merge -f "unrelated failure"
@pytorchbot revert
@pytorchbot revert -m "this PR breaks AO tests" -c nosignal
@pytorchbot rebase
@pytorchbot rebase
@pytorchbot merge -f "unrelated failure on cuDNN path"
cc @mostafaelhoushi What hardware and which PyTorch and ao versions are you using? On my H100 with the nightlies, I see a speedup (for 4096, 11008):
```
Baseline: 0.04709856033325195 ms...
```
I would definitely recommend testing with the latest nightlies.