ZhangYF

Results 1 comments of ZhangYF

same question here. It's indeed very hard to understand why it was designed this way. I believe accelerator.backward(loss) should only perform the backward operation, and other steps should be written...