Boqian
Results
1
comments of
Boqian
Adding `if context_layer.requires_grad == True:` on **309** lines of **modeling.py** works for me. Like: ` if context_layer.requires_grad == True: self.context_layer_val.retain_grad() ` Hope it solves your problem.