Boqian

Results 1 comments of Boqian

Adding `if context_layer.requires_grad == True:` on **309** lines of **modeling.py** works for me. Like: ` if context_layer.requires_grad == True: self.context_layer_val.retain_grad() ` Hope it solves your problem.