Chen Liu
`self.critic_optim.zero_grad(); critic_loss.backward(); self.critic_optim.step(); self.actor_optim.zero_grad(); actor_loss.backward(); self.actor_optim.step()` When I change this order, it raises an error: the gradient update fails because of an in-place operation. Thanks a lot.
When we train the current task, do we use the data of the previous task? EWC needs task A data to compute the Fisher information; when we train task B, how...
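For context on the question above: in the usual EWC formulation, the Fisher information is estimated once, right after training on task A and using task A data. Training on task B then needs only the stored Fisher values and the task A parameters, not the task A samples. The sketch below illustrates that split under stated assumptions: a diagonal Fisher, a toy unit-variance Gaussian model, and hypothetical names (`estimate_fisher_diag`, `ewc_penalty`, `grad_log_lik`). Whether this repository follows the same scheme is not confirmed here.

```python
def estimate_fisher_diag(grad_log_lik, params, task_a_data):
    """Diagonal Fisher estimate: mean of squared per-sample log-likelihood gradients.

    Run once, after task A training, while task A data is still available.
    """
    fisher = [0.0] * len(params)
    for sample in task_a_data:
        grads = grad_log_lik(params, sample)
        for i, g in enumerate(grads):
            fisher[i] += g * g / len(task_a_data)
    return fisher


def ewc_penalty(params, star_params, fisher, lam):
    """Quadratic penalty added to the task B loss; needs no task A samples."""
    return 0.5 * lam * sum(
        f * (p - s) ** 2 for f, p, s in zip(fisher, params, star_params)
    )


# Toy model (assumption): unit-variance Gaussian with mean params[0],
# so the gradient of the log-likelihood w.r.t. the mean is (x - mean).
def grad_log_lik(params, x):
    return [x - params[0]]


task_a_data = [1.0, 3.0]
star_params = [2.0]  # parameters at the end of task A
fisher = estimate_fisher_diag(grad_log_lik, star_params, task_a_data)

# During task B, only `fisher` and `star_params` are carried over.
penalty = ewc_penalty([2.5], star_params, fisher, lam=4.0)
```

The design point is that `fisher` and `star_params` act as a compact summary of task A, so the task B loop can add `ewc_penalty(...)` to its own loss without touching task A data again.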
Hi, I noticed that in PaCo or GPaCo the logits in the numerator of the loss function are not masked, but the denominator of the loss function is masked, because `exp_logits...
In the train function of Configuration_1.ipynb, the server receives the outputs from the clients, then computes the loss and calls backward(). But how are the gradients updated on the clients? 😥 Thanks.
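As background for the question above, a common split-learning pattern is for the server to send the gradient of the loss with respect to the received activations back to the client, which then continues the chain rule through its own layers. The sketch below works this out by hand with scalar weights. All names (`client_forward`, `server_step`, `client_backward`) are hypothetical, and it is not confirmed that the notebook works this way.

```python
# Toy split model (assumption): client computes h = w_c * x,
# server computes y_hat = w_s * h and loss = (y_hat - y) ** 2.

def client_forward(w_c, x):
    """Client-side forward pass; the activation h is sent to the server."""
    return w_c * x


def server_step(w_s, h, y, lr):
    """Server computes the loss, updates its own weight, and returns dL/dh."""
    y_hat = w_s * h
    loss = (y_hat - y) ** 2
    dloss_dyhat = 2.0 * (y_hat - y)
    grad_w_s = dloss_dyhat * h   # gradient for the server weight
    grad_h = dloss_dyhat * w_s   # gradient w.r.t. the received activation
    return loss, w_s - lr * grad_w_s, grad_h


def client_backward(w_c, x, grad_h, lr):
    """Client finishes the chain rule: dL/dw_c = dL/dh * dh/dw_c."""
    grad_w_c = grad_h * x
    return w_c - lr * grad_w_c


w_c, w_s = 0.5, 2.0
x, y, lr = 1.0, 3.0, 0.1

h = client_forward(w_c, x)
loss_before, w_s_new, grad_h = server_step(w_s, h, y, lr)
w_c_new = client_backward(w_c, x, grad_h, lr)  # uses the gradient sent back
loss_after = (w_s_new * client_forward(w_c_new, x) - y) ** 2
```

In PyTorch, the client-side step typically corresponds to calling `backward` on the client's output tensor with the received gradient passed as the `gradient` argument, followed by the client optimizer's `step()`.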
Hello, this is great work. The paper mentions several mask schemes used during training. Could you open-source the training process? Thanks. 😀