walk-these-ways
walk-these-ways copied to clipboard
Questions about “optimizer” in ppo.py
I noticed that “self.optimizer” and “self.adaptation_module_optimizer” are defined in ppo.py, but both of these two optimizers seem to be related to actor_critic. Will this cause parameters to be updated repeatedly in the update function? Looking forward to your answers.