Prabhasa Kalkur
Results
2
comments of
Prabhasa Kalkur
Thanks. While I wait, I also have a question on PPO2 (don't want to open another issue): I see there are ``self.n_envs`` environments running in parallel. I understand this might...
Thanks! > Anyway, as mentioned in the doc, I would recommend using an `EvalCallback` instead of training reward for plotting learning curve Ohh. Could you expand more on this? I...