Prabhasa Kalkur

Results: 2 comments by Prabhasa Kalkur

Thanks. While I wait, I also have a question on PPO2 (don't want to open another issue): I see there are ``self.n_envs`` environments running in parallel. I understand this might...

Thanks!

> Anyway, as mentioned in the doc, I would recommend using an `EvalCallback` instead of training reward for plotting learning curve

Ohh. Could you expand more on this? I...