Hasnain Ali

Results 2 comments of Hasnain Ali

Hi, I have been facing problems with diagnosing PPO2 training on multiple environments. Especially the episode reward are weird (see the image). ![image](https://user-images.githubusercontent.com/24753433/103527174-e81d8900-4ebc-11eb-855d-38c5710b7070.png) Today, I chanced to read this issue....

> > > @Capitolhill As a quick fix, I suggest trying out [stable-baselines3](https://github.com/DLR-RM/stable-baselines3) which also has tensorboard support and is more actively maintained. Migration from SB2 is mostly as simple...