jtromans
jtromans
I also got this problem with PPO2. I note in issues #340 the entropy coefficient was to blame. More concretely, the OP suggested that it was too high at a...
Thanks for catching that - I've corrected that but I still get the same issue. What would be the appropriate approach for debugging this.  
Thanks for the suggestion. I will try this. However, randomly trying different Hyper Parameters may not be the most optimal way to debug. I'm after a more technical approach in...
Running Ubuntu 18.04.2 LTS, Docker 19.03.6 running tensorflow/tensorflow:1.14.0-gpu-py3-jupyter w/ stable_baselines '2.10.0' FWIW I cannot get PPO2 agent to learn CartPole using this Keras Policy 'as is', whereas when I use...