jtromans comments

Results: 4 comments by jtromans

NAN problem in PPO1 and PPO2

I also got this problem with PPO2. I note that in issue #340 the entropy coefficient was to blame. More concretely, the OP suggested that it was too high at a...

Thanks for catching that. I've corrected it, but I still get the same issue. What would be the appropriate approach for debugging this? ![image](https://user-images.githubusercontent.com/1130383/78510898-82b33500-7790-11ea-9e77-0c8b3ff94270.png) ![image](https://user-images.githubusercontent.com/1130383/78510931-af674c80-7790-11ea-9e17-907514d458d1.png)

NAN problem in PPO1 and PPO2

Thanks for the suggestion. I will try this. However, randomly trying different hyperparameters may not be the best way to debug. I'm after a more technical approach in...

[question] Using keras in Custom Policy

Running Ubuntu 18.04.2 LTS, Docker 19.03.6, with tensorflow/tensorflow:1.14.0-gpu-py3-jupyter and stable_baselines '2.10.0'. FWIW, I cannot get the PPO2 agent to learn CartPole using this Keras policy 'as is', whereas when I use...