NaNs in PPO and Clipped PPO agents
Hi, I have tried Clipped PPO agents on several different environments, and after some training time the surrogate loss, KL divergence, and entropy all become NaN. I've tried various hyperparameter settings; this sometimes postpones the crash, but the issue persists. I've run into a similar issue with PPO as well.
Many users have run into a similar problem (e.g., Issue #87). Could you please suggest a solution?