Reinforcement-learning-with-tensorflow

DPPO not converging

Open ghost opened this issue 6 years ago • 0 comments

I tried your DPPO algorithm with EP_MAX = 8000, and the total moving reward is not converging. Any idea why?

ghost, Dec 09 '18 17:12