Reinforcement-learning-with-tensorflow
Reinforcement-learning-with-tensorflow copied to clipboard

Published 20 hours ago •

DPPO not converging

Open ghost opened this issue 6 years ago • 0 comments

I tried your DPPO algorithm with EP_MAX = 8000 and the total moving reward is not converging. Any Idea why ?

Dec 09 '18 17:12 ghost