rl-agents icon indicating copy to clipboard operation
rl-agents copied to clipboard

The problem of reward performance

Open Louise599 opened this issue 9 months ago • 2 comments

Hello, thank you very much for open sourcing such a great project. I am running the code: python experiments.py evaluate configs/IntersectionEnv/env.json using the command
configs/IntersectionEnv/agents/DQNAgent/baseline.json
--train --episodes=4000 --name-from-config, the reward graph I get is unstable. I hope to get your help, thanks a lot! 屏幕截图 2024-05-08 111045

Louise599 avatar May 08 '24 03:05 Louise599

hi,I also encountered the same problem. In tensorboard, all of my curves did not converge. Have you solved this problem now?

kongxincaizi avatar Jun 05 '24 02:06 kongxincaizi

Perhaps I have found a solution. Adjust the smoothness index in tensorboard

kongxincaizi avatar Jun 05 '24 04:06 kongxincaizi