rl-agents
rl-agents copied to clipboard
The problem of reward performance
Hello, thank you very much for open sourcing such a great project. I am running the code:
python experiments.py evaluate configs/IntersectionEnv/env.json using the command
configs/IntersectionEnv/agents/DQNAgent/baseline.json
--train --episodes=4000 --name-from-config,
the reward graph I get is unstable. I hope to get your help, thanks a lot!
hi,I also encountered the same problem. In tensorboard, all of my curves did not converge. Have you solved this problem now?
Perhaps I have found a solution. Adjust the smoothness index in tensorboard