Orbit
Orbit copied to clipboard
Open source collection of Reinforcement Learning Environments.
Results
3
Orbit issues
Sort by
recently updated
recently updated
newest added
The epsilon decay in the code is under the module `agent.replay()` which is called every step, making the epsilon rapidly decline during the first episode. I don't know if this...
all progress related to Cannon