rl_algorithms
rl_algorithms copied to clipboard
I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really understanding them.
Because I should know how to fix the current bugs in my code. - [ ] DDPG - [ ] TRPO
Let's leave Python 2.7 behind and make everything 3.5+ for this repository. If I need to go back to Python 2.7, make a virtualenv.
I need to make this code more modular and flexible, and take full advantage of Python's features. The policies right now are kind of hard-coded awkwardly. Look at `modular_rl` and...
It turns out that the G-learning paper doesn't use the episodic setting (at least for the cliff-world setting, which is my main concern). Let's write a new cliff-world environment which...