rl_algorithms issues

Results 5 rl_algorithms issues

Fix DDPG and TRPO

Because I should know how to fix the current bugs in my code.

- [ ] DDPG
- [ ] TRPO

DanielTakeshi

Asynchronous Advantage Actor-Critic

I need that algorithm implemented here!!!

DanielTakeshi

enhancement

Make everything Python3

Let's leave Python 2.7 behind and make everything 3.5+ for this repository. If I need to go back to Python 2.7, I can make a virtualenv.

DanielTakeshi

Better modularity for policies

I need to make this code more modular and flexible, and take full advantage of Python's features. The policies right now are kind of hard-coded awkwardly. Look at `modular_rl` and...

DanielTakeshi

enhancement

G-learning, test with infinite horizon

It turns out that the G-learning paper doesn't use the episodic setting (at least for the cliff-world setting, which is my main concern). Let's write a new cliff-world environment which...

DanielTakeshi
