RL_toolbox
RL_toolbox copied to clipboard
reinfore learning tool box, contains trpo, a3c algorithm for continous action space
RL_toolbox
all the algorithm is running on pycharm IDE, or the package loss error may exist.
implemented algorithm: trpo a3c
- a3c:for continous action space, use multi processes, but saving model has not been implemented.
- trpo:for continous and discrete action space
run
- a3c:run a3c/a3c_continous.py in pycharm IDE
- trpo:run experiment/trpo_continous.py in pycharm IDE