pytorch-REINFORCE
pytorch-REINFORCE copied to clipboard
PyTorch Implementation of REINFORCE for both discrete & continuous control
PyTorch REINFORCE
data:image/s3,"s3://crabby-images/88615/88615ff57bfc2d027a0322bda97ab81070163ca2" alt=""
PyTorch implementation of REINFORCE.
This repo supports both continuous and discrete environments in OpenAI gym.
Requirement
- python 2.7
- PyTorch
- OpenAI gym
- Mujoco (optional)
Run
Use the default hyperparameters. (Program will detect whether the environment is continuous or discrete)
python main.py --env_name [name of environment]
Experiment results
continuous: InvertedPendulum-v1
data:image/s3,"s3://crabby-images/dd4f6/dd4f65fa1485901e1f2f843b1464bdf43c8867d8" alt=""
discrete: CartPole-v0
data:image/s3,"s3://crabby-images/f5a00/f5a00840af87c0ed180d609bb5177958010989f3" alt=""