Add RNN support to torch/PPO and torch/TRPO

Open ryanjulian opened this issue 4 years ago • 3 comments

ryanjulian avatar Feb 29 '20 00:02 ryanjulian

I'm planning to implement the following to add RNN support to PPO/TRPO:

  • [x] RNN: lstm_module
  • [x] RNN policies and their modules for PPO: guassianLSTMModule, guassianLSTMPolicy
  • [x] examples (maybe Pendulum): trpo_lstm, ppo_lstm
  • [ ] test cases for items 1 and 2
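
For readers looking for a starting point, the checklist above could be sketched roughly as follows. This is only a minimal illustration in plain PyTorch, not garage's actual API: the class name `GaussianLSTMPolicySketch`, its constructor arguments, and the Pendulum-like dimensions (3 observations, 1 action) are all assumptions made for the example.

```python
import torch
from torch import nn
from torch.distributions import Normal


class GaussianLSTMPolicySketch(nn.Module):
    """Hypothetical recurrent Gaussian policy (not part of garage)."""

    def __init__(self, obs_dim, action_dim, hidden_dim=32):
        super().__init__()
        # batch_first=True means inputs are (batch, time, obs_dim).
        self.lstm = nn.LSTM(obs_dim, hidden_dim, batch_first=True)
        self.mean_head = nn.Linear(hidden_dim, action_dim)
        # State-independent log standard deviation, one entry per action dim.
        self.log_std = nn.Parameter(torch.zeros(action_dim))

    def forward(self, obs_seq, state=None):
        """Return an action distribution per time step and the next (h, c)."""
        features, next_state = self.lstm(obs_seq, state)
        mean = self.mean_head(features)
        return Normal(mean, self.log_std.exp()), next_state


# Assumed Pendulum-like sizes: 3 observation dims, 1 action dim.
policy = GaussianLSTMPolicySketch(obs_dim=3, action_dim=1)

# Rollout: feed one step at a time and carry the hidden state forward.
state = None  # None makes nn.LSTM start from a zero state.
obs = torch.zeros(1, 1, 3)  # (batch=1, time=1, obs_dim=3)
with torch.no_grad():
    dist, state = policy(obs, state)
    action = dist.sample()

# Training: re-run whole episodes from a zero state so each log-prob is
# conditioned on the same history the policy saw during the rollout.
episodes = torch.randn(4, 10, 3)  # (batch, time, obs_dim)
actions = torch.randn(4, 10, 1)   # (batch, time, action_dim)
dist, _ = policy(episodes)
log_probs = dist.log_prob(actions).sum(dim=-1)  # shape (batch, time)
```

The per-step `log_probs` are what a PPO or TRPO surrogate loss would consume. The main design point for a recurrent policy is that the hidden state is reset at episode boundaries and observations are batched as whole sequences rather than shuffled as independent steps.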

Let me know if I missed anything!

irisliucy avatar Jun 25 '20 07:06 irisliucy

Yes, that sounds good! You should probably pick the more popular of LSTM and GRU.

ryanjulian avatar Jun 25 '20 14:06 ryanjulian

I'm planning to implement the following to add RNN support to PPO/TRPO:

  • [x] RNN: lstm_module
  • [x] RNN policies and their modules for PPO: guassianLSTMModule, guassianLSTMPolicy
  • [x] examples (maybe Pendulum): trpo_lstm, ppo_lstm
  • [ ] test cases for items 1 and 2

Let me know if I missed anything!

@irisliucy Was this ever completed? I can't seem to find the code for it. I'm trying to implement a recurrent policy in PyTorch to use with PPO and can't figure out the API, so it would be helpful to see these examples. Thanks!

nikhilxb avatar Sep 09 '21 14:09 nikhilxb