MARLlib
Backpropagation through time for PPO
PPO + LSTM has an extra hyperparameter, the BPTT horizon. Is it possible to set it?
It seems that RLlib 1.8.0 has no native support for configuring the BPTT horizon when an RNN is used as the model base. You could try the latest version of Ray to see whether this has been added.
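For context, here is a minimal sketch of what a BPTT horizon does in principle: a rollout is cut into chunks of at most `horizon` steps, and gradients are only propagated within each chunk. The function name `split_into_bptt_chunks` and its parameters are hypothetical and are not part of MARLlib or RLlib. RLlib's model config does have a `max_seq_len` key that caps the sequence length used when training recurrent models, but whether it serves as a BPTT horizon under MARLlib with RLlib 1.8.0 is not confirmed here.

```python
from typing import List, Tuple


def split_into_bptt_chunks(episode_length: int, horizon: int) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges of at most `horizon` steps.

    Hypothetical helper for illustration only. In truncated BPTT the
    recurrent state would be carried across chunk boundaries, but
    gradients would not flow past them.
    """
    if horizon <= 0:
        raise ValueError("horizon must be positive")
    return [
        (start, min(start + horizon, episode_length))
        for start in range(0, episode_length, horizon)
    ]


# A 10-step rollout with a horizon of 4 yields two full chunks and one remainder.
chunks = split_into_bptt_chunks(episode_length=10, horizon=4)
print(chunks)
```

A smaller horizon makes each update cheaper and less memory-hungry, but limits how far back credit can be assigned through the LSTM.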