tianshou icon indicating copy to clipboard operation
tianshou copied to clipboard

An elegant PyTorch deep reinforcement learning library.

Results 215 tianshou issues
Sort by recently updated
recently updated
newest added

- [ ] I have marked all applicable categories: + [ ] exception-raising fix + [X] algorithm implementation fix + [X] documentation modification + [X] new feature - [X] I...

- [x] I have marked all applicable categories: + [ ] exception-raising bug + [x] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.")...

bug
RNN

- [x] I have marked all applicable categories: + [ ] exception-raising bug + [x] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.")...

bug
RNN

- [X] I have marked all applicable categories: + [ ] exception-raising bug + [X] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.")...

enhancement

DQN has mask mechanism:https://github.com/thu-ml/tianshou/blob/master/tianshou/policy/modelfree/dqn.py in forward Function. But. the pg strategy does not seem to have a related mechanism for mask processing. If so, can it be added? Below is...

enhancement

- [X] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the...

enhancement
RNN

When I tried to train Pendulum-v0 with a recurrent-style SAC, the policy didn't improve, while it worked fine with a MLP model. The curves of the training process in Tensorboard...

question
RNN

- [x] I have marked all applicable categories: + [x] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.")...

question

## A question - I want to build a small scale version of **Open AI Five** - And I learnt that it uses LSTM + PPO - suppose I build...

question
RNN

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from...

question
RNN