tianshou issues

Does the MultiAgentPolicyManager support other policies, e.g. DiscreteSACPolicy?

2

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from...

zhangwenjun1229

question

SAC + LSTM

1

- I have visited the [source website](https://github.com/thu-ml/tianshou/) - I have searched through the [issue tracker](https://github.com/thu-ml/tianshou/issues) for duplicates - I have mentioned version numbers, operating system and environment, where applicable: ```python...

jaried

bug

Question about logging

1

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from...

SoMuchSerenity

question

Using a wrapper or mask gives great training results but terrible testing results

8

Hello, I have a question when using a `gym.ObservationWrapper` for training and testing as a mask. There are many actions in my env and some of them are unavailable in...

lsylusiyao

question
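For the action-masking question above, here is a minimal sketch of one common way to keep unavailable actions from being selected: set the scores of masked-out actions to negative infinity before taking the argmax. This is plain Python, not Tianshou's API. The function name `masked_greedy_action` and the sample values are hypothetical, and the issue's own wrapper may work differently.

```python
import math


def masked_greedy_action(q_values, mask):
    """Return the index of the highest-scoring action that the mask allows.

    Hypothetical helper for illustration only; not part of Tianshou.
    """
    if len(q_values) != len(mask):
        raise ValueError("q_values and mask must have the same length")
    if not any(mask):
        raise ValueError("mask must allow at least one action")
    # Unavailable actions get -inf so they can never win the argmax.
    masked = [q if ok else -math.inf for q, ok in zip(q_values, mask)]
    return max(range(len(masked)), key=masked.__getitem__)


# Action 1 has the highest raw score but is unavailable in this state.
q = [0.2, 1.5, -0.3, 0.9]
mask = [True, False, True, True]
best = masked_greedy_action(q, mask)
```

If masking is used, the same mask logic presumably has to be applied in both the training and the testing environments so that the policy sees consistent inputs.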

Does tianshou support multi-agent parallel env besides pettingzoo aec env?

3

In most multi-agent scenarios, e.g., sc2, dota2, agents should execute actions simultaneously instead of step by step. So the parallel environment where the step function receives actions of all ready...

ZiyiLiubird

enhancement

Question about DRQN

5

https://github.com/thu-ml/tianshou/blob/f13e415eb0de55baca5dc0d6fae39d6a38e8bc0b/tianshou/policy/modelfree/dqn.py#L167 It seems `state` is not used during training even when specifying a recurrent net. Am I missing something, or is it expected?

leao1995

bug

not reproduced yet

RNN

Implement Decision Transformer for offline RL

5

- [x] I have marked all applicable categories: + [ ] exception-raising fix + [ ] algorithm implementation fix + [ ] documentation modification + [x] new feature - [x]...

nuance1979

enhancement

blocked

Processing the PGPolicy's raw output

1

I want to sample multiple actions from the network's raw output, so I removed the softmax option from the original network and divided the output vector into N equal portions. Then...

963141377

question
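The idea described in the question above (split the raw output into N equal portions and draw one action from each) can be sketched as follows. This is a plain-Python illustration under assumptions, not the poster's code and not PGPolicy's API. The names `softmax` and `sample_per_portion` are hypothetical, and since the original post is truncated, applying a softmax to each portion is an assumption about what comes next.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_per_portion(raw_output, n_portions, rng=random):
    """Split raw_output into n_portions equal chunks and sample one index per chunk.

    Hypothetical helper for illustration only; not part of Tianshou.
    """
    if n_portions <= 0 or len(raw_output) % n_portions != 0:
        raise ValueError("raw_output length must be divisible by n_portions")
    size = len(raw_output) // n_portions
    actions = []
    for i in range(n_portions):
        chunk = raw_output[i * size:(i + 1) * size]
        # Each chunk is normalized independently (assumed, see lead-in).
        probs = softmax(chunk)
        actions.append(rng.choices(range(size), weights=probs, k=1)[0])
    return actions


# Six raw outputs split into two portions of three actions each.
raw = [0.1, 2.0, -1.0, 0.5, 0.5, 3.0]
sampled = sample_per_portion(raw, n_portions=2, rng=random.Random(0))
```

Each entry of `sampled` is an index local to its portion, so with N portions of size K the result is a list of N integers in the range 0 to K-1.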

Log info, not just reward

5

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from...

vadim0x60

enhancement

How to use Tianshou with GPU and custom environment

3

- [x] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the...

robkuehl

question
