Jiayi Weng comments

Results: 303 comments of Jiayi Weng

Add reward shaping and policy shaping to DQN

You're right. Reward shaping can also be implemented in a `gym.Wrapper` or in `collector.preprocess_fn` (if F_D is treated as part of the environment). Currently we don't have any plan. But you...
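As a rough illustration of the wrapper approach mentioned above, here is a minimal, dependency-free sketch. It assumes the older 4-tuple `step` convention `(obs, reward, done, info)`; `RewardShapingWrapper`, `CountingEnv`, and `shaping_fn` are hypothetical names, and `shaping_fn` is only a stand-in for a shaping term such as F_D, not the definition from the paper. With Gym installed, the same logic would presumably live in a subclass of `gym.Wrapper`.

```python
class RewardShapingWrapper:
    """Wraps any env exposing reset() and step(action) -> (obs, reward, done, info)."""

    def __init__(self, env, shaping_fn):
        self.env = env
        # Hypothetical shaping term: shaping_fn(prev_obs, action, next_obs) -> float
        self.shaping_fn = shaping_fn
        self._last_obs = None

    def reset(self):
        # Remember the first observation so the next step can compute the bonus.
        self._last_obs = self.env.reset()
        return self._last_obs

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # Add the shaping bonus on top of the environment's own reward.
        bonus = self.shaping_fn(self._last_obs, action, obs)
        self._last_obs = obs
        return obs, reward + bonus, done, info


class CountingEnv:
    """Toy environment (assumption): observation is a step counter, reward is 0."""

    def __init__(self, horizon=3):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        return self.t, 0.0, self.t >= self.horizon, {}
```

Because the shaping happens inside the wrapper, the learning algorithm only ever sees the shaped reward and needs no changes of its own.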

Add reward shaping and policy shaping to DQN

Because in section `3.3 Policy Shaping` the authors don't claim that this algorithm is only compatible with DQN. My understanding is that this policy shaping method can be adapted to...

Model-based algorithm?

For the first question: yes, our group has some previous model-based work that we would like to refactor and merge. I also have an RL work related to combining model-free...

Centralized learning-decentralized execution for multiagent settings

@youkaichao

speed testing in case of parallel/distributed computation

> Does tianshou support distributed computation (e.g., on a cluster) for collecting experience? Currently we mainly support small-to-medium-sized experiments on a single node, but you can use `RayVectorEnv`...

speed testing in case of parallel/distributed computation

> Hi @Trinkle23897, > > I am running an experiment with Tianshou in a computer with multiple CPU cores and 2 GPUs. Is it possible to take advantage of that...

speed testing in case of parallel/distributed computation

> By changing from VectorEnv to RayVectorEnv and using the specific arguments will Tianshou take care of the optimization? VectorEnv is not optimized. It uses a plain for-loop (typically for debugging)....
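To make the "for-loop" remark concrete, here is a small sketch of what a sequential vectorized environment can look like. It is not Tianshou's actual implementation: `LoopVectorEnv` and `ToyEnv` are hypothetical names, and the 4-tuple `step` return is an assumption.

```python
class LoopVectorEnv:
    """Hypothetical sketch: steps each sub-environment one after another."""

    def __init__(self, env_fns):
        # env_fns is a list of zero-argument callables, one per sub-environment.
        self.envs = [fn() for fn in env_fns]

    def reset(self):
        return [env.reset() for env in self.envs]

    def step(self, actions):
        # Sequential loop: total time is the sum of all sub-environment step times.
        results = [env.step(a) for env, a in zip(self.envs, actions)]
        obs, rewards, dones, infos = map(list, zip(*results))
        return obs, rewards, dones, infos


class ToyEnv:
    """Toy environment (assumption): observation is offset + step count."""

    def __init__(self, offset):
        self.offset = offset
        self.t = 0

    def reset(self):
        self.t = 0
        return self.offset

    def step(self, action):
        self.t += 1
        return self.offset + self.t, float(action), self.t >= 2, {}
```

Since every `step` call runs one sub-environment at a time, nothing executes in parallel, which is why such a loop is mostly useful for debugging rather than for speed.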

Please consider Hydra

Wow, that's great! Sure, we will consider providing some example scripts using Hydra. Thanks for sharing!

Can running and training be separated?

Our code currently does not support this method. But we will consider your suggestion.

Vizdoom and pytorch crashed when import together

I replaced torch-0.4 with torch-0.3.1 and the import succeeded. So is this a bug in PyTorch 0.4?