Deep-Reinforcement-Learning-Practice issues

Results 4 Deep-Reinforcement-Learning-Practice issues

Sort by recently updated

GPU

Is it ok to run the model on gpu with a3c algorithm?. If you know a2c algorithm, please tell me is that better than a3c on gpu?. Thank you

nguyenviettuan96

Will you update your sample codes to Tensorflow 2.0?

Thanks for your contribution.

shtse8

How double dqn update parameters?

Hello，I find in double dqn file, there is no updated parameters function like this： $$ \theta^{-}=\alpha \theta^{-}+(1-\alpha) \theta $$ can you tell me why and what differences between this two?...

ynuwm

PPO随机策略

请问对于连续控制任务，如果可选的动作action有多个（假设6个），PPO采用随机策略其actor最后一层的输出是什么？

davinca

Deep-Reinforcement-Learning-Practice
Deep-Reinforcement-Learning-Practice copied to clipboard

Metadata

GPU

Will you update your sample codes to Tensorflow 2.0?

How double dqn update parameters?

PPO随机策略

← Metadata

Owner

Metadata

Deep-Reinforcement-Learning-Practice Deep-Reinforcement-Learning-Practice copied to clipboard

Metadata

GPU

Will you update your sample codes to Tensorflow 2.0?

How double dqn update parameters?

PPO随机策略

← Metadata

Owner

Metadata

Deep-Reinforcement-Learning-Practice
Deep-Reinforcement-Learning-Practice copied to clipboard