easy-rl icon indicating copy to clipboard operation
easy-rl copied to clipboard

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Results 64 easy-rl issues
Sort by recently updated
recently updated
newest added

https://datawhalechina.github.io/easy-rl/#/chapter2/chapter2_questions&keywords Description

Gitalk
/chapter2/chapter2_questions&keywords

可以新增PPO-continuous的demo code么

https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10 Description

Gitalk
/chapter10/chapter10

https://datawhalechina.github.io/easy-rl/#/chapter12/chapter12_questions&keywords Description

Gitalk
/chapter12/chapter12_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter11/chapter11_questions&keywords Description

Gitalk
/chapter11/chapter11_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10_questions&keywords Description

Gitalk
/chapter10/chapter10_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter13/chapter13 Description

Gitalk
/chapter13/chapter13

https://datawhalechina.github.io/easy-rl/#/chapter12/project3 Description

Gitalk
/chapter12/project3

I think that in ppo2.py line119-122, we need to assert "if dones_arr[k]: break" into the for loop. That is because there are data from different episodes in the memory. Is...

在dqn的更新中,为什么没有下面的代码,不用复制策略网络? if self.sample_count % self.target_update == 0: # 每隔一段时间,将策略网络的参数复制到目标网络 self.target_net.load_state_dict(self.policy_net.state_dict())