easy-rl
Chinese-language reinforcement learning tutorial (the Mushroom Book 🍄). Read online: https://datawhalechina.github.io/easy-rl/
https://datawhalechina.github.io/easy-rl/#/chapter2/chapter2_questions&keywords
Could demo code for PPO-continuous be added?
https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10
https://datawhalechina.github.io/easy-rl/#/chapter12/chapter12_questions&keywords
https://datawhalechina.github.io/easy-rl/#/chapter11/chapter11_questions&keywords
https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10_questions&keywords
https://datawhalechina.github.io/easy-rl/#/chapter13/chapter13
https://datawhalechina.github.io/easy-rl/#/chapter12/project3
I think that in ppo2.py, lines 119-122, we need to insert "if dones_arr[k]: break" into the for loop, because the memory contains data from different episodes. Is...
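A minimal sketch of the concern raised above, not the actual contents of ppo2.py: it assumes the advantage is a GAE-style discounted sum of TD errors over a buffer that may hold several episodes back to back. The function name `gae_advantages`, its parameters, and the choice to bootstrap with 0 at the end of the buffer are all assumptions made for illustration. Only the `if dones[k]: break` idea comes from the issue.

```python
def gae_advantages(rewards, values, dones, gamma=0.99, lam=0.95):
    """Hypothetical GAE-style advantages over a buffer with mixed episodes."""
    n = len(rewards)
    advantages = [0.0] * n
    for t in range(n):
        discount = 1.0
        a_t = 0.0
        for k in range(t, n):
            # Assumption: no bootstrap value after a terminal step or at the
            # end of the buffer.
            is_last = k + 1 >= n
            next_value = 0.0 if (dones[k] or is_last) else values[k + 1]
            delta = rewards[k] + gamma * next_value - values[k]
            a_t += discount * delta
            # The proposed fix: stop accumulating once the episode that
            # contains step t has ended, so later episodes do not leak in.
            if dones[k]:
                break
            discount *= gamma * lam
        advantages[t] = a_t
    return advantages


# Two episodes of two steps each, stored in one buffer.
rewards = [1.0, 1.0, 1.0, 1.0]
values = [0.0, 0.0, 0.0, 0.0]
dones = [False, True, False, True]
result = gae_advantages(rewards, values, dones, gamma=1.0, lam=1.0)
print(result)
```

With the break in place, the first step only sums rewards from its own episode. Without it, the inner loop would keep adding TD errors from the second episode to the first episode's advantages.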
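In the DQN update, why is the code below missing? Is there no need to copy the policy network?
    if self.sample_count % self.target_update == 0:  # every so often, copy the policy network's parameters to the target network
        self.target_net.load_state_dict(self.policy_net.state_dict())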
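The periodic hard update this question refers to can be sketched without any deep-learning library. In the sketch below, plain dicts stand in for the two networks' parameters, and `TinyAgent`, `policy_params`, `target_params`, and the fake "gradient step" are hypothetical names invented for illustration. Where `sample_count` is incremented in the real code is not shown in the excerpt, so incrementing it inside `update` is an assumption.

```python
import copy


class TinyAgent:
    """Hypothetical stand-in for a DQN agent; dicts play the role of networks."""

    def __init__(self, target_update=4):
        self.policy_params = {"w": 0.0}
        # The target starts as an exact copy of the policy parameters.
        self.target_params = copy.deepcopy(self.policy_params)
        self.target_update = target_update
        self.sample_count = 0

    def update(self):
        # Assumption: the counter advances once per update call.
        self.sample_count += 1
        # Pretend gradient step: only the policy parameters change.
        self.policy_params["w"] += 1.0
        # Every `target_update` steps, copy policy parameters into the target.
        if self.sample_count % self.target_update == 0:
            self.target_params = copy.deepcopy(self.policy_params)


agent = TinyAgent(target_update=4)
history = []
for _ in range(6):
    agent.update()
    history.append((agent.policy_params["w"], agent.target_params["w"]))
print(history)
```

Between copies the target stays frozen while the policy keeps changing, which is the point of keeping a separate target network. In PyTorch the copy step would be the `load_state_dict` call quoted in the question.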