easy-rl
easy-rl copied to clipboard

Published 20 hours ago •

→

Metadata

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Reame
Issues

Results 64 easy-rl issues

Sort by recently updated

/chapter2/chapter2_questions&keywords

5

comment

https://datawhalechina.github.io/easy-rl/#/chapter2/chapter2_questions&keywords Description

Gitalk

/chapter2/chapter2_questions&keywords

add PPO-continuous code

1

comment

可以新增PPO-continuous的demo code么

/chapter10/chapter10

1

comment

https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10 Description

Gitalk

/chapter10/chapter10

/chapter12/chapter12_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter12/chapter12_questions&keywords Description

Gitalk

/chapter12/chapter12_questions&keywords

/chapter11/chapter11_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter11/chapter11_questions&keywords Description

Gitalk

/chapter11/chapter11_questions&keywords

/chapter10/chapter10_questions&keywords

https://datawhalechina.github.io/easy-rl/#/chapter10/chapter10_questions&keywords Description

Gitalk

/chapter10/chapter10_questions&keywords

/chapter13/chapter13

https://datawhalechina.github.io/easy-rl/#/chapter13/chapter13 Description

Gitalk

/chapter13/chapter13

/chapter12/project3

1

comment

https://datawhalechina.github.io/easy-rl/#/chapter12/project3 Description

Gitalk

/chapter12/project3

PPO advantage calculation

I think that in ppo2.py line119-122, we need to assert "if dones_arr[k]: break" into the for loop. That is because there are data from different episodes in the memory. Is...

dqn算法问题

在dqn的更新中，为什么没有下面的代码，不用复制策略网络？ if self.sample_count % self.target_update == 0: # 每隔一段时间，将策略网络的参数复制到目标网络 self.target_net.load_state_dict(self.policy_net.state_dict())

‹
1
2
3
4
5
6
7
›

About

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

reinforcement-learning

deep-reinforcement-learning

q-learning

imitation-learning

dqn

a3c

ddpg

double-dqn

dueling-dqn

ppo

policy-gradient

sarsa

td3

easy-rl

8.4k

Stars

1.7k

Forks

Watchers

Owner

← Metadata

8.4k

Stars

1.7k

Forks

Watchers

Owner

Metadata

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/