Reinforcement-Learning-Pytorch-Cartpole
Reinforcement-Learning-Pytorch-Cartpole copied to clipboard

g6ling

→

Metadata

Simple Cartpole example writed with pytorch.

Reame
Issues

Results 2 Reinforcement-Learning-Pytorch-Cartpole issues

Sort by recently updated

trafficstars

Why compute action from target net rather than online net?

https://github.com/g6ling/Reinforcement-Learning-Pytorch-Cartpole/blob/ecb7b622cfefe825ac95388cceb6752413d90a2a/POMDP/4-R2D2-Single/train.py#L76 Another question : Why do you only store hidden state from target net and not from online net?

jsrimr

Error in 4-R2D2-Single

Hi, I ran your code in 4-R2D2-Single, and got: C:\POMDP\4-R2D2-Single\memory.py:88: RuntimeWarning: invalid value encountered in true_divide prior_mean = abs_td_error_sum / lengths_burn Traceback (most recent call last): File "train.py", line 124,...

ghost

About

Simple Cartpole example writed with pytorch.

pytorch

reinforcement-learning

deep-reinforcement-learning

cartpole

pytorch-cartpole

161

Stars

23

Forks

Watchers

Owner

g6ling

← Metadata

161

Stars

23

Forks

Watchers

Owner

g6ling

Metadata

Simple Cartpole example writed with pytorch.

Back

Reinforcement-Learning-Pytorch-Cartpole Reinforcement-Learning-Pytorch-Cartpole copied to clipboard

Metadata

Why compute action from target net rather than online net?

Error in 4-R2D2-Single

← Metadata

Owner

Metadata

Reinforcement-Learning-Pytorch-Cartpole
Reinforcement-Learning-Pytorch-Cartpole copied to clipboard