Reinforcement-Learning-Pytorch-Cartpole
Simple CartPole example written in PyTorch.
https://github.com/g6ling/Reinforcement-Learning-Pytorch-Cartpole/blob/ecb7b622cfefe825ac95388cceb6752413d90a2a/POMDP/4-R2D2-Single/train.py#L76 Another question: why do you store only the hidden state from the target net and not from the online net?
Hi, I ran your code in 4-R2D2-Single and got:
C:\POMDP\4-R2D2-Single\memory.py:88: RuntimeWarning: invalid value encountered in true_divide
  prior_mean = abs_td_error_sum / lengths_burn
Traceback (most recent call last):
  File "train.py", line 124,...
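
For anyone hitting the same warning: NumPy reports "invalid value encountered" when a division produces NaN, most commonly 0 / 0. One possible cause here, and this is only a guess since the traceback is cut off, is that some entry of lengths_burn is 0 while the matching abs_td_error_sum is also 0. A minimal sketch of a guarded division follows. The helper name `safe_mean` and the sample values are made up for illustration and are not from the repository:

```python
import numpy as np

def safe_mean(total, count):
    """Element-wise total / count, yielding 0.0 wherever count is 0.

    Hypothetical helper, not part of the repository's memory.py.
    """
    total = np.asarray(total, dtype=np.float64)
    count = np.asarray(count, dtype=np.float64)
    # Pre-fill the result with zeros; positions where count == 0 keep that value.
    out = np.zeros(np.broadcast(total, count).shape, dtype=np.float64)
    np.divide(total, count, out=out, where=count != 0)
    return out

# Made-up values: the first sequence has a zero-length segment.
abs_td_error_sum = np.array([0.0, 3.0, 4.5])
lengths_burn = np.array([0, 2, 3])

prior_mean = safe_mean(abs_td_error_sum, lengths_burn)
print(prior_mean)  # [0.  1.5 1.5]
```

Whether 0.0 is the right fallback priority for an empty segment depends on how the replay memory uses prior_mean, so treat the default as an assumption to revisit.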