MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
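For readers unfamiliar with RND, here is a minimal sketch of the novelty score it produces. This is not the repository's code. The class name `RND`, the linear predictor, and the `feat_dim` and `lr` values are assumptions chosen for illustration. A fixed random target network maps a state to features, a predictor is trained to match it, and the prediction error is large for rarely visited states.

```python
import numpy as np


class RND:
    """Illustrative Random Network Distillation novelty score (hypothetical helper)."""

    def __init__(self, obs_dim, feat_dim=16, lr=0.5, seed=0):
        rng = np.random.default_rng(seed)
        # Fixed, randomly initialised target network; it is never trained.
        self.W_target = rng.normal(size=(obs_dim, feat_dim))
        # Trainable linear predictor that tries to match the target's output.
        self.W_pred = np.zeros((obs_dim, feat_dim))
        self.lr = lr

    def _error(self, obs):
        # Difference between predictor output and target features.
        return obs @ self.W_pred - np.tanh(obs @ self.W_target)

    def novelty(self, obs):
        # Mean squared prediction error: high for states seen rarely.
        return float(np.mean(self._error(obs) ** 2))

    def update(self, obs):
        # One gradient-descent step on the mean squared error.
        err = self._error(obs)
        grad = 2.0 * np.outer(obs, err) / err.size
        self.W_pred -= self.lr * grad


# Mountain-Car states are (position, velocity); this one is an assumed example.
rnd = RND(obs_dim=2)
state = np.array([-0.5, 0.0])
before = rnd.novelty(state)
for _ in range(200):
    rnd.update(state)
after = rnd.novelty(state)
print(before, after)  # the score shrinks as the state is revisited
```

In the usual setup this score would be added to the environment reward as an intrinsic bonus.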
I think the value that RND returns does not work well as an intrinsic reward in DQN. Instead, I think it can be used to select actions. So the dimension of...

