deepQN_tensorflow
deepQN_tensorflow copied to clipboard
deepQN doesn't learn
I've tried this program on a Mac, the agent never got total score more than 10..
Is there some well-trained model as a demo?
Yeah It never get score higher than 10~20 because I removed fixed target network which is essential to prevent oscilation and reach high score. I'm working on new version and it's almost done. I'll upload that soon!
Great news!
It seems in the paper of 2015 there was no target network in the algorithm, and this code is based on this version.
In the paper of 2016, they say that a target network is necessary. This is what you called the new version, isn't is?
But now I'm very curious how the deepmind people made the first version work...
target network is necessary for learning.
If someone is interested, I have a working implementation in tensorflow that can reach similar score (in the only game I've tested) to DeepMind's paper.