deepQN_tensorflow deepQN doesn't learn

deepQN doesn't learn

Open shendiaomo opened this issue 8 years ago • 5 comments

I've tried this program on a Mac, the agent never got total score more than 10..

Is there some well-trained model as a demo?

Mar 10 '16 04:03 shendiaomo

Yeah It never get score higher than 10~20 because I removed fixed target network which is essential to prevent oscilation and reach high score. I'm working on new version and it's almost done. I'll upload that soon!

Mar 12 '16 03:03 gliese581gg

Great news!

Mar 14 '16 03:03 shendiaomo

It seems in the paper of 2015 there was no target network in the algorithm, and this code is based on this version.

In the paper of 2016, they say that a target network is necessary. This is what you called the new version, isn't is?

But now I'm very curious how the deepmind people made the first version work...

Mar 14 '16 10:03 shendiaomo

target network is necessary for learning.

Mar 21 '16 02:03 mrkulk

If someone is interested, I have a working implementation in tensorflow that can reach similar score (in the only game I've tested) to DeepMind's paper.

Jun 06 '16 07:06 ppwwyyxx

deepQN_tensorflow deepQN_tensorflow copied to clipboard

deepQN doesn't learn

deepQN_tensorflow
deepQN_tensorflow copied to clipboard