DQN-tensorflow: The Gym environment has a bug, so training doesn't give a good reward

Open quhezheng opened this issue 8 years ago • 3 comments

With Gym, the best reward I get is about 30, and that's all. After replacing Gym with the ROM directly, the results are very different: a very stable reward of around 300~400.

I don't know exactly what's wrong with Gym.

Jun 20 '17 02:06 quhezheng

Could you explain the steps for replacing Gym with the ROM?

Jul 17 '17 14:07 isVoid

I have encountered the same problem: my best reward is only 45 with Gym. If you have found out what's wrong, please let me know. Thanks.

Jul 22 '17 13:07 hjchen2

I think I understand. The newer Gym environments behave differently. If you use 'Breakout-v0' directly, the environment itself already skips frames (up to four) on every step, so frames in between are dropped. You should use 'BreakoutNoFrameskip-v0' instead.

Mar 09 '20 04:03 JUZI1
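
To illustrate the frame-skipping point above, here is a minimal sketch of how environment-side frame skipping can compound with an agent's own action repeat. It does not use Gym or the real emulator: `ToyEmulator` and `frames_per_decision` are hypothetical names made up for this example, the agent-side action repeat of 4 is an assumption (a common DQN setting, not confirmed for this repository's configuration), and the `(2, 5)` range is assumed to mirror how older Gym releases sampled the frame skip for the plain `-v0` Atari IDs.

```python
import random


class ToyEmulator:
    """Hypothetical stand-in for an Atari emulator; it only counts raw frames."""

    def __init__(self, frameskip):
        # frameskip is either a fixed int or a (low, high) pair; with a pair,
        # the number of frames per step is sampled from low..high-1.
        self.frameskip = frameskip
        self.frames = 0

    def step(self, action):
        if isinstance(self.frameskip, int):
            n = self.frameskip
        else:
            n = random.randrange(self.frameskip[0], self.frameskip[1])
        self.frames += n  # the same action is applied for n raw frames
        return n


def frames_per_decision(frameskip, action_repeat, decisions=1000, seed=0):
    """Average number of raw frames consumed per agent decision."""
    random.seed(seed)
    emu = ToyEmulator(frameskip)
    for _ in range(decisions):
        # Agent-side action repeat: call step() several times per decision.
        for _ in range(action_repeat):
            emu.step(0)
    return emu.frames / decisions


# No environment-side skipping (like a NoFrameskip variant), agent repeats 4x.
noskip = frames_per_decision(1, action_repeat=4)

# Environment skips 2-4 frames per step (assumed v0 behaviour), agent repeats 4x.
v0_like = frames_per_decision((2, 5), action_repeat=4)

print("NoFrameskip-like:", noskip, "raw frames per decision")
print("v0-like:", v0_like, "raw frames per decision")
```

Under these assumptions the v0-like setup advances the game by several times more raw frames per decision than intended, which would be consistent with the low rewards reported in this thread. Creating the environment with `gym.make('BreakoutNoFrameskip-v0')` leaves all frame skipping to the agent.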
