DQN-tensorflow

The Gym environment has a bug, so training doesn't reach a good reward

Open quhezheng opened this issue 7 years ago • 3 comments

The best reward I get is about 30, and that's all. But when I replace Gym with the ROM directly, the result is very different: a very stable reward of around 300~400.

I don't know exactly what's wrong with Gym.

quhezheng avatar Jun 20 '17 02:06 quhezheng

Could you explain the steps for replacing Gym with the ROM?

isVoid avatar Jul 17 '17 14:07 isVoid

I have run into the same problem: my best reward is only 45 with Gym. If you have found out what's wrong, please let me know. Thanks.

hjchen2 avatar Jul 22 '17 13:07 hjchen2

I think I understand. The newer Gym environments behave differently. If you use 'Breakout-v0' directly, the environment itself already skips frames (around four) between consecutive observations. You should use 'BreakoutNoFrameskip-v0' instead.
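
To make the frame-skip point concrete, here is a rough sketch in plain Python (no Gym needed) that counts how many emulator frames go by per agent decision. It rests on two assumptions. First, that the agent repeats each action 4 times on its own; the value 4 and the helper name `emulator_frames_per_decision` are made up for illustration and not taken from this repo. Second, that the classic 'Breakout-v0' registration samples a skip of 2 to 4 frames per step while the NoFrameskip variants advance exactly 1 frame, which is how I recall it; please check against your installed Gym version.

```python
import random


def emulator_frames_per_decision(env_frameskip, agent_action_repeat, rng=None):
    """Count emulator frames consumed by one agent decision (hypothetical helper).

    env_frameskip: an int for a fixed skip, or a (low, high) tuple meaning the
        environment samples a skip from low..high-1 on every step() call.
    agent_action_repeat: how many times the agent's own code calls step()
        with the same action.
    """
    rng = rng or random.Random(0)
    total = 0
    for _ in range(agent_action_repeat):
        if isinstance(env_frameskip, tuple):
            low, high = env_frameskip
            total += rng.randrange(low, high)  # environment's internal skip
        else:
            total += env_frameskip
    return total


# Assumed agent-side action repeat of 4 (illustrative value).
no_skip = emulator_frames_per_decision(1, 4)          # NoFrameskip-style env
double_skip = emulator_frames_per_decision((2, 5), 4)  # 'Breakout-v0'-style env

print("frames per decision without env skip:", no_skip)
print("frames per decision with env skip:", double_skip)
```

If those assumptions hold, the agent with the skipping environment only gets to act once every 8 to 16 frames instead of every 4, which may be enough to explain why it keeps missing the ball.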

JUZI1 avatar Mar 09 '20 04:03 JUZI1