DQN-Atari Reward converges at a low value while training in Breakout-v5

Reward converges at a low value while training in Breakout-v5

Open edwardelric1202 opened this issue 2 years ago • 0 comments

Hi, I used this code to train a dqn in Breakout-v5, but found the reward in training just reach 3.5-4, could you please give some advice of training? I wonder why the change of environment can bring about such problem, thanks.

Dec 01 '22 03:12 edwardelric1202

DQN-Atari DQN-Atari copied to clipboard

Reward converges at a low value while training in Breakout-v5

DQN-Atari
DQN-Atari copied to clipboard