DQN-Atari
Reward converges to a low value when training on Breakout-v5
Hi, I used this code to train a DQN on Breakout-v5, but the training reward only reaches about 3.5-4. Could you please give some advice on training? I also wonder why changing the environment would cause this problem. Thanks.