dqn-pytorch icon indicating copy to clipboard operation
dqn-pytorch copied to clipboard

Don‘t converge

Open XA-kirino opened this issue 4 years ago • 3 comments

After 400 epochs of training, the total rewards keeps -21.0~-19.0 and takes about 800-1200 steps for a round. Anyone has met this problem? Any idea of solving this? environ: pytorch 1.5 python 3.7 gym 0.17.3 atari_py 0.2.6 ubuntu 16.04

XA-kirino avatar Nov 05 '20 09:11 XA-kirino

I am having this issue as well.

jeffz0 avatar Nov 10 '20 03:11 jeffz0

I am having this issue as well.

You may need to train more epochs like 1k-2k, i've tried with another code and successed.

XA-kirino avatar Nov 10 '20 04:11 XA-kirino

Ok, what other code did you try? Edit: After training overnight, it seems to be working fine.

jeffz0 avatar Nov 10 '20 04:11 jeffz0