Reinforcement-Learning icon indicating copy to clipboard operation
Reinforcement-Learning copied to clipboard

Accuracy improvement possible?

Open ruze00 opened this issue 2 years ago • 1 comments

I'm running the code verbatim but not finding the results which might be expected. For example, running ping_pong_a2c results in barely any improvement after more than 8,000 runs, while I would expect a good level of accuracy (at least > 0 score) by 5,000 iterations or so based on other people reporting results based on using RL with Atari/Pong.

image

Is there something I'm missing? Do the hyperparameters need to be tuned rather than run as is?

Thank you for creating the code base.

ruze00 avatar May 23 '22 17:05 ruze00

No, it does not converge. I spent days on this code to debug why but couldn't drill down to the exact issue. Use the openAi gym wrappers to manipulate the frames

allohvk avatar Nov 08 '22 07:11 allohvk