dopamine
dopamine copied to clipboard
Incredibly high score for Defender
Has anyone tried running Defender with dopamine? I noticed that baselines of Defender are not provided so I was trying to run one, but the scores are incredibly high.
Here are the reference scores in Rainbow:
And this is what I get with C51:

I had the same problem... By the way, I found there is no Surround environment in OpenAI gym. How can I reproduce the results of IQN and C51 on this environment?
Hello, I think this is due to a bug in the underlying ALE library. The reward are multiplied by 100 without reason compared to the true score of the actual game. See mgbellemare/Arcade-Learning-Environment#262 for more details.
Hello, I think this is due to a bug in the underlying ALE library. The reward are multiplied by 100 without reason compared to the true score of the actual game. See mgbellemare/Arcade-Learning-Environment#262 for more details.
Hi, thanks for the information! I wonder how all previous works got their results though.....