dopamine icon indicating copy to clipboard operation
dopamine copied to clipboard

1% of exploratory actions vs 10% for DQN

Open Wovchena opened this issue 4 years ago • 0 comments

Hi baselines/README.md states:

Unsurprisingly, DQN performs much better when trained with 1% of exploratory actions instead of 10% (as used in the original Nature paper).

Why is that unsurprising? Is there an explanation why DQN should be trained with 1% of exploratory actions instead of 10%? Or is this is just an empirical result which appears for most of DQN implementations?

Wovchena avatar Mar 02 '20 21:03 Wovchena