async-rl

Are the action (x4) semantics different?

Open mw66 opened this issue 8 years ago • 1 comments

Hi,

I just noticed:

https://github.com/muupan/async-rl/blob/master/ale.py#L115

Is each training action applied to the game environment 4 times?

E.g. the user presses 'down' once, but in your simulated training the environment takes the 'down' action 4 times!

I wonder why? And will the results differ from the original paper?

mw66 avatar Jan 08 '17 05:01 mw66

The original paper also uses action repeating. See Section 8, "Experimental Setup", in http://arxiv.org/abs/1602.01783.

muupan avatar Jan 25 '17 10:01 muupan
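
For readers unfamiliar with the term, here is a minimal sketch of action repeating as discussed above: the agent picks one action, and that same action is applied to the environment for several consecutive frames, with the rewards summed. This is not the code from `ale.py`; `CountingEnv` and `repeat_action` are hypothetical names invented for illustration, and the toy environment simply returns a reward of 1 per frame.

```python
class CountingEnv:
    """Hypothetical toy environment (an assumption for illustration only).

    Every low-level frame yields a reward of 1.0, and the episode ends
    once `max_frames` frames have been played.
    """

    def __init__(self, max_frames=10):
        self.max_frames = max_frames
        self.frames = 0

    def step(self, action):
        # The action is ignored by this toy environment; a real emulator
        # would advance the game by one frame using it.
        self.frames += 1
        reward = 1.0
        done = self.frames >= self.max_frames
        return reward, done


def repeat_action(env, action, n_repeat=4):
    """Apply `action` for up to `n_repeat` consecutive frames.

    Returns the summed reward and whether the episode ended. The loop
    stops early if the episode terminates partway through the repeats.
    """
    total_reward = 0.0
    done = False
    for _ in range(n_repeat):
        reward, done = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward, done


if __name__ == "__main__":
    env = CountingEnv(max_frames=10)
    # One agent decision ('down') becomes four emulator frames.
    print(repeat_action(env, "down"))
```

From the agent's point of view, one decision then corresponds to `n_repeat` emulator frames, which is why a single 'down' is sent to the environment 4 times.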