async-rl Update train.py

Update train.py

Open pavitrakumar78 opened this issue 8 years ago • 2 comments

added python 2.x compatibility added workaround for breakout and space invaders to correct the number of moves Note: this workaround just tells the acting methods that the move space will be 4 instead of the default 6. So, the network will be making a prediction amongst 0,1,2,3 which corresponds to ['NOOP', 'FIRE', 'RIGHT', 'LEFT'] - thus, we ignore the RIGHTFIRE and LEFTFIRE which are not needed for breakout.

Jan 09 '17 16:01 pavitrakumar78

I think it's useful to have Python 2 support but workarounds for action_space are not necessary. If you remove them I will merge this PR. Thanks! :)

Jan 14 '17 13:01 Grzego

That is just one line on the top! :+1:
I removed the workarounds. You can test the code again if you want! :)

Jan 15 '17 09:01 pavitrakumar78

async-rl async-rl copied to clipboard

Update train.py

async-rl
async-rl copied to clipboard