random-network-distillation-pytorch
Why are action values incremented by 1 for the Breakout game?
Hi, is the following code modifying the actions for Breakout in order to eliminate the NOOP action from the set of actions available to the agent?
envs.py:
if 'Breakout' in self.env_id:
    action += 1
train.py:
if 'Breakout' in env_id:
    output_size -= 1
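
To make the question concrete, here is a minimal sketch of how the two snippets appear to interact. It assumes Breakout's action list is NOOP, FIRE, RIGHT, LEFT (what Gym's get_action_meanings() typically reports for this game). The helper names policy_output_size and to_env_action and the env id are hypothetical, written only for illustration, and are not taken from the repository.

```python
# Assumption: Breakout's minimal action set, in index order.
BREAKOUT_ACTIONS = ["NOOP", "FIRE", "RIGHT", "LEFT"]


def policy_output_size(env_id, n_env_actions):
    """Hypothetical helper mirroring the train.py snippet."""
    output_size = n_env_actions
    if 'Breakout' in env_id:
        output_size -= 1  # policy head gets one fewer output
    return output_size


def to_env_action(env_id, action):
    """Hypothetical helper mirroring the envs.py snippet."""
    if 'Breakout' in env_id:
        action += 1  # shift policy index past index 0
    return action


env_id = "BreakoutNoFrameskip-v4"  # assumed env id, for illustration only
n_out = policy_output_size(env_id, len(BREAKOUT_ACTIONS))

# Every action the policy can emit, translated to the name the env sees.
reachable = [BREAKOUT_ACTIONS[to_env_action(env_id, a)] for a in range(n_out)]
print(n_out, reachable)
```

Under that assumption, the policy's indices 0..2 map to environment actions 1..3, so index 0 (NOOP) can never be selected, which is consistent with the reading in the question.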