random-network-distillation-pytorch Action values are incremented by 1 for the Breakout game ?

Action values are incremented by 1 for the Breakout game ?

Open cangozpi opened this issue 1 year ago • 0 comments

Hi, Is the reason for the following code modifying the actions for the breakout game is eliminating the NOOP action from the available set of actions that can be taken by the agent ? envs.py:

if 'Breakout' in self.env_id: 
    action += 1

train.py:

if 'Breakout' in env_id:
    output_size -= 1

Feb 23 '24 14:02 cangozpi

random-network-distillation-pytorch random-network-distillation-pytorch copied to clipboard

Action values are incremented by 1 for the Breakout game ?

random-network-distillation-pytorch
random-network-distillation-pytorch copied to clipboard