Actor-Critic-PyTorch
Actor-Critic-PyTorch copied to clipboard
Why is the action selection is random sample in test?
action = policy(state)
Should this not be argmax during test?