Super-mario-bros-PPO-pytorch
Super-mario-bros-PPO-pytorch copied to clipboard
Why is the reward value so designed?
Hello, I'm curious about the logic of reward value design here, can you introduce it?
https://github.com/uvipen/Super-mario-bros-PPO-pytorch/blob/9cd3fe4283331e9232088a19b03518fe94524a2f/src/env.py#L54
https://github.com/uvipen/Super-mario-bros-PPO-pytorch/blob/9cd3fe4283331e9232088a19b03518fe94524a2f/src/env.py#L61
@DannyLee1991 It is my question too!