tensorflow-rl-pong
tensorflow-rl-pong copied to clipboard
Pong AI trained using policy gradient-based reinforcement learning
Results
1
tensorflow-rl-pong issues
Sort by
recently updated
recently updated
newest added
Hi, I am really stuck at the discount_rewards function. Can you explain the logic behind discount_rewards function. It seems its updating the rewards in forward direction