tensorflow-rl-pong

Pong AI trained using policy gradient-based reinforcement learning

Results 1 tensorflow-rl-pong issues

Hi, I am really stuck on the discount_rewards function. Can you explain the logic behind it? It seems to be updating the rewards in the forward direction.
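
For anyone reading along, here is a minimal sketch of what a function like discount_rewards usually computes in policy-gradient code. It is not copied from this repository: the signature, the default discount_factor, and the helper discount_rewards_forward are assumptions made for illustration, and it treats the input as a single episode with no reset between Pong rounds. The idea is that each timestep t is assigned the sum of its own reward plus all later rewards, each scaled by discount_factor raised to how far in the future it is. That sum can be built by looping forward from every t, or by a single backward pass; both should give the same numbers.

```python
def discount_rewards(rewards, discount_factor=0.99):
    """Backward pass: G[t] = rewards[t] + discount_factor * G[t + 1]."""
    discounted = [0.0] * len(rewards)
    running_sum = 0.0
    # Walk from the last timestep to the first so each step can reuse
    # the discounted return already computed for the step after it.
    for t in reversed(range(len(rewards))):
        running_sum = rewards[t] + discount_factor * running_sum
        discounted[t] = running_sum
    return discounted


def discount_rewards_forward(rewards, discount_factor=0.99):
    """Hypothetical forward version: for each t, sum the future rewards directly."""
    discounted = []
    for t in range(len(rewards)):
        total = 0.0
        discount = 1.0
        # Look ahead from t; a reward k - t steps later is scaled by
        # discount_factor ** (k - t).
        for k in range(t, len(rewards)):
            total += rewards[k] * discount
            discount *= discount_factor
        discounted.append(total)
    return discounted


# A single reward of 1.0 at the last step, with discount_factor = 0.5
rewards = [0.0, 0.0, 1.0]
print(discount_rewards(rewards, discount_factor=0.5))          # [0.25, 0.5, 1.0]
print(discount_rewards_forward(rewards, discount_factor=0.5))  # [0.25, 0.5, 1.0]
```

So a loop that runs forward over t is not necessarily pushing rewards forward in time; if it has an inner loop that looks ahead from t, it is still crediting earlier actions with later rewards. The backward version is just the cheaper way to get the same result.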