Stewart Slocum
Results
2
repositories owned by
Stewart Slocum
pong-with-policy-gradients
15
Stars
2
Forks
Watchers
Code for an intro to RL workshop. You'll be training a simple agent to play pong using policy gradients. Adapted from http://karpathy.github.io/2016/05/31/rl/
Learning-to-Optimize
18
Stars
4
Forks
Watchers
An implementation of the 2016 meta-learning paper "Learning to Optimize" from the BAIR lab at Berkeley.