Stewart Slocum

2 repositories owned by Stewart Slocum

pong-with-policy-gradients

Stars: 15, Forks: 2

Code for an introductory RL workshop. You'll train a simple agent to play Pong using policy gradients. Adapted from http://karpathy.github.io/2016/05/31/rl/

Learning-to-Optimize

Stars: 18, Forks: 4

An implementation of the 2016 meta-learning paper "Learning to Optimize" from the BAIR lab at Berkeley.