OpenAI
OpenAI
weightnorm
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"
vime
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
neural-gpu
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"
neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
InfoGAN
Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"
supervised-reptile
Code for the paper "On First-Order Meta-Learning Algorithms"