Samuel
Samuel
Asynchronous-Methods-for-Deep-Reinforcement-Learning
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-...
chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
muzero
A simple implementation of MuZero algorithm for connect4 game
random-memory-adaptation
Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"
tensorflow-image-wavenet
This is a TensorFlow implementation of the WaveNet generative neural network architecture https://deepmind.com/blog/wavenet-generative-model-raw-audio/ for image generation.
tensorflow-tex-wavenet
This is a TensorFlow implementation of the WaveNet generative neural network architecture https://deepmind.com/blog/wavenet-generative-model-raw-audio/ for text generation.
leela-fish
UCI chess playing engine derived from Stockfish and LeelaChess Zero