Multi_Task_RL icon indicating copy to clipboard operation
Multi_Task_RL copied to clipboard

Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow

Multi_Task_RL

Multi Domain and Multi Task Deep Reinforcement Learning for Continuous Control - Using Hard parameter sharing Deep Neural Networks ("Multi Headed Network") as the policy and value function approximator to enable a single Reinforcement Learning agent to learn multi tasks and domains in parallel.

Trained using Proximal Policy Optimization (PPO) and KL-divergence constraint from TRPO. Environments from Open AI Gym.

David C. Alvarez-Charris, Chenyang Zhao, Timothy Hospedales University of Edinburgh - 2018 Dissertation for MSc in Artificial Intelligence