QasimWani/RL-Unity: Implementation of Deep Reinforcement Learning algo...

Implementation of STOA Deep Reinforcement Learning (DRL) algorithms in the Unity Engine.

Algorithms:

Value Based Method - Deep Q Network (DQN)
Policy Based Method - Deep Deterministic Policy Gradient (DDPG)
Multi Agent Reinforcement Learning using MADDPG

Outputs:

Value Based Method - DQN

In this Unity environment, the goal of the agent is to pick up yellow bananas while avoiding blue bananas.

Rolling Scores

Policy Based Method - DDPG

In this Unity environment, the goal of the agent is to move the double-jointed arm to the target location indicated by the torquoise sphere. This video demonstrates a more practical approach of the Reacher Unity environment.

Rolling Scores

Multi-Agent RL - MADDPG

In this Unity environment, the goal of the agent is to maximize the rally between the two tennis agents, i.e. as the two agents pass the ball to each other without dropping, the higher the reward.

Rolling Scores

RL-Unity
RL-Unity copied to clipboard

Metadata

Implementation of STOA Deep Reinforcement Learning (DRL) algorithms in the Unity Engine.

Algorithms:

Outputs:

Value Based Method - DQN

Policy Based Method - DDPG

← Metadata

Owner

Metadata

RL-Unity RL-Unity copied to clipboard

Metadata

Implementation of STOA Deep Reinforcement Learning (DRL) algorithms in the Unity Engine.

Algorithms:

Outputs:

Value Based Method - DQN

Policy Based Method - DDPG

← Metadata

Owner

Metadata

RL-Unity
RL-Unity copied to clipboard