rl_games
rl_games copied to clipboard
VDN
This PR includes Tarun's VDN implementation, alongside a partial implementation of PPO-S (where the bits left to do have to do with network architecture issues for the centralized critic)