PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
Is there no negative reward in the environment used by this repo?
Hi! I noticed that the env_FindGoals used in this repo is different from the env_FindGoals in your other repo. Does the environment in this repo give only positive rewards and never negative ones?