PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA issues

4 issues, sorted by recently updated

Memory leak

It seems that running `COMA2.py` results in a memory leak: the program consumes all available memory after some episodes.
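One way to check a report like this is to record traced memory after each episode and see whether it keeps climbing. The sketch below uses the standard-library `tracemalloc` module. `run_episode` and `measure_growth` are hypothetical stand-ins, not functions from `COMA2.py`, and the leak here is simulated on purpose. `tracemalloc` only sees allocations made through Python's allocator, so memory held by PyTorch tensor storage would likely need a different tool.

```python
import tracemalloc


def run_episode(store):
    # Hypothetical stand-in for one training episode. It deliberately keeps
    # data in a list to imitate a buffer that is never cleared.
    store.append([0.0] * 10_000)


def measure_growth(num_episodes=5):
    # Record the currently traced memory (in bytes) after every episode.
    tracemalloc.start()
    store = []
    sizes = []
    for _ in range(num_episodes):
        run_episode(store)
        current, _peak = tracemalloc.get_traced_memory()
        sizes.append(current)
    tracemalloc.stop()
    return sizes


sizes = measure_growth()
print(sizes)  # steadily increasing values suggest something is being retained
```

If the per-episode numbers flatten out after a warm-up period, the growth is probably not coming from Python-level objects.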

Did you forget about the reward from the second agent?

Hi... I'm new to COMA and PyTorch. I read your code, and it's impressive and useful. However, I saw that at line 165 you didn't include the reward from the second agent....

FSNStefan
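For context on the question above: COMA is formulated for fully cooperative tasks in which all agents share one global reward. The excerpt does not show what line 165 actually does, so the following is only a minimal sketch of one way to fold per-agent rewards into a single team signal. `team_reward` and the example values are hypothetical and are not taken from the repo.

```python
def team_reward(agent_rewards):
    # Assumption: the environment returns one scalar reward per agent and
    # the shared team reward is their sum. Other aggregations are possible.
    return sum(agent_rewards)


# Hypothetical rewards for two agents on one step.
step_rewards = [1.0, 0.0]
shared = team_reward(step_rewards)
print(shared)
```

Using only the first agent's reward would hide any step where only the second agent is rewarded, which appears to be the concern raised in the issue.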

Leverage the use of recurrent modules

Hi, I am wondering if the oscillation of the training phase comes from the fact that you only include down-sampling layers in your actor nets, since in partially observable domains,...

Fernadoo

There is no negative reward for the environment used by this repo?

Hi! I found that the env_FindGoals used in this repo is different from the env_FindGoals in your other repo. The environment used by this repo will only give positive rewards,...

SunnyWangGitHub
