maddpg
maddpg copied to clipboard
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
When I set display = True I get the error below. These are the versions I have for the known dependencies: Python (3.6) OpenAI gym (0.10.5) tensorflow (1.8.0) numpy (1.14.5)...
I just run maddpg in `simple_speaker_listener` several times,but none of them get the -20 avg-reward like the paper proposed. Are there anything i should modify to get a better or...
It's interesting that the code decomposes the output of actor network as the mean and the standard deviation, and then constructs a new action with a gaussian distribution. In past,...
I am having trouble when I run python3 train.py --scenario simple File "train.py", line 12, in import maddpg ModuleNotFoundError: No module named 'maddpg'
When I used 4 agents of MADDPG to solve my own problems, the cumulative rewards I got did not improve. Sometimes the rewards gradually decreased. What is the reason?
Will appreciate it if this repository have a nice and short description.
Hello! I am working to implement MADDPG in pytorch based on the details of this implementation in tensorflow. I have followed the implementation to a tee, but I when I...
In the comparison plots, we see MADDPG being compared to DDPG algorithm. As far as I know, DDPG can only be used with continuous action space. But the experiments with...