Counterfactual-Multi-Agent-Policy-Gradients
Counterfactual-Multi-Agent-Policy-Gradients copied to clipboard
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
 
 I repeat running the code and sometimes the learning curve drops during training.