MADDPG
MADDPG copied to clipboard

Published 20 hours ago •

→

Metadata

Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Reame
Issues

Results 1 MADDPG issues

Sort by recently updated

大佬请问这边当我先更新critic时再更新actor（论文是这样的）这个会报因为inplace操作导致梯度的更新失败。。真的改不出来了

self.critic_optim.zero_grad() critic_loss.backward() self.critic_optim.step() self.actor_optim.zero_grad() actor_loss.backward() self.actor_optim.step() 当我把这个顺序调整后，这个会报错：因为inplace操作导致梯度的更新失败。。感激了

xiaobingbuhuitou

About

Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

450

Stars

77

Forks

Watchers

Owner

starry-sky6688

← Metadata

450

Stars

77

Forks

Watchers

Owner

starry-sky6688

Metadata

Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Back

MADDPG MADDPG copied to clipboard

Metadata

大佬请问这边当我先更新critic时再更新actor（论文是这样的）这个会报因为inplace操作导致梯度的更新失败。。真的改不出来了

← Metadata

Owner

Metadata

MADDPG
MADDPG copied to clipboard