Multi-Agent-Deep-Deterministic-Policy-Gradients
Shouldn't it be agent.actor.forward() and calculate actor_loss?
https://github.com/philtabor/Multi-Agent-Deep-Deterministic-Policy-Gradients/blob/a3c294aa6834f348a7401306dff3e67919c861f5/maddpg.py#L83
Dear Phil,
First of all, many thanks for your lessons; I've learned a lot from your lectures. I noticed something at line 83 of the MADDPG class, where the actor loss is calculated: it runs a forward pass of the critic network instead of the actor network. I believe this is a typo; please correct me if I'm wrong.
Thanks and regards, Viji
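
For reference, in the standard DDPG formulation the actor loss is the negative mean of the critic's Q-value evaluated at the actor's own actions, so the question comes down to where the actions passed to the critic come from. The following is a minimal, dependency-free sketch of that idea and not the repository's code: `actor_forward`, `critic_forward`, `actor_loss`, and `train_actor` are hypothetical names, the "networks" are toy scalar functions, and a finite-difference gradient stands in for autograd.

```python
# Toy, hypothetical stand-ins for the actor and critic (scalar state and action).
# This is an illustrative sketch, not the code from maddpg.py.

def actor_forward(w, state):
    """Deterministic policy: action = w * state (w is the only actor parameter)."""
    return w * state


def critic_forward(state, action):
    """Fixed toy critic: Q is highest when action == 2 * state."""
    return -(action - 2.0 * state) ** 2


def actor_loss(w, states):
    """Negative mean Q of the actor's own actions (DDPG-style actor objective)."""
    q_values = [critic_forward(s, actor_forward(w, s)) for s in states]
    return -sum(q_values) / len(q_values)


def train_actor(w, states, lr=0.1, steps=200, eps=1e-6):
    """Gradient descent on the actor loss. A central finite difference
    replaces autograd so the sketch needs no external libraries."""
    for _ in range(steps):
        grad = (actor_loss(w + eps, states) - actor_loss(w - eps, states)) / (2 * eps)
        w -= lr * grad
    return w


states = [0.5, -1.0, 1.5]
w_trained = train_actor(w=0.0, states=states)
print(round(w_trained, 3))
```

In this sketch the critic's forward pass appears inside the actor loss, but the loss still depends on the actor parameter `w` because the critic is fed the actor's output. Under that assumption, the thing to check at line 83 is whether the actions given to the critic were produced by the actor's forward pass.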