multiagent-particle-envs NN code

NN code

Open abeerM opened this issue 4 years ago • 0 comments

According to the paper "our policies are parameterized by a two-layer ReLU MLP with 64 units per layer. To support discrete communication messages, we use the Gumbel-Softmax estimator [14]." However, I could not find it in the code! The policy is hardcoded (policy.py )based on the keyboard input, so what if my environment does not require input from the user

Appreciate explaining that point

Jan 12 '21 14:01 abeerM

multiagent-particle-envs multiagent-particle-envs copied to clipboard

NN code

multiagent-particle-envs
multiagent-particle-envs copied to clipboard