maddpg
maddpg copied to clipboard
Why ‘u_action_space = spaces.Discrete(world.dim_p * 2 + 1)’?
Hi, I dont understand 'u_action_space = spaces.Discrete(world.dim_p * 2 + 1)' I know that action[0] is the communication, but why dim_p needs to mutiply 2
I second this question
I think they reduce it to small steps in simple directions, so you can move in 2 times the dimension. If it is a 2D world. you can move left, right top bot, if it were a 3d in +-i, +-j, +-z and I assume the extra action is to stay quiet.