pymarl Reward function design

Reward function design

Open PMatthaei opened this issue 4 years ago • 0 comments

I am currently wondering about the reward design. The implemented function resembles a global reward function and these should be normalized by the amount of agents. This seems to happen here with respect to the total reward obtainable by kills and round win. But what about the battle reward? It includes rewards from kills, deaths, damage taken and dealt. Am I missing it or how are damage taken, damage dealt and deaths normalized over the amount of considered agents?

Sep 13 '20 09:09 PMatthaei

pymarl pymarl copied to clipboard

Reward function design

pymarl
pymarl copied to clipboard