pysc2-rl-mini The factor of value loss

The factor of value loss

Open JIElite opened this issue 6 years ago • 0 comments

In worker.py, we need to sample experience by using local model and then backpropagate the gradients of total loss. The total loss is composed of loss_of_actor, loss_of_critic and entropy.

In line 132, you have already multiply 0.5 to value_loss_vb, but in line 152 you to that again. Does it mean the factor of critic loss in total loss is 0.25? Is it what you want to do?

Thank you. :)

Mar 09 '18 12:03 JIElite

pysc2-rl-mini pysc2-rl-mini copied to clipboard

The factor of value loss

pysc2-rl-mini
pysc2-rl-mini copied to clipboard