RavnaBergsndot issues

Repositories
Issues
Comments

Results 1 issues of


                                            RavnaBergsndot

The loss function

There's a difference between reinforcement and supervised learning in the AGZ paper. The paper mentioned that although for the reinforcement version, the loss function is like "loss = action_loss +...