Rafael Sterzinger

Results 1 issues of Rafael Sterzinger

Changed the default observation of bandits to 1. By default, almost all biases are set to 0 during initialization. Combining this with inputting always 0 will cause problems during training/testing,...