Rafael Sterzinger
Results
1
issues of
Rafael Sterzinger
Changed the default observation of bandits to 1. By default, almost all biases are set to 0 during initialization. Combining this with inputting always 0 will cause problems during training/testing,...