contextualbandits
Could using r != 0 make this work for negative rewards as well?
This is about the following line: https://github.com/david-cortes/contextualbandits/blob/7b87efe31ffefdbc95a125d557920ab5b13105c5/contextualbandits/utils.py#L912C13-L912C34
If we replace arms_w_rew = (r > 0.) with arms_w_rew = (r != 0.), then I guess "BootstrappedUCB" and "BootstrappedTS" could also incorporate negative rewards.
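To make the difference concrete, here is a minimal NumPy sketch comparing the two conditions on a made-up reward vector. The values in r are hypothetical, and the sketch only compares the masks themselves; it does not reproduce what utils.py does with arms_w_rew afterwards, so whether this change alone is enough is still an open question.

```python
import numpy as np

# Hypothetical rewards, one per observation (illustrative values only).
r = np.array([1.0, 0.0, -1.0, 0.5, -0.25])

# Condition currently used on the linked line.
mask_positive = r > 0.0

# Condition proposed in this issue.
mask_nonzero = r != 0.0

# The masks disagree exactly where the reward is negative.
differs = mask_positive != mask_nonzero

print(mask_positive)
print(mask_nonzero)
print(r[differs])
```

With r > 0. the negative entries are treated the same as zero reward, while r != 0. flags them together with the positive ones.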