Ryan Angi

Results 1 comments of Ryan Angi

Should this feature request also include the ability to use the IPS/DR estimator for evaluating the average PV loss of a policy (offline) using `--cb_type mtr` in the learning algorithm?