Ryan Angi
Results
1
comments of
Ryan Angi
Should this feature request also include the ability to use the IPS/DR estimator for evaluating the average PV loss of a policy (offline) using `--cb_type mtr` in the learning algorithm?