div-garg
Hi, we only use the `expert_rewards` for SQIL, where the expert gets a reward of 1 and the policy gets a reward of 0. Storing fake rewards of 1 for the expert data...
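
To illustrate the SQIL reward scheme mentioned above, here is a minimal sketch. It is not the repository's actual code: the function name `sqil_rewards` and the `is_expert` flags are hypothetical, and it assumes each transition in a batch is tagged by whether it came from the expert data or the policy.

```python
from typing import List


def sqil_rewards(is_expert: List[bool]) -> List[float]:
    """Assign SQIL-style rewards: 1.0 for expert transitions, 0.0 for policy ones.

    `is_expert` is a hypothetical per-transition flag, used here for illustration.
    """
    return [1.0 if flag else 0.0 for flag in is_expert]


# Hypothetical mixed batch: two expert transitions followed by three policy ones.
batch_is_expert = [True, True, False, False, False]
rewards = sqil_rewards(batch_is_expert)
```

With this labeling the reward depends only on where a transition came from, so any reward stored alongside the expert data would be a constant 1.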