Jules Gagnon-Marchand

Results 42 comments of Jules Gagnon-Marchand

For Stable-Baselines3 (and, in turn, RL4LMs) to support bfloat16, it would suffice to change `a_tensor_name.cpu().numpy()` to `a_tensor_name.cpu().float().numpy()`. NumPy has no bfloat16 dtype, so the tensor has to be upcast to float32 before the conversion.
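A minimal sketch of that conversion, assuming PyTorch and NumPy are installed. The helper name `to_numpy` is hypothetical and only for illustration; it is not a function from Stable-Baselines3 or RL4LMs, and the actual call sites in those libraries may look different.

```python
import numpy as np
import torch


def to_numpy(tensor: torch.Tensor) -> np.ndarray:
    """Hypothetical helper: convert a tensor to a NumPy array, handling bfloat16."""
    tensor = tensor.detach().cpu()
    # NumPy has no bfloat16 dtype, so upcast to float32 before converting.
    if tensor.dtype == torch.bfloat16:
        tensor = tensor.float()
    return tensor.numpy()


# These values are exactly representable in bfloat16.
x = torch.tensor([1.0, 2.5, -3.0], dtype=torch.bfloat16)
arr = to_numpy(x)
print(arr.dtype, arr)
```

The upcast is applied only to bfloat16 tensors here so that other dtypes (for example int64 action indices) keep their original type. Calling `.float()` unconditionally, as in the one-line change above, would also work wherever every tensor involved is floating point.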

Yes, I ended up making a bunch of other changes.