Bobak Hashemi

Results 4 issues of Bobak Hashemi

I ran into [this issue](https://github.com/huggingface/transformers/issues/15065) when installing `transformers[flax]` which seems to depend on older versions of `tokenizers`. I do have the latest `tokenizers` available installed via pip, but attempting to...

This PR is just a WIP to show the approach. As of now, I am training on the webgpt dataset, there are three main items I still need to accomplish...

ml

#78 As per discussion with @theblackcat102 I built the rankgen trainer on top of their framework ([wandb](https://wandb.ai/bobakhashemi/reward-model/runs/2sgitozt?workspace=user-bobakhashemi)). The model seems to be training now in fp32. Apparently t5 has some...

ml

Based on #313 , we are having issues with model evaluation in the reward model trainer (code in model/ranking). It seems that the evaluation results are not being computed and...

ml