Chantal Pellegrini
Batch evaluation also gave me NaN values after fine-tuning. Running the evaluation in bfloat16 instead of float16 solved it for me (I had also fine-tuned with bf16 True). Like this,...
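The comment does not say why float16 fails here. One plausible explanation, offered as an assumption rather than a diagnosis, is dynamic range: float16 overflows to infinity above roughly 65504, and arithmetic on infinities yields NaN, while bfloat16 keeps the float32 exponent range at lower precision. The standard-library sketch below only emulates the two formats to illustrate that difference; the helper names `to_float16` and `to_bfloat16` are hypothetical, and the bfloat16 helper truncates instead of rounding the way real hardware does.

```python
import math
import struct


def to_float16(x: float) -> float:
    """Round x to IEEE 754 half precision; out-of-range values become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack finite values beyond the float16 range,
        # so mimic the hardware behaviour of overflowing to infinity.
        return math.copysign(math.inf, x)


def to_bfloat16(x: float) -> float:
    """Emulate bfloat16 by keeping only the top 16 bits of the float32 encoding.

    Simplification: this truncates the mantissa instead of rounding to nearest.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]


# A value that is ordinary in float32 but too large for float16.
activation = 70000.0

half = to_float16(activation)    # overflows to inf
brain = to_bfloat16(activation)  # stays finite, with reduced precision

print("float16 :", half, "-> difference:", half - half)
print("bfloat16:", brain, "-> difference:", brain - brain)
```

In float16 the value becomes infinity and `inf - inf` is NaN, whereas the bfloat16 value stays finite. In PyTorch the corresponding dtypes are `torch.float16` and `torch.bfloat16`; whether this is the actual cause in a given model would need to be checked against its activations.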