Kevin Koehncke
I am also seeing this regression for all variants of Flan-T5 (base, large, XL). The model just outputs `` repeatedly. We convert correctly to use `bfloat16` as it is a...