Kevin Koehncke

Results 2 comments of Kevin Koehncke

I am also seeing this regression for all variants of Flan-T5 (base, large, XL). Model is just outputting `` repeatedly. We convert correctly to use `bfloat16` as it is a...