Sea-Snell

Results: 1 issue by Sea-Snell

If I use Adafactor with MultiStep on a bfloat16 model, I get this strange error (note the error is extremely long, so I truncated it to fit in the issue;...