bert
bert copied to clipboard
how to train BERT in bfloat16 mode on TPUs?
Hello, Has anyone tried to train a BERT model using bfloat16? Thanks, Li