nict-wisdom
Results
1
issues of
nict-wisdom
We tried to run Mesh-TensorFlow to train T5 on GPUs following the instructions on T5's repository, but the training is extremely slow. > global_step/sec: 0.0467347 > examples/sec: 0.186939 The training...