Results 1 issues of 이강준

*Issue #, if available:* The example code in the fp16 module in `training/distributed_training/pytorch/model_parallel` is strange when training gpt-j or gpt2 in a distributed environment. In the training code, it is...