Takuya Kato

Results 1 issues of Takuya Kato

**Describe the bug** The loss curve of a training run with the following configuration - virtual-pipeline-parallel-size > 1 - TORCH_NCCL_AVOID_RECORD_STREAMS=1 does not match with the curves of training runs with...