Takuya Kato
Results
1
issues of
Takuya Kato
**Describe the bug** The loss curve of a training run with the following configuration - virtual-pipeline-parallel-size > 1 - TORCH_NCCL_AVOID_RECORD_STREAMS=1 does not match with the curves of training runs with...