Pavel

Results: 1 issue by Pavel

**Describe the bug** When training a model with contrastive denoising (DN) enabled under DistributedDataParallel (DDP), a deadlock can occur at `reduce_dict(loss_dict)` if some ranks receive empty targets (i.e.,...
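The preview above is cut off, so the exact cause is not stated here. One plausible mechanism, assumed purely for illustration, is that a rank with empty targets skips the DN losses, so its `loss_dict` has fewer keys than the other ranks and the collective reduction waits on values that never arrive. The sketch below simulates that situation in plain Python, without `torch`. The loss names, `EXPECTED_KEYS`, `pad_loss_dict`, and `keys_match_across_ranks` are all hypothetical and not taken from the issue.

```python
from typing import Dict, List

# Hypothetical loss names; the real keys depend on the model's criterion.
EXPECTED_KEYS: List[str] = ["loss_ce", "loss_bbox", "loss_ce_dn", "loss_bbox_dn"]


def pad_loss_dict(loss_dict: Dict[str, float], expected_keys: List[str]) -> Dict[str, float]:
    """Return a new dict holding exactly the expected keys, in a fixed order.

    Missing entries are filled with 0.0 so that every rank would contribute
    the same number of values to a collective reduction. Keys that are not
    in expected_keys are dropped.
    """
    return {key: loss_dict.get(key, 0.0) for key in expected_keys}


def keys_match_across_ranks(per_rank: List[Dict[str, float]]) -> bool:
    """Check whether all simulated ranks would reduce the same set of keys."""
    key_lists = [sorted(d) for d in per_rank]
    return all(keys == key_lists[0] for keys in key_lists)


# Simulated per-rank outputs. Assumption: rank 1 received empty targets,
# so its criterion skipped the denoising (DN) losses.
rank0 = {"loss_ce": 0.7, "loss_bbox": 0.3, "loss_ce_dn": 0.5, "loss_bbox_dn": 0.2}
rank1 = {"loss_ce": 0.9, "loss_bbox": 0.4}

mismatch_before = not keys_match_across_ranks([rank0, rank1])
padded = [pad_loss_dict(d, EXPECTED_KEYS) for d in (rank0, rank1)]
mismatch_after = not keys_match_across_ranks(padded)

print("mismatch before padding:", mismatch_before)
print("mismatch after padding:", mismatch_after)
```

In a real training loop the padding values would need to be tensors on the correct device rather than Python floats, and whether zero-filling is the right fix depends on how the criterion and DDP handle parameters that receive no gradient on a given rank.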