arjuntemura

Results 1 issues of arjuntemura

## Bug Description I am running a distributed Linear model (20 parameters) across 2 GPU Nodes, each node having 2 NVIDIA H100 NVL GPUs. The Model uses DDP parallelization strategy....