arjuntemura
Results
1
issues of
arjuntemura
## Bug Description I am running a distributed Linear model (20 parameters) across 2 GPU Nodes, each node having 2 NVIDIA H100 NVL GPUs. The Model uses DDP parallelization strategy....