awsome-distributed-training icon indicating copy to clipboard operation
awsome-distributed-training copied to clipboard

Extra containerized nccl tests

Open verdimrc opened this issue 10 months ago • 1 comments

Issue #, if available: N/A

Description of changes: sample .sbatch scripts to run nccl tests under containers. Two variants: native implementation, and a pure pytorch-based (that some of our customers have been using for benchmarking).

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

verdimrc avatar May 03 '24 02:05 verdimrc