Wrap up the distributed benchmarks and get them running on conda_mast
We would like to introduce basic distributed benchmarking support on synthetic data.
The idea is to wrap the existing single-GPU models in DDP/FSDP, then get them running on conda_mast.
The initial OSS distributed userbenchmark could be used as a starting point:
https://github.com/pytorch/benchmark/tree/main/userbenchmark/distributed
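A minimal sketch of the wrapping step, assuming a torchrun-style launcher and a hypothetical `TinyModel` standing in for a single-GPU TorchBench model (this is not the actual userbenchmark code, just an illustration of wrapping a model in DDP and training on synthetic data):

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP


class TinyModel(nn.Module):
    """Hypothetical stand-in for a single-GPU TorchBench model."""

    def __init__(self, dim: int = 1024):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, x):
        return self.net(x)


def run(local_rank: int):
    # One process per GPU; torchrun sets RANK/WORLD_SIZE/MASTER_ADDR/MASTER_PORT.
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(local_rank)

    model = TinyModel().cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])  # FSDP could be swapped in here

    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = nn.MSELoss()

    # Synthetic data: random inputs/targets, no dataset or dataloader needed.
    x = torch.randn(32, 1024, device=local_rank)
    y = torch.randn(32, 1024, device=local_rank)

    for _ in range(10):  # a few iterations that the benchmark would time
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()

    dist.destroy_process_group()


if __name__ == "__main__":
    run(int(os.environ["LOCAL_RANK"]))
```

Launched with, e.g., `torchrun --nproc_per_node=8 train.py`; the same structure should carry over to a conda_mast launch once the environment is set up.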