Organize SM-modelparallelv2 per orchestrator
In its current form, the SM-modelparallelv2 test case contains various files that are not grouped by orchestrator. This issue tracks organizing them per orchestrator:
- kubernetes/train.yaml
- slurm/train.sbatch
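
For reference, a minimal sketch of what `slurm/train.sbatch` might look like under this layout. This is an assumption, not the repository's actual script: it presumes a `torchrun`-based launch and a placeholder entry point named `train.py`; node counts, GPU counts, and script names would need to match the real SM-modelparallelv2 test case.

```bash
#!/bin/bash
#SBATCH --job-name=smp-v2-train      # hypothetical job name
#SBATCH --nodes=2                    # number of training nodes (adjust as needed)
#SBATCH --ntasks-per-node=1          # one launcher task per node
#SBATCH --gres=gpu:8                 # assumes 8 GPUs per node
#SBATCH --exclusive
#SBATCH --output=logs/%x_%j.out

# Use the first node of the allocation as the rendezvous endpoint
MASTER_ADDR=$(scontrol show hostnames "$SLURM_JOB_NODELIST" | head -n 1)
MASTER_PORT=29500

# Launch one torchrun per node; train.py is a placeholder for the
# actual SMP v2 training entry point.
srun torchrun \
    --nnodes="$SLURM_NNODES" \
    --nproc_per_node=8 \
    --rdzv_backend=c10d \
    --rdzv_endpoint="${MASTER_ADDR}:${MASTER_PORT}" \
    train.py
```

The `kubernetes/train.yaml` counterpart would carry the equivalent launch configuration for the Kubernetes orchestrator, so each orchestrator directory is self-contained.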