awsome-distributed-training icon indicating copy to clipboard operation
awsome-distributed-training copied to clipboard

Organize SM-modelparallelv2 per orchestrator

Open mhuguesaws opened this issue 5 months ago • 0 comments

In current form, there are various files without specific orchestrator. This issue to organize per orchestrator:

  • kubernets/train.yaml
  • slurm/train.sbatch

mhuguesaws avatar Sep 20 '24 16:09 mhuguesaws