submitit
Fallback to slurm for TorchDistributedEnv
From my understanding, this change enables someone not using submitit to still retrieve the environment variables that are normally set by torchrun.
Can torchrun be used from Python, and not only from the command line?
This seems a bit weird to me: this is a helper function inside submitit, so I would expect it to be relevant only when used together with submitit. Maybe what we need to do instead is see whether we can set up those env vars in user code (maybe by using torchrun?).
I'm fine with it living in user code. Then again, with only a couple of changed lines we can easily accommodate more use cases without duplicating code, which has some upsides too :)
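For reference, here is a rough sketch of what the user-code fallback discussed above might look like. It is not the code from this PR and not submitit's API: `DistEnv` and `resolve_dist_env` are hypothetical names made up for illustration. The only things it relies on are the variables torchrun exports (RANK, LOCAL_RANK, WORLD_SIZE) and the per-task variables Slurm sets (SLURM_PROCID, SLURM_LOCALID, SLURM_NTASKS). MASTER_ADDR and MASTER_PORT are left out on purpose, since Slurm has no direct equivalent for them.

```python
import os
from typing import Mapping, NamedTuple, Optional


class DistEnv(NamedTuple):
    """Hypothetical container for the values a distributed job needs."""

    rank: int
    local_rank: int
    world_size: int


def resolve_dist_env(environ: Optional[Mapping[str, str]] = None) -> DistEnv:
    """Read torchrun-style variables, falling back to Slurm's per-task ones.

    Illustrative helper only; the name and signature are assumptions.
    """
    env = os.environ if environ is None else environ

    # Preferred source: variables exported by torchrun.
    if "RANK" in env and "WORLD_SIZE" in env:
        return DistEnv(
            rank=int(env["RANK"]),
            local_rank=int(env.get("LOCAL_RANK", 0)),
            world_size=int(env["WORLD_SIZE"]),
        )

    # Fallback: variables Slurm sets for each task of a job step.
    if "SLURM_PROCID" in env and "SLURM_NTASKS" in env:
        return DistEnv(
            rank=int(env["SLURM_PROCID"]),
            local_rank=int(env.get("SLURM_LOCALID", 0)),
            world_size=int(env["SLURM_NTASKS"]),
        )

    raise RuntimeError("Neither torchrun nor Slurm variables were found")
```

Passing a mapping explicitly, instead of always reading os.environ, keeps the helper easy to test and makes the lookup order obvious: torchrun values win when both sets are present.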