fairseq
fairseq copied to clipboard
How to use fairseq-hydra-train with multi-nodes?
❓ Questions and Help
I do not plan to use srun and just start the training on two machines by hands. But how to use fairseq-hydra-train with multi-nodes? Configure the yaml only or use torchrun? Please help and thanks very much!
This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!
❓ Questions and Help
I do not plan to use srun and just start the training on two machines by hands. But how to use fairseq-hydra-train with multi-nodes? Configure the yaml only or use torchrun? Please help and thanks very much!
Here is my solution. Hope it will be useful.