fairseq icon indicating copy to clipboard operation
fairseq copied to clipboard

How to use fairseq-hydra-train with multi-nodes?

Open Snowdar opened this issue 2 years ago • 2 comments

❓ Questions and Help

I do not plan to use srun and just start the training on two machines by hands. But how to use fairseq-hydra-train with multi-nodes? Configure the yaml only or use torchrun? Please help and thanks very much!

Snowdar avatar Jan 28 '22 12:01 Snowdar

This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!

stale[bot] avatar Apr 28 '22 22:04 stale[bot]

❓ Questions and Help

I do not plan to use srun and just start the training on two machines by hands. But how to use fairseq-hydra-train with multi-nodes? Configure the yaml only or use torchrun? Please help and thanks very much!

Here is my solution. Hope it will be useful.

weiyx16 avatar Sep 07 '22 02:09 weiyx16