pytorch/mmdetection distributed training with multi-machines with hydra

Open ytzhao opened this issue 3 years ago • 0 comments

Hi all,

I'm new to Hydra, and I've run into a problem while developing my own project.

My project is based on mmdetection, which has its own YAML configuration system, but I am working on integrating Hydra into the project. Training the model requires distributed training (not just data parallelism). Is there a tutorial or documentation on how to do distributed training across multiple machines with Hydra?

Thanks all ;-)

ytzhao • Feb 18 '22 07:02