Leihuan42
> > Training on a single 3090 fails with the error below. My training command is `python train.py -c configs/dfine/dfine_hgnetv2_l_coco.yml`. Is there a configuration option to turn off the distributed functionality? In other words, can D-FINE be trained on a single GPU?
> >
> > Traceback (most recent call last): File "/workspace/D-FINE/src/nn/backbone/hgnetv2.py", line 498, in `__init__`: `if torch.distributed.get_rank() == 0:` File "/opt/conda/lib/python3.10/site-packages/torch/distributed/distributed_c10d.py", line 1173, in `get_rank`: `default_pg = _get_default_group()` File "/opt/conda/lib/python3.10/site-packages/torch/distributed/distributed_c10d.py", line 707, in `_get_default_group`: `raise RuntimeError(` RuntimeError: Default process group has not...
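The traceback points at an unguarded `torch.distributed.get_rank()` call, which raises when no process group has been initialized, as happens in a plain single-process run. A minimal sketch of one way to guard such a call is shown below. It is not taken from the D-FINE repository: the helper name `get_rank_or_zero` and its `dist` parameter are hypothetical, and only `is_available()`, `is_initialized()`, and `get_rank()` are real `torch.distributed` functions.

```python
def get_rank_or_zero(dist=None):
    """Return the distributed rank, or 0 when running as a single process.

    `dist` is a hypothetical injection point for illustration; by default
    the real `torch.distributed` module is used.
    """
    if dist is None:
        import torch.distributed as dist  # imported lazily, only when needed

    # get_rank() raises if the default process group was never initialized,
    # so check both conditions before calling it.
    if dist.is_available() and dist.is_initialized():
        return dist.get_rank()
    return 0  # treat a non-distributed run as rank 0
```

Under that assumption, a check such as `if torch.distributed.get_rank() == 0:` could be written as `if get_rank_or_zero() == 0:`, so the same branch runs on a single GPU without a process group.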