
train.py: error: unrecognized arguments: --local-rank=0

Open davidvct opened this issue 1 year ago • 6 comments

I get this error when trying to train on the GoPro dataset with: python -m torch.distributed.launch --nproc_per_node=1 --master_port=4321 train.py -opt options/train/GoPro/NAFNet-width32.yml --launcher pytorch

I searched train.py and it does not define a --local-rank argument.

How can I fix this?

davidvct avatar Jan 23 '24 06:01 davidvct

Add this in train: [image]

txy00001 avatar Feb 21 '24 05:02 txy00001

Change

parser.add_argument('--local_rank', type=int, default=0)

To

parser.add_argument('--local-rank', type=int, default=0)

And I didn't add

os.environ['RANK'] = str(0)

sentinel8b avatar Apr 24 '24 01:04 sentinel8b

Thanks. When I tried to use torchrun, it reported "can not open python: no such file". After following your change, it works!

rp7sv avatar May 10 '24 10:05 rp7sv
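
For anyone landing here later: the rename suggested above can also be written so that both spellings are accepted. This matters because newer PyTorch launchers pass --local-rank while older ones pass --local_rank. The snippet below is only a minimal sketch, not the actual NAFNet train.py code. The helper name parse_local_rank is hypothetical, and the fallback to the LOCAL_RANK environment variable assumes the script may also be started with torchrun, which sets that variable instead of passing a flag.

```python
import argparse
import os


def parse_local_rank(argv=None):
    """Hypothetical helper: read the local rank from either flag spelling.

    Not taken from NAFNet; it only illustrates the argparse change
    discussed in this thread.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--local-rank', '--local_rank',  # accept dashed and underscored forms
        dest='local_rank',
        type=int,
        # Assumption: fall back to LOCAL_RANK (set by torchrun), else 0.
        default=int(os.environ.get('LOCAL_RANK', 0)),
    )
    # parse_known_args ignores the script's other options in this sketch.
    args, _unknown = parser.parse_known_args(argv)
    return args.local_rank


if __name__ == '__main__':
    print(parse_local_rank())
```

In a real script the same add_argument call would go into the existing parser next to the other options, rather than into a separate helper.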