
Failed to initialize process group: Error initializing torch.distributed using env:// rendezvous: environment variable RANK expected, but not set, falling back to single GPU

Open ibrahim737701 opened this issue 9 months ago • 1 comments

Even after setting the environment variables like this:

    os.environ['RANK'] = '0'
    os.environ['WORLD_SIZE'] = '4'
    os.environ['MASTER_ADDR'] = 'localhost'
    os.environ['MASTER_PORT'] = '12345'

the model itself still does not load.

ibrahim737701, May 13 '24 08:05
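
For anyone hitting the same message, here is a minimal sketch of how the `env://` rendezvous variables are usually supplied. It is not taken from the distrifuser code base. Two general PyTorch facts apply: the variables must be in the environment before `torch.distributed.init_process_group` is called, and every rank needs its own process with its own `RANK`. A single process that sets `RANK=0` and `WORLD_SIZE=4` will wait for three peers that never start. The helper names `rendezvous_env`, `worker`, and `launch` are hypothetical, as are the default address and port, which are copied from the snippet above.

```python
import os


def rendezvous_env(rank, world_size, master_addr="localhost", master_port=12345):
    """Build the env:// variables for one process (hypothetical helper)."""
    if not 0 <= rank < world_size:
        raise ValueError(f"rank {rank} is outside 0..{world_size - 1}")
    return {
        "RANK": str(rank),
        "WORLD_SIZE": str(world_size),
        "MASTER_ADDR": master_addr,
        "MASTER_PORT": str(master_port),
    }


def worker(rank, world_size):
    """Entry point for one process; assumes one CUDA GPU per rank."""
    # torch is imported lazily so the helper above works without it.
    import torch
    import torch.distributed as dist

    # Set the variables before anything initializes the process group.
    os.environ.update(rendezvous_env(rank, world_size))
    torch.cuda.set_device(rank)
    dist.init_process_group(backend="nccl", init_method="env://")
    try:
        pass  # build and run the model here
    finally:
        dist.destroy_process_group()


def launch():
    """Start one worker process per visible GPU."""
    import torch
    import torch.multiprocessing as mp

    world_size = torch.cuda.device_count()
    if world_size == 0:
        raise RuntimeError("no CUDA devices visible")
    # mp.spawn calls worker(i, world_size) for i in range(world_size).
    mp.spawn(worker, args=(world_size,), nprocs=world_size)
```

Calling `launch()` from a script guarded by `if __name__ == "__main__":` starts the workers. Alternatively, starting the script with `torchrun --nproc_per_node=4 your_script.py` (the script name is a placeholder) lets PyTorch's launcher set `RANK`, `WORLD_SIZE`, `MASTER_ADDR`, and `MASTER_PORT` for each process, so no manual `os.environ` assignments are needed.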