AzureML-BERT
AzureML-BERT copied to clipboard
how can I get ip of master node
when I get it via env variable AZ_BATCHAI_MPI_MASTER_NODE, I get follow error
File "src/scripts/submit_job/distributed.py", line 273, in set_environment_variables_for_nccl_backend os.environ['MASTER_ADDR'] = os.environ['AZ_BATCHAI_MPI_MASTER_NODE'] File "/opt/conda/lib/python3.6/os.py", line 669, in getitem raise KeyError(key) from None KeyError: 'AZ_BATCHAI_MPI_MASTER_NODE'
Did you find solution for this issue? Facing the same error here