Yunqi Yan

Results 1 issues of Yunqi Yan

# Description Training is failing with a NCCL error indicating that peer-to-peer (P2P) access is not supported between the GPU devices being used. Despite setting environment variables `NCCL_P2P_DISABLE=1` and `NCCL_IGNORE_DISABLED_P2P=1`,...