Deepfake-Detection
Deepfake-Detection copied to clipboard
multiple-gpu case
@HongguLiu
have you tested your training code with multiple gpu? I got the RuntimeError: NCCL Error 2: unhandled system error
. One gpu case is fine for me. Thanks.
To train a model with multiple gpus, we use model = nn.DataParallel(model)
.
If you have trained a model with multiple gpus, you must test model with
if isinstance(model, torch.nn.DataParallel): model = model.module
@HongguLiu Thanks for letting me know the testing case. I was talking about the training. I got the NCCL error. Have you successfully finished the training with multiple gpu? if so, could you update your python requirements file https://github.com/HongguLiu/Deepfake-Detection/blob/master/requirements.txt
to include more details about the python environment ? it could be pytorch issue.
We usually train our model with multiple gpu. And this code is support of training with multiple gpu.