You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@HongguLiu
have you tested your training code with multiple gpu? I got the RuntimeError: NCCL Error 2: unhandled system error. One gpu case is fine for me. Thanks.
The text was updated successfully, but these errors were encountered:
To train a model with multiple gpus, we use model = nn.DataParallel(model) .
If you have trained a model with multiple gpus, you must test model with if isinstance(model, torch.nn.DataParallel): model = model.module
@HongguLiu Thanks for letting me know the testing case. I was talking about the training. I got the NCCL error. Have you successfully finished the training with multiple gpu? if so, could you update your python requirements file https://github.com/HongguLiu/Deepfake-Detection/blob/master/requirements.txt to include more details about the python environment ? it could be pytorch issue.
@HongguLiu
have you tested your training code with multiple gpu? I got the
RuntimeError: NCCL Error 2: unhandled system error
. One gpu case is fine for me. Thanks.The text was updated successfully, but these errors were encountered: