Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

multiple-gpu case #2

Open
zhuolinumd opened this issue Jan 24, 2020 · 3 comments
Open

multiple-gpu case #2

zhuolinumd opened this issue Jan 24, 2020 · 3 comments

Comments

@zhuolinumd
Copy link

@HongguLiu
have you tested your training code with multiple gpu? I got the RuntimeError: NCCL Error 2: unhandled system error. One gpu case is fine for me. Thanks.

@HongguLiu
Copy link
Owner

To train a model with multiple gpus, we use model = nn.DataParallel(model) .
If you have trained a model with multiple gpus, you must test model with
if isinstance(model, torch.nn.DataParallel): model = model.module

@zhuolinumd
Copy link
Author

@HongguLiu Thanks for letting me know the testing case. I was talking about the training. I got the NCCL error. Have you successfully finished the training with multiple gpu? if so, could you update your python requirements file https://github.com/HongguLiu/Deepfake-Detection/blob/master/requirements.txt to include more details about the python environment ? it could be pytorch issue.

@HongguLiu
Copy link
Owner

We usually train our model with multiple gpu. And this code is support of training with multiple gpu.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants