Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA Error #22

Open
jmwang0117 opened this issue Nov 5, 2024 · 3 comments
Open

CUDA Error #22

jmwang0117 opened this issue Nov 5, 2024 · 3 comments

Comments

@jmwang0117
Copy link

Thanks for your great work!

I use two A800-80Gs for training. It can train correctly with one subnet, but the following error occurs with three subnets. Is there any way to solve this problem?

image

image

@anhquancao
Copy link
Collaborator

Hi, I've never seen this error. Looks like there is an error when calling .max() function from pytorch. Did you use the same version of pytorch?

@jmwang0117
Copy link
Author

I use pytorch 2.1.0+cu121

@anhquancao
Copy link
Collaborator

The code was tested with pytorch 1.13, probably this could be the reason.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants