Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I have tried training multiple times, but NaN keeps occurring consistently. #142

Open
rudxo9251 opened this issue Aug 30, 2024 · 7 comments

Comments

@rudxo9251
Copy link

I continuously train multiple times, but NaN occurs. I haven't changed the training parameters, but NaN appears in the loss during training. What could be the problem?

@rudxo9251
Copy link
Author

i solve

@xuanyuzhang21
Copy link

I met the same problem. Can you tell me how to solve this issue? Thanks a lot.

@Lanjiong-Li
Copy link

I met the same problem. Can you tell me how to solve this issue? Thanks a lot.

set the lr to 1e-6 may help.

@rudxo9251
Copy link
Author

change adam_epsilon to 1e-4.

@all1new
Copy link

all1new commented Sep 5, 2024

I met the same problem. Can you tell me how to solve this issue? Thanks a lot.

set the lr to 1e-6 may help.

Is there any better training effect? ​​The training results here are not as good as the original weights.

@awais-nayyar
Copy link

I want to train this code on my own dataset. What is the minimum GPU and Ram requirements???
can anybody help to guide me?

@TnoobT
Copy link

TnoobT commented Sep 14, 2024

I want to train this code on my own dataset. What is the minimum GPU and Ram requirements??? can anybody help to guide me?

58G gpu

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants