[QUESTION] why the time of one iter in nsys longer than that in the ouput log? #1174
Unanswered
hanwen-sun
asked this question in
Q&A
Replies: 1 comment
-
Marking as stale. No activity in 60 days. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I want to compare the speed of training llama2-7b between libai(https://github.com/Oneflow-Inc/libai) and Megatron-LM in NVIDIA A800-SXM4-80G.
But I find the time of one iter in nsys is longer than the output in log when using Megatron-LM;
and I can't find many gap in the cuda stream.
Can anyone explain this to me?
Beta Was this translation helpful? Give feedback.
All reactions