Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimized inference of XGLM model on HPU #1323

Open
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

XinyuYe-Intel
Copy link
Contributor

What does this PR do?

Optimized inference of XGLM model on HPU.

Before submitting

  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Signed-off-by: Ye, Xinyu <[email protected]>
@libinta
Copy link
Collaborator

libinta commented Sep 18, 2024

@XinyuYe-Intel can you provide gaudi2 test on latest 1/17/1.18 docker RUN_SLOW=true GAUDI2_CI test and gaudi1 test result?

@XinyuYe-Intel
Copy link
Contributor Author

@XinyuYe-Intel can you provide gaudi2 test on latest 1/17/1.18 docker RUN_SLOW=true GAUDI2_CI test and gaudi1 test result?

perf on gaudi2 on 1.17.1 with RUN_SLOW=true is as below:
image

For gaudi1, I don't have the machine, so I can't provide the result.

Copy link
Collaborator

@ssarkar2 ssarkar2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@XinyuYe-Intel could you please resolve the conflicts on this PR, looks good otherwise

@XinyuYe-Intel
Copy link
Contributor Author

@XinyuYe-Intel could you please resolve the conflicts on this PR, looks good otherwise

Resolved conflicts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants