-
Notifications
You must be signed in to change notification settings - Fork 346
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[JAX] Support various implementations of RoPE. #655
[JAX] Support various implementations of RoPE. #655
Conversation
/te-ci jax |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Just a minor comment on an (optional) error message for bad RoPE method.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGMT
@nouiz Do we need JET CI for this? |
I think so. We discovered this issue as the current Rope wasn't used by JET CI if I understood correctly. So that would be the first CI run that use it. |
/te-ci jax |
Hi @mingxu1067, I found this PR cause functional regression on the LLaMA model. We can discuss offline in the next week. |
f61518e
to
1c18a40
Compare
/te-ci jax |
1c18a40
to
a9bb39e
Compare
…tation bug Signed-off-by: Ming Huang <[email protected]>
a9bb39e
to
375209b
Compare
/te-ci jax |
@denera @nouiz @zlsh80826 |
@mingxu1067 LGTM! Thanks for seeing this through. |
alternate
is to pair indexi
withi + d/2
,
d
is the hidden dimension. 'consecutive' pairs indexi
withi + 1
.