Adding e2e tests for i1 mask attentions #19312

Draft · wants to merge 3 commits into base: main


lialan (Contributor) commented Nov 27, 2024

  • The new tests run with the option `--iree-experimental-packed-i1-storage` turned on, which allows a real packed i1 datatype in memory.
  • Only certain shapes produce correct results at the moment, because the upstream patches for emulating unaligned vector stores are not yet merged.

The goal is to test actual i1 handling with the attention op.
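For readers unfamiliar with packed i1 storage, the following is a minimal Python sketch of the idea. It is not IREE code. It assumes 8 mask bits per byte with the lowest bit first, and it assumes that "unaligned" refers to rows whose first element does not fall on a byte boundary; the PR does not spell out either detail, and the helper names are hypothetical.

```python
# Illustrative model only; not taken from IREE.
# Assumption: i1 values are packed 8 per byte, lowest bit first.

def pack_i1(bits):
    """Pack a flat list of 0/1 values into bytes, 8 values per byte."""
    out = bytearray((len(bits) + 7) // 8)
    for i, b in enumerate(bits):
        if b:
            out[i // 8] |= 1 << (i % 8)
    return bytes(out)


def unpack_i1(data, count):
    """Recover the first `count` 0/1 values from packed bytes."""
    return [(data[i // 8] >> (i % 8)) & 1 for i in range(count)]


def row_starts_on_byte_boundary(row_index, row_length):
    """Check whether a row of a row-major packed mask starts at bit 0 of a byte."""
    return (row_index * row_length) % 8 == 0


mask_row = [1, 0, 1, 1, 0, 0, 0, 0]
packed = pack_i1(mask_row)
assert unpack_i1(packed, len(mask_row)) == mask_row
```

Under these assumptions, a mask whose innermost dimension is a multiple of 8 has every row starting on a byte boundary, while a row length such as 5 places later rows in the middle of a byte. Writing such a row requires a read-modify-write of the shared byte, which is one plausible reading of why only certain shapes work until unaligned vector stores can be emulated.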
lialan force-pushed the lialan/attention_i1_e2e branch 5 times, most recently from 00ab73a to 35f7356, on November 29, 2024 03:10

Signed-off-by: Alan Li <[email protected]>