Currently, our attention kernels do not compile with Triton on Turing or Volta GPUs, even with the latest Triton version. We have not investigated in depth, but the failure may be related to triton-lang/triton#616. We could probably rewrite the forward pass of the kernel to make it work, but we prefer to wait for a more stable Triton release first.
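
Until that is resolved, one possible workaround on the user side is to check the GPU's compute capability before enabling the Triton attention kernels. The snippet below is only a minimal sketch, not part of the library: the function names and the 8.0 threshold are assumptions made for illustration (Volta is compute capability 7.0, Turing is 7.5, and Ampere starts at 8.0), and the only PyTorch calls used are `torch.cuda.is_available()` and `torch.cuda.get_device_capability()`.

```python
from typing import Tuple

# Compute capabilities: Volta is 7.0, Turing is 7.5, Ampere starts at 8.0.
# Assumption for this sketch: only 8.0 and newer are treated as supported.
MIN_SUPPORTED_CAPABILITY: Tuple[int, int] = (8, 0)


def can_use_triton_attention(capability: Tuple[int, int]) -> bool:
    """Return True if the (major, minor) compute capability is at least 8.0."""
    return tuple(capability) >= MIN_SUPPORTED_CAPABILITY


def current_gpu_can_use_triton_attention() -> bool:
    """Check the current CUDA device; returns False without PyTorch or a GPU."""
    try:
        import torch  # imported lazily so the pure check above has no dependency
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    return can_use_triton_attention(torch.cuda.get_device_capability())
```

A caller could use `current_gpu_can_use_triton_attention()` to decide whether to apply the Triton-based optimization or to keep the default PyTorch attention path on Turing and Volta cards.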

Replies: 1 comment

Answer selected by seastar105