Skip to content

Conversation

@qqbbiu
Copy link
Collaborator

@qqbbiu qqbbiu commented Dec 19, 2025

1.cuda>=12.8 python链路支持了完整的xqa,与现有的pybingds c++ op不一样;
2.优化了xqa的kernel,对应在内源推出了wheel包,支持了h200的最优参数finetune;
3.增加spec与非sepc的ut

@qqbbiu qqbbiu requested a review from LLLLKKKK as a code owner December 19, 2025 09:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants