Skip to content

Comments

MXFP4 8 wave pingpong#944

Merged
panditsa merged 4 commits intoiree-org:mainfrom
adedespirlet:mxfp_final
Feb 21, 2026
Merged

MXFP4 8 wave pingpong#944
panditsa merged 4 commits intoiree-org:mainfrom
adedespirlet:mxfp_final

Conversation

@adedespirlet
Copy link
Contributor

@adedespirlet adedespirlet commented Feb 20, 2026

This PR contains two schedules for 8 wave pingpong.
The two kernels are
test_dbuf_8wave_pingpong_mxfp_gemm()
and
test_dbuf_8wave_mixed_pingpong_mxfp_gemm()

the mixed_pingpong version performs better on the big shapes of interest
2048,57344,16384
4096,57344,16384
8192,57344,16384
8192,57344,8192
16384,57344,16384
16384,16384,16384
32768,57344,16384
32768,57344,8192

Signed-off-by: Aurore De Spirlet <aurore.despirlet@amd.com>
Signed-off-by: Aurore De Spirlet <aurore.despirlet@amd.com>
Signed-off-by: Aurore De Spirlet <aurore.despirlet@amd.com>
@panditsa panditsa merged commit 2cde4db into iree-org:main Feb 21, 2026
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants