
[BUG] fix moe benchmark when bs*seq is small #3382

Conversation

yiakwy-xpu-ml-framework-team
Contributor

Motivation

In "benchmark_deepseekv3_moe_align_blocks.py" change block_size 3 and run calculate_diff(batch_size=1, seq_len=4). The test will fail.

This is because expert_ids on the CUDA path is not zero-initialized and contains junk data.
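For illustration, a minimal sketch of the fix, assuming the benchmark sizes the buffer the way the standard moe_align_block_size setup does (the shape arithmetic and variable names below are illustrative, not copied verbatim from the PR):

```python
import torch

# Illustrative sizes for the failing case: batch_size=1, seq_len=4, block_size=3.
num_tokens, topk, block_size, num_experts = 4, 8, 3, 256
max_num_tokens_padded = num_tokens * topk + num_experts * (block_size - 1)
max_num_m_blocks = (max_num_tokens_padded + block_size - 1) // block_size

# Before: torch.empty leaves stale bytes from prior CUDA allocations, so
# blocks the kernel never writes (common when bs * seq is small) hold junk.
expert_ids = torch.empty(max_num_m_blocks, dtype=torch.int32, device="cuda")

# After: zero-initializing makes the unwritten padding slots deterministic,
# so the CUDA and Triton outputs can be compared element-wise.
expert_ids = torch.zeros(max_num_m_blocks, dtype=torch.int32, device="cuda")
```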

Modifications

After the fix, the script runs smoothly for:

  • calculate_diff(batch_size=4, seq_len=1024)
  • calculate_diff(batch_size=1, seq_len=4)
[Screenshot of the successful run, 2025-02-08 06:29:59]
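To see why the uninitialized buffer only bites when bs * seq is small, here is a standalone sketch in pure PyTorch; fill_first_n is a hypothetical stand-in for a kernel that writes only part of its output buffer:

```python
import torch

def fill_first_n(buf: torch.Tensor, n: int) -> None:
    # Stand-in for a kernel that only writes the first n slots
    # (what happens when bs * seq_len is small relative to the buffer).
    buf[:n] = torch.arange(n, dtype=buf.dtype, device=buf.device)

a = torch.empty(32, dtype=torch.int32, device="cuda")  # buggy: stale bytes
b = torch.zeros(32, dtype=torch.int32, device="cuda")  # fixed: deterministic
fill_first_n(a, 4)
fill_first_n(b, 4)
# a[4:] holds junk from a previous allocation, so an element-wise comparison
# against a zero-initialized reference can fail nondeterministically.
print(torch.equal(a, b))  # may print False
```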

Checklist

  • Format your code according to the Code Formatting with Pre-Commit.
  • Add unit tests as outlined in the Running Unit Tests.
  • Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
  • Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
  • For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.

@yiakwy-xpu-ml-framework-team yiakwy-xpu-ml-framework-team changed the title fix moe benchmark when bs*seq is small [BUG] fix moe benchmark when bs*seq is small Feb 7, 2025
@zhyncs zhyncs enabled auto-merge (squash) February 8, 2025 07:37
@zhyncs zhyncs disabled auto-merge February 8, 2025 07:37
@zhyncs zhyncs merged commit 64480df into sgl-project:main Feb 8, 2025
1 check passed
@BBuf BBuf mentioned this pull request Feb 8, 2025