-
Notifications
You must be signed in to change notification settings - Fork 862
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[BUG] fix moe benchmark when bs*seq is small
#3382
opened Feb 7, 2025 by
yiakwy-xpu-ml-framework-team
Loading…
3 of 5 tasks
Docs: Add Slurm Example on Multi-Nodes
amd
documentation
Improvements or additions to documentation
#3379
opened Feb 7, 2025 by
zhaochenyang20
Loading…
2 of 5 tasks
FEA compat with ipv6
amd
good first issue
Good for newcomers
help wanted
Extra attention is needed
#3301
opened Feb 4, 2025 by
lxww302
Loading…
4 tasks done
[Feature] Support llguidance for constrained decoding
#3298
opened Feb 4, 2025 by
JC1DA
Loading…
2 of 3 tasks
[ROCm] Enable Fused MLA Triton kernel for DeepSeekV3
#3237
opened Jan 31, 2025 by
lcskrishna
•
Draft
docs(references/deepseek): current section links
#3225
opened Jan 31, 2025 by
guspan-tanadi
Loading…
4 tasks
Add support for nvidia modelopt fp8 kv cache
high priority
#3223
opened Jan 30, 2025 by
Edwardf0t1
Loading…
1 of 4 tasks
Online serving benchmarks of real datasets for hierarchical KV caching
#3211
opened Jan 30, 2025 by
PanJason
Loading…
5 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.