-
Notifications
You must be signed in to change notification settings - Fork 528
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Performance]: Process affinity to CPU cores with multiple sockets support
#2171
opened Nov 25, 2024 by
HaiShaw
Loading…
Replace prob based with threshold based load balancing
#2170
opened Nov 25, 2024 by
ByronHsu
Loading…
3 tasks done
add profile in offline benchmark & update doc
await-response
#2123
opened Nov 22, 2024 by
bjmsong
Loading…
3 tasks
feat: use cascade attention kernel (single level)
#2101
opened Nov 20, 2024 by
james-p-xu
•
Draft
1 of 3 tasks
Add log input text when using openai chat api
await-response
#2058
opened Nov 17, 2024 by
ccjincong
Loading…
3 tasks done
Input_embeds support
await-response
high priority
#2052
opened Nov 16, 2024 by
RinRin-32
Loading…
1 of 3 tasks
Surpport kv cache int8/int4 for triton backend
await-response
#1644
opened Oct 12, 2024 by
yuguo-Jack
Loading…
ProTip!
Updated in the last three days: updated:>2024-11-21.