Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Replace prob based with threshold based load balancing
#2170 opened Nov 25, 2024 by ByronHsu Loading…
3 tasks done
test select concurrency
#2165 opened Nov 24, 2024 by qeternity Loading…
Speculative EAGLE2. New PR
#2150 opened Nov 24, 2024 by yukavio Loading…
Byhsu/fairness router
#2149 opened Nov 24, 2024 by ByronHsu Draft
3 tasks
add profile in offline benchmark & update doc await-response
#2123 opened Nov 22, 2024 by bjmsong Loading…
3 tasks
Online weight update [WIP]
#2119 opened Nov 22, 2024 by zhaochenyang20 Draft
3 tasks
feat: use cascade attention kernel (single level)
#2101 opened Nov 20, 2024 by james-p-xu Draft
1 of 3 tasks
Add log input text when using openai chat api await-response
#2058 opened Nov 17, 2024 by ccjincong Loading…
3 tasks done
[TEST] flashinfer version upgrade to v0.2.0
#2054 opened Nov 17, 2024 by james-p-xu Draft
3 tasks
Input_embeds support await-response high priority
#2052 opened Nov 16, 2024 by RinRin-32 Loading…
1 of 3 tasks
Add support for GPT-J
#2041 opened Nov 15, 2024 by danilotpnta Draft
regex stopping condition
#2035 opened Nov 14, 2024 by jancervenka Loading…
3 tasks done
[WIP] Use FlashInfer RoPE
#2016 opened Nov 12, 2024 by james-p-xu Loading…
3 tasks done
Debug studio await-response
#1831 opened Oct 29, 2024 by zolinthecow Loading…
3 tasks done
Function calling for OpenAI backend
#573 opened Jun 29, 2024 by Yiyun-Liang Loading…
ProTip! Updated in the last three days: updated:>2024-11-21.