Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Model] Add support for Qwen2 for embeddings documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#10184 opened Nov 9, 2024 by DarkLight1337 Loading…
3
7
Add docs on serving with Llama Stack documentation Improvements or additions to documentation
#10183 opened Nov 9, 2024 by terrytangyuan Loading…
[Docs] Misc updates to TPU installation instructions documentation Improvements or additions to documentation
#10165 opened Nov 8, 2024 by mikegre-google Loading…
[Doc] Move PR template content to docs ci/build documentation Improvements or additions to documentation
#10159 opened Nov 8, 2024 by russellb Loading…
Fix missing data type in flashinfer prefill
#10141 opened Nov 8, 2024 by reyoung Loading…
[Kernel]Enable HPU for Speculative Decoding
#10131 opened Nov 7, 2024 by xuechendi Loading…
[WIP] Prefix Cache Aware Scheduling [1/n]
#10128 opened Nov 7, 2024 by rickyyx Loading…
[V1][Bugfix] Propagate V1 LLMEngine properly ready ONLY add when PR is ready to merge/full CI is needed
#10127 opened Nov 7, 2024 by comaniac Loading…
Fix quantization config of vl model
#10120 opened Nov 7, 2024 by jinzhen-lin Loading…
[Hardware][XPU] AWQ/GPTQ support for xpu backend documentation Improvements or additions to documentation needs-rebase
#10107 opened Nov 7, 2024 by yma11 Loading…
[CI/Build] Bump test transformers version ci/build needs-rebase ready ONLY add when PR is ready to merge/full CI is needed
#10106 opened Nov 7, 2024 by Isotr0py Draft
[Kernel]Generalize Speculative decode from Cuda
#10094 opened Nov 6, 2024 by xuechendi Loading…
Splitting attention kernel file ci/build ready ONLY add when PR is ready to merge/full CI is needed
#10091 opened Nov 6, 2024 by maleksan85 Loading…
[CI/Build] Split up models tests ci/build ready ONLY add when PR is ready to merge/full CI is needed
#10069 opened Nov 6, 2024 by DarkLight1337 Loading…
[Core] Add dynamic chunk size calculation
#10061 opened Nov 6, 2024 by prashantgupta24 Loading…
ProTip! Updated in the last three days: updated:>2024-11-06.