Pull requests: vllm-project/tpu-inference

Pull requests list

[Multi-host] Generalize device_put for single and multi-host sharding (label: ready, "ONLY add when PR is ready to merge/full CI is needed")
#1653 opened Feb 7, 2026 by Lumosis
Update Dockerfile to allow automatically starting vLLM server (label: ready)
#1651 opened Feb 6, 2026 by wenxindongwork
[AWQ][MoE] Support AWQ MoE
#1644 opened Feb 6, 2026 by catswe Draft
Fix the vllm upstream break changes (label: ready)
#1642 opened Feb 5, 2026 by mrjunwan-lang
[Jax/Flax Version Change] Update Jax to 0.8.1 and flax to 0.12.3 (label: ready)
#1641 opened Feb 5, 2026 by patrickji2014
[Qwen][FP8] Add test for pre-quantized FP8 qwen3 model (label: ready)
#1640 opened Feb 5, 2026 by lk-chen
JAX FP8 Linear Infrastructure - Tensor Quantized (label: ready)
#1638 opened Feb 5, 2026 by patemotter
[JAX][MoE] fix bug for SparseMoE
#1625 opened Feb 4, 2026 by bzgoogle
[Log] resolves confusion for ep test log (label: ready)
#1619 opened Feb 4, 2026 by syhuang22
Split JAX unit tests into two parts with coverage (label: ready)
#1609 opened Feb 3, 2026 by ernie-chang Draft
MLA Support for GLM 4.7 Flash
#1595 opened Feb 1, 2026 by mkurman
Support DeepSeek-V2-Lite-Chat-FP8
#1593 opened Feb 1, 2026 by catswe Draft
[Parallelism Support Matrix Tests] fix ep test with sharding additional_config (label: ready)
#1589 opened Jan 31, 2026 by syhuang22
Standardize Tuned Block Sizes via JSON (label: ready)
#1578 opened Jan 30, 2026 by RobMulla
1 task done
Pin the vllm hash to a vllm hash to minitgate test issue. (label: ready)
#1574 opened Jan 30, 2026 by QiliangCui
Fix/structured output batch alignment (label: ready)
#1563 opened Jan 29, 2026 by XMaster96