-
Notifications
You must be signed in to change notification settings - Fork 96
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Remove _reqs_being_saved check from status clean up for the finished …
#1654
opened Feb 7, 2026 by
dannawang0221
Loading…
[Multi-host] Generalize device_put for single and multi-host sharding
ready
ONLY add when PR is ready to merge/full CI is needed
#1653
opened Feb 7, 2026 by
Lumosis
Loading…
Update Dockerfile to allow automatically starting vLLM server
ready
ONLY add when PR is ready to merge/full CI is needed
#1651
opened Feb 6, 2026 by
wenxindongwork
Loading…
[wip] [do not review] batch load for tpu kv host offload connector
#1649
opened Feb 6, 2026 by
saikat-royc
Loading…
Fix the vllm upstream break changes
ready
ONLY add when PR is ready to merge/full CI is needed
#1642
opened Feb 5, 2026 by
mrjunwan-lang
Loading…
[Jax/Flax Version Change] Update Jax to 0.8.1 and flax to 0.12.3
ready
ONLY add when PR is ready to merge/full CI is needed
#1641
opened Feb 5, 2026 by
patrickji2014
Loading…
[Qwen][FP8] Add test for pre-quantized FP8 qwen3 model
ready
ONLY add when PR is ready to merge/full CI is needed
#1640
opened Feb 5, 2026 by
lk-chen
Loading…
JAX FP8 Linear Infrastructure - Tensor Quantized
ready
ONLY add when PR is ready to merge/full CI is needed
#1638
opened Feb 5, 2026 by
patemotter
Loading…
Massage some parameters to balance prefill/decode and improve disagg performance
#1622
opened Feb 4, 2026 by
mrjunwan-lang
Loading…
[Log] resolves confusion for ep test log
ready
ONLY add when PR is ready to merge/full CI is needed
#1619
opened Feb 4, 2026 by
syhuang22
Loading…
[TPU host offload] batched save bulk transfer for tpu host offload connector
#1612
opened Feb 3, 2026 by
saikat-royc
Loading…
Split JAX unit tests into two parts with coverage
ready
ONLY add when PR is ready to merge/full CI is needed
#1609
opened Feb 3, 2026 by
ernie-chang
•
Draft
Bump Jax to 0.9.0, flax to 0.12.3 and qwix to 0.1.4
#1598
opened Feb 2, 2026 by
patrickji2014
Loading…
[bug] resolve compatibility issues with vllm trunk
#1592
opened Feb 1, 2026 by
bangshengtang
Loading…
[Parallelism Support Matrix Tests] fix ep test with sharding additional_config
ready
ONLY add when PR is ready to merge/full CI is needed
#1589
opened Jan 31, 2026 by
syhuang22
Loading…
Standardize Tuned Block Sizes via JSON
ready
ONLY add when PR is ready to merge/full CI is needed
#1578
opened Jan 30, 2026 by
RobMulla
Loading…
1 task done
Pin the vllm hash to a vllm hash to minitgate test issue.
ready
ONLY add when PR is ready to merge/full CI is needed
#1574
opened Jan 30, 2026 by
QiliangCui
Loading…
Fix/structured output batch alignment
ready
ONLY add when PR is ready to merge/full CI is needed
#1563
opened Jan 29, 2026 by
XMaster96
Loading…
Update contributor guidelines to prefer Torchax implementation of models
#1558
opened Jan 29, 2026 by
muskansh-google
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.