Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Upstream fp8 with static scales gpt oss gpt-oss Related to GPT-OSS models needs-rebase
#30357 opened Dec 9, 2025 by maleksan85 Draft
[CI][DeepSeek] Add nightly DeepSeek R1 lm_eval tests on H200 ci/build deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed
#30356 opened Dec 9, 2025 by MatthewBonanni Loading…
2 of 5 tasks
[Bugfix] Cache added_vocab to avoid per-token overhead
#30351 opened Dec 9, 2025 by scratch-ml Loading…
5 tasks
Remove virtual engine handling codex kv-connector needs-rebase qwen Related to Qwen models tpu Related to Google TPUs v1
#30350 opened Dec 9, 2025 by WoosukKwon Loading…
[BugFix] Fix minimax m2 model rope_parameters
#30349 opened Dec 9, 2025 by esmeetu Loading…
5 tasks
[Docs]: adds a new metric vllm:request_prefill_kv_computed_tokens in docs documentation Improvements or additions to documentation
#30348 opened Dec 9, 2025 by googs1025 Loading…
5 tasks
[cpu][ci] Add CPU Attention Tests for Neon Backend
#30347 opened Dec 9, 2025 by fadara01 Loading…
2 tasks
Fix typos in comments across multiple files documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed v1
#30345 opened Dec 9, 2025 by wilsonwu Loading…
5 tasks
Fix gigachat3 parser + update tests frontend tool-calling
#30338 opened Dec 9, 2025 by ajpqs Loading…
3 of 5 tasks
fix: enhance human_readable_int function
#30337 opened Dec 9, 2025 by andyxning Loading…
5 tasks
[Bugfix] Fix fp8 DeepGemm compilation issues bug Something isn't working ci-failure Issue about an unexpected test failure in CI deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed
#30336 opened Dec 9, 2025 by ElizaWszola Loading…
[Bugfix] tpu_model_runner: set vllm config context when calling reset_dynamo_cache() ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#30331 opened Dec 9, 2025 by dtrifiro Loading…
[Bugfix] Fix cuda graph sizes when running with speculative decoding nvidia ready ONLY add when PR is ready to merge/full CI is needed
#30330 opened Dec 9, 2025 by PatrykSaffer Loading…
[BugFix] Fix hang issue in LMCache mp mode kv-connector v1
#30327 opened Dec 9, 2025 by wz1qqx Loading…
5 tasks
[Frontend] [Doc] Exclude log deltas feature frontend
#30322 opened Dec 9, 2025 by Catacomba Loading…
3 tasks done
ProTip! Filter pull requests by the default branch with base:main.