Pull requests: InternLM/lmdeploy

Pull requests list

GLM-4.7-Flash Turbomind support
#4362 opened Feb 13, 2026 by lapy
2 of 4 tasks
fix fa3 install
#4361 opened Feb 13, 2026 by irexyc
[WIP] Support video inputs
#4360 opened Feb 13, 2026 by CUHKSZzxy Draft
fix ssm inputs merge
#4359 opened Feb 13, 2026 by grimoire
support glm5
#4355 opened Feb 12, 2026 by grimoire
Improve proxy server (label: improvement)
#4354 opened Feb 12, 2026 by lvhan028
Qwen3.5
#4351 opened Feb 11, 2026 by grimoire
Support MiniMax-M2 in TurboMind engine
#4343 opened Feb 10, 2026 by zh-nj
Fix authorization (label: Bug:P1)
#4338 opened Feb 9, 2026 by lvhan028
[WIP] Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
Qwen/Internlm/Llama Dense/Moe model fp8 quant online (label: enhancement)
#4324 opened Feb 5, 2026 by 43758726
return BadRequest for all invalid inputs (label: Bug:P2)
#4291 opened Jan 26, 2026 by lvhan028
support repetition ngram logits processor (label: enhancement)
#4288 opened Jan 23, 2026 by grimoire
fix dllm mask on set_step
#4278 opened Jan 18, 2026 by grimoire
[ascend] fix awq and smoothq
#4277 opened Jan 16, 2026 by wanfengcxz Draft
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028