Pull requests: vllm-project/vllm
Add use_flashinfer_sampler function with device capability check
#28379
opened Nov 10, 2025 by
usberkeley
5 tasks done
[ROCm] Add missing gemm_a8w8_blockscale import
rocm
Related to AMD ROCm
#28378
opened Nov 10, 2025 by
sarckk
1 of 5 tasks
[ROCm] Support for Whisper v1 with Aiter Unified Attention and Aiter Flash Attention
rocm
Related to AMD ROCm
v1
#28376
opened Nov 10, 2025 by
apinge
5 tasks
[Fix] optimize visual token mask with caching and multi-token support
#28374
opened Nov 10, 2025 by
bo-ke
[WIP][DEBUG] Tool converter
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#28372
opened Nov 10, 2025 by
chaunceyjiang
5 tasks
[refactor] [mla]: independently passing q_nope & q_rope
v1
#28368
opened Nov 9, 2025 by
vnadathur
Feature/isaac 0.1
new-model
Requests to new models
#28367
opened Nov 9, 2025 by
oscardev256
Draft
5 tasks
[Bugfix] Fix persistent_masked_m_silu_mul_quant tests
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
#28366
opened Nov 9, 2025 by
varun-sundar-rabindranath
[bugfix] fix siglip batch text output error
#28365
opened Nov 9, 2025 by
piood
3 of 5 tasks
Add @tjtanaa to codeowner for ROCm and multi-modal
ci/build
rocm
Related to AMD ROCm
#28360
opened Nov 9, 2025 by
tjtanaa
5 tasks
[Performance][B200] silu_mul_quant: pack scales in int32
deepseek
Related to DeepSeek models
kernel
nvidia
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#28358
opened Nov 9, 2025 by
varun-sundar-rabindranath
[Doc] Sleep mode documentation
documentation
Improvements or additions to documentation
#28357
opened Nov 9, 2025 by
iAmir97
5 tasks
add cpu option for p/d in nixl_connector
kv-connector
#28356
opened Nov 9, 2025 by
ZhengHongming888
5 tasks
Fix/responses api harmony channel metadata #28262
frontend
gpt-oss
Related to GPT-OSS models
#28355
opened Nov 9, 2025 by
baonudesifeizhai
5 tasks
[Misc] FlattenLogprobs -> FlatLogprobs
ready
ONLY add when PR is ready to merge/full CI is needed
#28335
opened Nov 8, 2025 by
zhuohan123
5 tasks
[Frontend] split append tool output
frontend
gpt-oss
Related to GPT-OSS models
#28333
opened Nov 8, 2025 by
qandrew
[Model] Add Afmoe architecture implementation
documentation
Improvements or additions to documentation
new-model
Requests to new models
#28332
opened Nov 8, 2025 by
pranav4501
5 tasks done
[Frontend][2/n] remove empty content from _parse_tool_calls_from_content
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#28331
opened Nov 7, 2025 by
qandrew
Fix: tool call streaming when both reasoning and tool parsers are enabled #28297
frontend
#28330
opened Nov 7, 2025 by
baonudesifeizhai
5 tasks