Pull requests: vllm-project/vllm
Pull requests list
[Bugfix][Frontend] support webm with audioread fallback
frontend
#18477
opened May 21, 2025 by
cpwan
[Misc] refactor disaggregated-prefill-v1 example
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18474
opened May 21, 2025 by
reidliu41
add quick allreduce for vllm
ci/build
documentation
Improvements or additions to documentation
frontend
v1
#18473
opened May 21, 2025 by
lihaoyang-amd
•
Draft
make TIMEOUT_KEEP_ALIVE configurable through env var
frontend
#18472
opened May 21, 2025 by
liusiqian-tal
[Platform] Move platform check to right place
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
#18470
opened May 21, 2025 by
wangxiyuan
[V1] Support LLM.apply_model
ready
ONLY add when PR is ready to merge/full CI is needed
v1
frontend
multi-modality
Related to multi-modality (#4194)
#18465
opened May 21, 2025 by
DarkLight1337
[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking
ci/build
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
structured-output
tool-calling
v1
#18454
opened May 21, 2025 by
Crucifixion-Fxl
[V1] fix torch profiling for V1 offline scenarios
v1
#18445
opened May 21, 2025 by
divakar-amd
Enable CPU nightly performance benchmark and its Markdown report
ci/build
#18444
opened May 21, 2025 by
louie-tsai
[Core] Add support for sampling penalties to v1 ngram speculative decoding
speculative-decoding
v1
#18441
opened May 20, 2025 by
pooyadavoodi
Fix: [NixlConnector] use agent_name instead of engine_id in nixl send_notif()
#18438
opened May 20, 2025 by
juncgu
[KERNEL] Sampler. CUDA kernel for applying repetition penalty
ci/build
#18437
opened May 20, 2025 by
vadiklyutiy
[V1] Support Deepseek MTP
ci/build
frontend
speculative-decoding
v1
#18435
opened May 20, 2025 by
YaoJiayi
3 tasks
[Kernel] DeepEP dispatch-combine kernel integration
#18434
opened May 20, 2025 by
varun-sundar-rabindranath
•
Draft
[Bugfix] Add half type support in reshape_and_cache_cpu_impl on x86 cpu platform
ready
ONLY add when PR is ready to merge/full CI is needed
#18430
opened May 20, 2025 by
zzzyq
[Feature][MXFP4] Add GEMM kernel
documentation
Improvements or additions to documentation
#18426
opened May 20, 2025 by
fxmarty-amd