-
-
Notifications
You must be signed in to change notification settings - Fork 6.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TPU][V1][DEBUG] Provide Env Variable To Disable Sampler
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#16063
opened Apr 4, 2025 by
NickLucche
Loading…
[ROCm][V1] Changes needed for making vllm run on Fedora 41 with gtx1100
ci/build
#16062
opened Apr 4, 2025 by
martinhoyer
Loading…
[Bugfix][TPU] Fix V1 TPU worker for sliding window
tpu
Related to Google TPUs
v1
#16059
opened Apr 4, 2025 by
mgoin
Loading…
[fix]: Dockerfile.ppc64le fixes for opencv-python and hf-xet
ci/build
#16048
opened Apr 4, 2025 by
Shafi-Hussain
•
Draft
[Misc] improve chat_with_tools example
documentation
Improvements or additions to documentation
#16044
opened Apr 4, 2025 by
reidliu41
Loading…
Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling
ci/build
documentation
Improvements or additions to documentation
#16043
opened Apr 4, 2025 by
aws-satyajith
Loading…
[CI/Build] Set up performance benchmark on TPUs
ci/build
#16042
opened Apr 4, 2025 by
yarongmu-google
Loading…
[Bugfix] LoRA : Fix the order in which the kernels process LoRAs
#16040
opened Apr 3, 2025 by
varun-sundar-rabindranath
Loading…
Use moe_wna16 kernel for compressed tensors wna16 moe models
#16038
opened Apr 3, 2025 by
mgoin
Loading…
[V1][Spec Decode] Eagle Model loading
documentation
Improvements or additions to documentation
v1
#16035
opened Apr 3, 2025 by
LiuXiaoxuanPKU
Loading…
[NVIDIA] Support Cutlass MLA for Blackwell GPUs
ci/build
#16032
opened Apr 3, 2025 by
kaixih
Loading…
[Misc] Auto detect bitsandbytes pre-quantized models
documentation
Improvements or additions to documentation
quantization
ready
ONLY add when PR is ready to merge/full CI is needed
#16027
opened Apr 3, 2025 by
tristanleclercq
Loading…
[Doc][Bugfix] Add missing EOF in k8s deploy doc
documentation
Improvements or additions to documentation
#16025
opened Apr 3, 2025 by
psschwei
Loading…
[Benchmark] Add sampling parameters to benchmark_serving.
#16022
opened Apr 3, 2025 by
hyeygit
Loading…
[Misc][Benchmark] Remove colon from key 'request_goodput:'
#16018
opened Apr 3, 2025 by
appleparan
Loading…
[Model] Add smolvlm support
documentation
Improvements or additions to documentation
frontend
v1
#16017
opened Apr 3, 2025 by
chaunceyjiang
Loading…
[Model] Add Related to Google TPUs
v1
SupportsMultiModal.get_language_model
interface
tpu
#16007
opened Apr 3, 2025 by
NickLucche
Loading…
[Misc] Fix test_sharded_state_loader.py(#16004)
#16005
opened Apr 3, 2025 by
Accelerator1996
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.