forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 40
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
removing quant and kv-cache fp8 from deepseek run instructions
#509
opened Apr 9, 2025 by
arakowsk-amd
Loading…
Handling input dim size greater than 3 in tuned_gemm.py
#482
opened Mar 13, 2025 by
charlifu
Loading…
EXPERIMENTING WITH K8S // NO NEED TO MERGE // Rocm vllm ci fix nd k8 osci
#477
opened Mar 12, 2025 by
Alexei-V-Ivanov-AMD
Loading…
Updating ISL and OSL to align with reported benchmark table
stale
#424
opened Feb 14, 2025 by
eduand-alvarez
Loading…
K8test baseline -> Testing a single MI300 8x GPU node for CI performance // no need to merge
stale
#409
opened Feb 6, 2025 by
Alexei-V-Ivanov-AMD
Loading…
ProTip!
Adding no:label will show everything without a label.