vLLM

vllm Public
A high-throughput and memory-efficient inference and serving engine for LLMs

vllm-project/vllm’s past year of commit activity

Python 48,626 Apache-2.0 7,706 1,859 (12 issues need help) 680 Updated Jun 1, 2025
llm-compressor Public
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

vllm-project/llm-compressor’s past year of commit activity

Python 1,420 Apache-2.0 138 56 (11 issues need help) 33 Updated May 31, 2025
vllm-ascend Public
Community maintained hardware plugin for vLLM on Ascend

vllm-project/vllm-ascend’s past year of commit activity

Python 703 Apache-2.0 176 140 (1 issue needs help) 72 Updated May 30, 2025
vllm-spyre Public
Community maintained hardware plugin for vLLM on Spyre

vllm-project/vllm-spyre’s past year of commit activity

Python 24 Apache-2.0 14 13 (2 issues need help) 10 Updated May 30, 2025
aibrix Public
Cost-efficient and pluggable Infrastructure components for GenAI inference

vllm-project/aibrix’s past year of commit activity

Jupyter Notebook 3,644 Apache-2.0 362 161 (15 issues need help) 14 Updated May 30, 2025
production-stack Public
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

vllm-project/production-stack’s past year of commit activity

Python 1,291 Apache-2.0 192 53 (2 issues need help) 35 Updated May 30, 2025
ci-infra Public
This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

vllm-project/ci-infra’s past year of commit activity

HCL 11 24 0 6 Updated May 27, 2025
vllm-openvino Public

vllm-project/vllm-openvino’s past year of commit activity

Python 11 Apache-2.0 5 2 0 Updated May 27, 2025
vllm-project.github.io Public

vllm-project/vllm-project.github.io’s past year of commit activity

HTML 11 16 0 0 Updated May 14, 2025
flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention

vllm-project/flash-attention’s past year of commit activity

Python 72 BSD-3-Clause 1,725 0 11 Updated May 3, 2025

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Sponsors

Top languages

Uh oh!

Most used topics

Uh oh!