KubeAI: A K8s vLLM operator #7955
samos123
announced in
Show and tell
Replies: 0 comments
KubeAI: a vLLM K8s operator
KubeAI is the easiest way to deploy vLLM at scale on K8s. Some highlights:
✅️ Drop-in replacement for OpenAI (API-compatible endpoints)
🚀 Works on CPUs and GPUs
⚖️ Scale from zero, autoscale based on load
🛠️ Zero dependencies (no Istio, Knative, etc.)
🤖 Operates OSS model servers (vLLM and Ollama)
🔋 Includes a chat UI out of the box (OpenWebUI, a ChatGPT-like interface)
✉️ Plug-n-play with cloud messaging systems (Kafka, PubSub, etc.)
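To give a feel for the operator workflow, here is a rough sketch of what declaring a model could look like. The field names and values below are illustrative assumptions modeled on typical operator CRDs, not the authoritative schema; check the repo for the actual manifest format.

```yaml
# Hypothetical KubeAI Model manifest (field names are assumptions):
# declares a vLLM-served model that scales from zero under load.
apiVersion: kubeai.org/v1
kind: Model
metadata:
  name: llama-3.1-8b-instruct
spec:
  engine: VLLM                          # which OSS server runs the model
  url: hf://meta-llama/Llama-3.1-8B-Instruct
  minReplicas: 0                        # scale-from-zero
  resourceProfile: nvidia-gpu-l4:1      # example GPU profile
```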
Would love your feedback!
Source: https://github.com/substratusai/kubeai
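Since the server speaks the OpenAI API, any OpenAI client should work against it unchanged. A minimal stdlib-only sketch of a chat completion call is below; the `BASE_URL` (which depends on the Service name and namespace of your install) and the model name are assumptions.

```python
import json
import urllib.request

# Assumed in-cluster base URL for KubeAI's OpenAI-compatible API;
# adjust to match your Service name and namespace.
BASE_URL = "http://kubeai/openai/v1"


def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat completion payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def chat(model: str, prompt: str) -> str:
    """POST the payload to the chat completions endpoint and return the reply text."""
    data = json.dumps(build_chat_request(model, prompt)).encode()
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=data,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # Standard OpenAI response shape: first choice's message content.
    return body["choices"][0]["message"]["content"]
```

Because the request/response shapes match OpenAI's, existing SDKs and tools only need their base URL pointed at the cluster endpoint.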