Skip to content

Adding getting started instructions for GKE, Istio, and Kgateway #577

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 52 commits into from
Mar 28, 2025
Merged
Show file tree
Hide file tree
Changes from 43 commits
Commits
Show all changes
52 commits
Select commit Hold shift + click to select a range
6bc07c6
Create resources.yaml for kgateway
nicolexin Mar 25, 2025
63d7c40
Update getting started guide for KGateway
nicolexin Mar 25, 2025
048189a
Replace Envoy Gateway user guide with GKE user guide
nicolexin Mar 25, 2025
a679070
Create resources.yaml for GKE Gateway
nicolexin Mar 25, 2025
a627ea7
Delete config/manifests/gateway/enable_patch_policy.yaml
nicolexin Mar 25, 2025
7b490de
Delete config/manifests/gateway/gateway.yaml
nicolexin Mar 25, 2025
9c8d00d
Delete config/manifests/gateway/patch_policy.yaml
nicolexin Mar 25, 2025
0519935
Delete config/manifests/gateway/traffic_policy.yaml
nicolexin Mar 25, 2025
3e7e74e
Add http2 appProtocol to EPP service
nicolexin Mar 25, 2025
a140a3e
Add user guide for Istio
nicolexin Mar 25, 2025
8a878f8
Create resources.yaml for Istio
nicolexin Mar 25, 2025
f0b59e4
Fix GKE gateway name to match the user guide
nicolexin Mar 25, 2025
c06cffd
Fix cleanup instructions to refer up-to-date YAMLs
nicolexin Mar 25, 2025
21100f9
Allow Istio gateway to use HTTPRoute from all namespaces
nicolexin Mar 26, 2025
d8d4666
Update Kgateway port number to 80
nicolexin Mar 26, 2025
ee7fa97
Update gateway port to 80
nicolexin Mar 26, 2025
59cbe2e
Remove the sectionName from Kgateway HTTPRoute
nicolexin Mar 26, 2025
afc64dc
Create common httproute YAML
nicolexin Mar 26, 2025
8d235f6
Create healthcheck.yaml for GKE gateway
nicolexin Mar 26, 2025
52318b3
Separate gateway.yaml for GKE gateway
nicolexin Mar 26, 2025
9343660
Separate gateway.yaml for Istio
nicolexin Mar 26, 2025
8ef12a8
Separate gateway.yaml for Kgateway
nicolexin Mar 26, 2025
557c44f
Update the user guide to use shared HTTPRoute YAML
nicolexin Mar 26, 2025
6d48b5b
Add EPP DestinationRule for Istio
nicolexin Mar 27, 2025
e512145
Add instructions for bypassing TLS verification for Istio
nicolexin Mar 27, 2025
e82e074
Update CRDs to the latest v0.2.0 release
nicolexin Mar 27, 2025
ff8b2a1
Update gateway to use the v1 API
nicolexin Mar 27, 2025
f6f9538
Remove weight from HTTPRoute
nicolexin Mar 27, 2025
efb8c35
Update gateway.yaml
nicolexin Mar 27, 2025
5a2677e
Remove allowedRoutes from Istio gateway
nicolexin Mar 27, 2025
ce19438
Remove allowedRoutes from Kgateway
nicolexin Mar 27, 2025
b63263d
Merge branch 'kubernetes-sigs:main' into userguide
nicolexin Mar 27, 2025
d493258
Update latest instructions for installing Istio and addressing some c…
nicolexin Mar 27, 2025
9cb2575
Fix indentation for installing CRDs
nicolexin Mar 27, 2025
2574453
Merge remote-tracking branch 'upstream/main' into userguide
nicolexin Mar 27, 2025
2f9baea
Merge branch 'kubernetes-sigs:main' into userguide
nicolexin Mar 28, 2025
35a835f
Addressing code review comments
nicolexin Mar 28, 2025
0a24389
Fix indentation
nicolexin Mar 28, 2025
c1b563b
Update Istio installation instructions
nicolexin Mar 28, 2025
6d3642a
Fix indentation
nicolexin Mar 28, 2025
6a9f91a
Fix indentation
nicolexin Mar 28, 2025
b6d4c7a
Add more spacing to the CPU based model instructions
nicolexin Mar 28, 2025
e9f2298
Removing comments from kgateway
nicolexin Mar 28, 2025
484f19f
Add clarification on the EPP secureServing default value.
nicolexin Mar 28, 2025
d71f29c
Add instructions for configuring timeout
nicolexin Mar 28, 2025
41fc083
Create httproute-with-timeout.yaml
nicolexin Mar 28, 2025
d5fd70f
Create gcp-backend-policy.yaml
nicolexin Mar 28, 2025
d0ddd16
Add cleanup for GCPBackendPolicy
nicolexin Mar 28, 2025
e1c0b1d
Remove namespace from destination-rule.yaml
nicolexin Mar 28, 2025
e4471ec
Rename inferencepool.yaml to inferencepool-resources.yaml
nicolexin Mar 28, 2025
365d847
Rename inferencepool.yaml to inferencepool-resources.yaml
nicolexin Mar 28, 2025
c82487d
Rename inferencepool.yaml to inferencepool-resources.yaml
nicolexin Mar 28, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
27 changes: 0 additions & 27 deletions config/manifests/gateway/enable_patch_policy.yaml

This file was deleted.

50 changes: 0 additions & 50 deletions config/manifests/gateway/gateway.yaml

This file was deleted.

10 changes: 10 additions & 0 deletions config/manifests/gateway/gke/gateway.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
kind: Gateway
apiVersion: gateway.networking.k8s.io/v1
metadata:
name: inference-gateway
spec:
gatewayClassName: gke-l7-regional-external-managed
listeners:
- name: http
port: 80
protocol: HTTP
16 changes: 16 additions & 0 deletions config/manifests/gateway/gke/healthcheck.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
kind: HealthCheckPolicy
apiVersion: networking.gke.io/v1
metadata:
name: health-check-policy
namespace: default
spec:
targetRef:
group: "inference.networking.x-k8s.io"
kind: InferencePool
name: vllm-llama2-7b
default:
config:
type: HTTP
httpHealthCheck:
requestPath: /health
port: 8000
18 changes: 18 additions & 0 deletions config/manifests/gateway/httproute.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
name: llm-route
spec:
parentRefs:
- group: gateway.networking.k8s.io
kind: Gateway
name: inference-gateway
rules:
- backendRefs:
- group: inference.networking.x-k8s.io
kind: InferencePool
name: vllm-llama2-7b
matches:
- path:
type: PathPrefix
value: /
10 changes: 10 additions & 0 deletions config/manifests/gateway/istio/destination-rule.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
apiVersion: networking.istio.io/v1
kind: DestinationRule
metadata:
name: epp-insecure-tls
spec:
host: vllm-llama2-7b-epp.default.svc.cluster.local
trafficPolicy:
tls:
mode: SIMPLE
insecureSkipVerify: true
10 changes: 10 additions & 0 deletions config/manifests/gateway/istio/gateway.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
apiVersion: gateway.networking.k8s.io/v1
kind: Gateway
metadata:
name: inference-gateway
spec:
gatewayClassName: istio
listeners:
- name: http
port: 80
protocol: HTTP
10 changes: 10 additions & 0 deletions config/manifests/gateway/kgateway/gateway.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
apiVersion: gateway.networking.k8s.io/v1
kind: Gateway
metadata:
name: inference-gateway
spec:
gatewayClassName: kgateway
listeners:
- name: http
port: 80
protocol: HTTP
123 changes: 0 additions & 123 deletions config/manifests/gateway/patch_policy.yaml

This file was deleted.

16 changes: 0 additions & 16 deletions config/manifests/gateway/traffic_policy.yaml

This file was deleted.

1 change: 1 addition & 0 deletions config/manifests/inferencepool.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ spec:
- protocol: TCP
port: 9002
targetPort: 9002
appProtocol: http2
type: ClusterIP
---
apiVersion: apps/v1
Expand Down
Loading