Commit 05ccdd6

Add the base model of the cpu vllm sample app to InferenceModel.yaml
1 parent 07df631 commit 05ccdd6

1 file changed: +11 -0 lines changed

Diff for: config/manifests/inferencemodel.yaml

@@ -21,3 +21,14 @@ spec:
   criticality: Critical
   poolRef:
     name: my-pool
+
+---
+apiVersion: inference.networking.x-k8s.io/v1alpha2
+kind: InferenceModel
+metadata:
+  name: inferencemodel-base-model-cpu
+spec:
+  modelName: Qwen/Qwen2.5-1.5B-Instruct
+  criticality: Critical
+  poolRef:
+    name: my-pool
