Skip to content

Commit 4e02bde

Browse files
committed
Add the base model to InferenceModel sample manifest
1 parent 0bef35b commit 4e02bde

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed

Diff for: config/manifests/inferencemodel.yaml

+11
Original file line numberDiff line numberDiff line change
@@ -10,3 +10,14 @@ spec:
1010
targetModels:
1111
- name: tweet-summary-1
1212
weight: 100
13+
14+
---
15+
apiVersion: inference.networking.x-k8s.io/v1alpha2
16+
kind: InferenceModel
17+
metadata:
18+
name: inferencemodel-base-model
19+
spec:
20+
modelName: meta-llama/Llama-2-7b-hf
21+
criticality: Critical
22+
poolRef:
23+
name: my-pool

0 commit comments

Comments
 (0)