Skip to content

Commit d371095

Browse files
committed
detailing how to handle unset criticality
1 parent 6fe810d commit d371095

File tree

2 files changed

+6
-2
lines changed

2 files changed

+6
-2
lines changed

api/v1alpha1/inferencemodel_types.go

+3-1
Original file line numberDiff line numberDiff line change
@@ -77,8 +77,10 @@ type InferenceModelSpec struct {
7777
// queuing or rejecting requests of lower criticality. InferenceModels of an equivalent Criticality will
7878
// fairly share resources over throughput of tokens. In the future, the metric used to calculate fairness,
7979
// and the proportionality of fairness will be configurable.
80+
//
81+
// Default values for this field will not be set, to allow for future additions of new field that may 'one of' with this field.
82+
// Any implementations that may consume this field may treat an unset value as the 'Standard' range.
8083
// +optional
81-
// +kubebuilder:default="Default"
8284
Criticality *Criticality `json:"criticality,omitempty"`
8385

8486
// TargetModels allow multiple versions of a model for traffic splitting.

config/crd/bases/inference.networking.x-k8s.io_inferencemodels.yaml

+3-1
Original file line numberDiff line numberDiff line change
@@ -55,13 +55,15 @@ spec:
5555
condition, one will be selected at random.
5656
properties:
5757
criticality:
58-
default: Default
5958
description: |-
6059
Criticality defines how important it is to serve the model compared to other models referencing the same pool.
6160
Criticality impacts how traffic is handled in resource constrained situations. It handles this by
6261
queuing or rejecting requests of lower criticality. InferenceModels of an equivalent Criticality will
6362
fairly share resources over throughput of tokens. In the future, the metric used to calculate fairness,
6463
and the proportionality of fairness will be configurable.
64+
65+
Default values for this field will not be set, to allow for future additions of new field that may 'one of' with this field.
66+
Any implementations that may consume this field may treat an unset value as the 'Standard' range.
6567
enum:
6668
- Critical
6769
- Standard

0 commit comments

Comments
 (0)