Skip to content

Feature: Serverless Endpoint #193

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Jan 20, 2023
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 5 additions & 5 deletions apis/v1alpha1/ack-generate-metadata.yaml
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
ack_generate_info:
build_date: "2023-01-04T22:46:39Z"
build_hash: 12246c7da82841b351ec7a9e1f139f9338f2784b
build_date: "2023-01-19T03:37:00Z"
build_hash: 1b20baf45a0b73a11b296050322a384c705fa897
go_version: go1.17.13
version: v0.21.0
api_directory_checksum: 19b63f2ee6b7e14b2f6d254a6caff0b9867539e5
version: v0.22.0
api_directory_checksum: c38df824c3c6e41cd4eaa559dccc4c05c7e8d075
api_version: v1alpha1
aws_sdk_go_version: v1.44.117
generator_config_info:
file_checksum: fd845048bf7dc84ecd3ea1ed1784b831d8f42645
file_checksum: e3688a5487a10dccd7a5e488de025d9489043f7d
original_file_name: generator.yaml
last_modification:
reason: API generation
4 changes: 3 additions & 1 deletion apis/v1alpha1/generator.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,9 @@ resources:
Tags:
compare:
is_ignored: true
hooks:
delta_pre_compare:
code: customSetDefaults(a, b)
Endpoint:
reconcile:
requeue_on_success_seconds: 30
Expand Down Expand Up @@ -1017,7 +1020,6 @@ ignore:
shape_names:
# RSessionAppSettings is an empty struct that causes generation errors
- RSessionAppSettings
- ProductionVariantServerlessConfig
- ExecutionRoleIdentityConfig
- HyperParameterTuningResourceConfig
- InstanceMetadataServiceConfiguration
Expand Down
52 changes: 34 additions & 18 deletions apis/v1alpha1/types.go

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

50 changes: 50 additions & 0 deletions apis/v1alpha1/zz_generated.deepcopy.go

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Original file line number Diff line number Diff line change
Expand Up @@ -167,6 +167,17 @@ spec:
type: integer
modelName:
type: string
serverlessConfig:
description: Specifies the serverless configuration for an endpoint
variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
variantName:
type: string
volumeSizeInGB:
Expand Down
44 changes: 44 additions & 0 deletions config/crd/bases/sagemaker.services.k8s.aws_endpoints.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -268,6 +268,17 @@ spec:
currentInstanceCount:
format: int64
type: integer
currentServerlessConfig:
description: Specifies the serverless configuration for
an endpoint variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
currentWeight:
type: number
deployedImages:
Expand Down Expand Up @@ -295,6 +306,17 @@ spec:
desiredInstanceCount:
format: int64
type: integer
desiredServerlessConfig:
description: Specifies the serverless configuration for
an endpoint variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
desiredWeight:
type: number
instanceType:
Expand Down Expand Up @@ -332,6 +354,17 @@ spec:
currentInstanceCount:
format: int64
type: integer
currentServerlessConfig:
description: Specifies the serverless configuration for an endpoint
variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
currentWeight:
type: number
deployedImages:
Expand Down Expand Up @@ -359,6 +392,17 @@ spec:
desiredInstanceCount:
format: int64
type: integer
desiredServerlessConfig:
description: Specifies the serverless configuration for an endpoint
variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
desiredWeight:
type: number
variantName:
Expand Down
4 changes: 3 additions & 1 deletion generator.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,9 @@ resources:
Tags:
compare:
is_ignored: true
hooks:
delta_pre_compare:
code: customSetDefaults(a, b)
Endpoint:
reconcile:
requeue_on_success_seconds: 30
Expand Down Expand Up @@ -1017,7 +1020,6 @@ ignore:
shape_names:
# RSessionAppSettings is an empty struct that causes generation errors
- RSessionAppSettings
- ProductionVariantServerlessConfig
- ExecutionRoleIdentityConfig
- HyperParameterTuningResourceConfig
- InstanceMetadataServiceConfiguration
Expand Down
11 changes: 11 additions & 0 deletions helm/crds/sagemaker.services.k8s.aws_endpointconfigs.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -167,6 +167,17 @@ spec:
type: integer
modelName:
type: string
serverlessConfig:
description: Specifies the serverless configuration for an endpoint
variant.
properties:
maxConcurrency:
format: int64
type: integer
memorySizeInMB:
format: int64
type: integer
type: object
variantName:
type: string
volumeSizeInGB:
Expand Down
Loading