fix: supplying hyperparameters to training step constructor drops hyperparameters specified in estimator #144

shivlaks · 2021-06-14T21:45:13Z

Summary

Hyperparameters can be specified in two places: on the estimator object and through the hyperparameters property.
Both are passed to the constructor of the TrainingStep class.

The current behaviour drops any hyperparameters that were specified in the estimator if the property
is set in the TrainingStep constructor. This is undesirable because estimators often specify algorithm-specific hyperparameters out of the box that we don't want to drop.

This change merges the hyperparameters supplied in the constructor with those of the estimator that is used in TrainingStep.
If there are duplicate keys, the values specified in the constructor are used.
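
The merge semantics described above can be sketched with plain dictionaries. This is a minimal illustration under assumptions, not the SDK's code: `merge_hyperparameters` is a hypothetical helper, and the keys and values are made up for the example.

```python
def merge_hyperparameters(estimator_hyperparameters, constructor_hyperparameters):
    """Merge two hyperparameter dicts; constructor values win on duplicate keys.

    Hypothetical helper for illustration only; not part of the SDK.
    """
    merged = {}
    # Start from whatever the estimator provides (it may be None).
    if estimator_hyperparameters is not None:
        merged.update(estimator_hyperparameters)
    # Constructor-supplied values are applied last, so they take precedence.
    if constructor_hyperparameters is not None:
        merged.update(constructor_hyperparameters)
    return merged


# Illustrative values only.
from_estimator = {"sagemaker_program": "train.py", "epochs": "10"}
from_constructor = {"epochs": "20", "batch_size": "64"}

merged = merge_hyperparameters(from_estimator, from_constructor)
print(merged)
```

Keys that appear only in the estimator survive the merge, which is the behaviour the old code lost by assigning the constructor dict directly.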

Closes #99, #72

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

wong-a · 2021-06-14T23:46:44Z

src/stepfunctions/steps/sagemaker.py

-            parameters['HyperParameters'] = hyperparameters
+            merged_hyperparameters = {}
+            if estimator.hyperparameters() is not None:
+                merged_hyperparameters.update(estimator.hyperparameters())


re:

should we warn/error instead of an in-place merge when we find duplicate keys?

Throwing an error wouldn't be helpful; it's more breaking (you can't create what you could before) and doesn't address the desired functionality in #99. Logging at an INFO level may be helpful for duplicate keys.

We should also document, if it isn't already, that parameters gets totally overridden by training_config. This isn't the case for all service integration steps. Maybe we should adopt an update strategy there too?

I was originally thinking that we should log it as a warning, since INFO generally also includes a lot of junk, but I'm not strongly opinionated.

I'll re-spin the update calls to assemble the dicts so that they log something on duplicate keys.
absolutely agree that we need to document.
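
A rough sketch of what assembling the dicts with logging on duplicate keys could look like. Everything here is an assumption for illustration: the helper name, the logger name, and the message wording are hypothetical, and the log level is left as a parameter because the thread does not settle on INFO versus WARNING.

```python
import logging

logger = logging.getLogger("hyperparameter_merge_sketch")  # hypothetical logger name


def merge_and_log_duplicates(estimator_hyperparameters, constructor_hyperparameters,
                             level=logging.INFO):
    """Merge hyperparameters, logging each key that is defined in both places.

    Hypothetical helper; constructor values take precedence on duplicates.
    """
    merged = dict(estimator_hyperparameters or {})
    for key, value in (constructor_hyperparameters or {}).items():
        if key in merged:
            # Report the override instead of raising, so existing workflows keep working.
            logger.log(
                level,
                "Hyperparameter %r is set in both the estimator and the constructor; "
                "using the constructor value %r",
                key,
                value,
            )
        merged[key] = value
    return merged


logging.basicConfig(level=logging.INFO)
merged = merge_and_log_duplicates({"epochs": "10", "lr": "0.1"}, {"epochs": "20"})
```

Passing `level=logging.WARNING` would switch to the warning behaviour discussed above without changing the merge itself.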

This isn't the case for all service integration steps. Maybe we should adopt an update strategy there too?

great call. wasn't on my radar, but I'm in favour of adopting the update strategy. The more consistent it is across things in the SDK, the more intuitive and idiomatic it will feel for users.

I'll make the changes to service integration steps in a separate PR. let me know if you had a different thought/idea of where we should be documenting this behaviour @wong-a

I meant the other way around actually. In all service integration steps besides sagemaker, the constructor accepts a parameters argument that becomes Parameters. We can update the sagemaker step classes to accept a parameters dict, which can be an escape hatch for full API coverage or override any explicitly exposed arguments.

For example, with DynamoDBUpdateItem the caller must specify all parameters in the parameters field. The constructor doesn't have a table_name argument, or arguments for other required fields:
https://github.com/aws/aws-step-functions-data-science-sdk-python/blob/main/src/stepfunctions/steps/service.py#L140-L167

Whereas the sagemaker steps always construct parameters using the sagemaker SDK and some special arguments in the constructor. You could provide parameters because the constructor accepts **kwargs, but it won't do anything.
https://github.com/aws/aws-step-functions-data-science-sdk-python/blob/main/src/stepfunctions/steps/sagemaker.py#L477-L479
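
The escape hatch proposed above might look roughly like the following. This is only a sketch under assumptions: `ExampleTrainingStep` is a hypothetical stand-in rather than an SDK class, `generated_parameters` represents whatever the step would build from the estimator and its other constructor arguments, and the field values are illustrative.

```python
class ExampleTrainingStep:
    """Hypothetical stand-in for a SageMaker step class; not the SDK's implementation."""

    def __init__(self, state_id, generated_parameters, parameters=None):
        self.state_id = state_id
        # Start from the parameters the step generates on its own.
        merged = dict(generated_parameters)
        # Caller-supplied parameters are applied last, so they can add fields the
        # constructor does not expose or override the generated ones.
        # Note: this is a shallow merge, so a nested dict is replaced as a whole.
        if parameters is not None:
            merged.update(parameters)
        self.parameters = merged


step = ExampleTrainingStep(
    "Training",
    generated_parameters={
        "TrainingJobName": "example-job",
        "HyperParameters": {"epochs": "10"},
    },
    parameters={"EnableNetworkIsolation": True},
)
print(step.parameters)
```

Whether the override should be shallow, as here, or merge nested fields is a design choice the thread leaves open.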

I see what you mean. I'm in favour of adding that parameters property and will address it in another PR. Escape hatches are powerful because they give users a path forward without requiring first-class support to be developed and released.

Yeah, that's out of scope of this PR. Can you create an issue for tracking?

tests/unit/test_sagemaker_steps.py

wong-a

LGTM besides making documentation clearer

src/stepfunctions/steps/sagemaker.py

wong-a

< ship it >
 ---------
        \   ^__^
         \  (oo)\_______
            (__)\       )\/\
                ||----w |
                ||     ||

StepFunctions-Bot · 2021-06-18T14:45:19Z

AWS CodeBuild CI Report

CodeBuild project: AutoBuildProject6AEA49D1-sEHrOdk7acJc
Commit ID: c431f7c
Result: SUCCEEDED
Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

bump for `v2.2.0` which includes the following changes:

Features
* Placeholders in TrainingStep to set S3 location for InputDataConfig and OutputDataConfig (#142)
* EventBridge service integration (#147)

Fixes
* supplying hyperparameters to training step constructor drops hyperparameters specified in estimator (#144)

fix: supplying hyperparameters to training step constructor drops hyp…

d672d7d

…erparameters specified in estimator

shivlaks requested a review from wong-a June 14, 2021 21:45

wong-a reviewed Jun 14, 2021

feedback and update tests

efeaf21

shivlaks changed the title ~~[DRAFT] fix: supplying hyperparameters to training step constructor drops hyperparameters specified in estimator~~ fix: supplying hyperparameters to training step constructor drops hyperparameters specified in estimator Jun 17, 2021

shivlaks marked this pull request as ready for review June 17, 2021 16:43

wong-a suggested changes Jun 17, 2021

src/stepfunctions/steps/sagemaker.py (outdated)

clear up the documentation

d34128b

wong-a previously approved these changes Jun 17, 2021

ca-nguyen previously approved these changes Jun 18, 2021

dummy commit: remove extra space

c431f7c

shivlaks dismissed stale reviews from ca-nguyen and wong-a via c431f7c June 18, 2021 05:40

ca-nguyen approved these changes Jun 18, 2021

shivlaks merged commit 349fc11 into main Jun 18, 2021

wong-a mentioned this pull request Jun 18, 2021

feat: Adds support for Placeholders in TrainingStep to set S3 location for InputDataConfig and OutputDataConfig #142

Merged

This was referenced Jun 18, 2021

Tensorflow estimator script mode not working if we pass HyperParameters as part of TrainingStep #72

Closed

chore: bump version to v2.2.0 #149

Merged
