Improve diagnostics when types that cannot handle unknown or null are used #495

bendbennett · 2022-09-22T12:01:12Z

Closes: #191

… used (#191)

bflad · 2022-09-22T13:53:01Z

internal/fwschemadata/data_get_at_path_test.go

@@ -290,7 +291,7 @@ func TestDataGetAtPath(t *testing.T) {
 					path.Root("bool"),
 					"Value Conversion Error",
 					"An unexpected error was encountered trying to build a value. This is always an error in the provider. Please report the following to the provider developer:\n\n"+
-						"unhandled unknown value",
+						"Received unknown value for , however the current struct field type *bool cannot handle unknown values. Use types.Bool, or a custom type that supports unknown values instead.",


Seeing the missing path information here, I think we should update reflect.Into() to accept a path.Path parameter. This would cover GetAttribute(), etc. usage and allow us to remove some diag.WithPath() usage, which was a workaround. There's going to be a lot of other reflection code that won't have the path readily available still, so I think setting those calls to path.Empty() for now is okay, and we can see about potentially improving those later.

I've updated reflect.Into() to accept path.Path. I'm a little unsure which diag.WithPath() you're referring to removing so I have left these in place for now.

internal/reflect/into.go

bflad · 2022-09-22T15:23:04Z

internal/reflect/into.go

@@ -109,11 +111,19 @@ func BuildValue(ctx context.Context, typ attr.Type, val tftypes.Value, target re
 		// all that's left to us now is to set it as an empty value or
 		// throw an error, depending on what's in opts
 		if !opts.UnhandledUnknownAsEmpty {
-			err := fmt.Errorf("unhandled unknown value")
+			typTypeStr := reflect.TypeOf(typ).String()


Is this necessary for anonymous structs types? Given an attr.Type, I think we can just call typ.String() and anticipate folks implementing their types correctly, rather than potentially returning some confusing type details.

I've removed this in both unknown and null cases and have switched to using typ.String() as you suggested.

bflad

Almost there!

bflad · 2022-09-23T13:51:45Z

internal/fwschemadata/data_get_at_path.go

 	// reflect.Into does not have path information for its Diagnostics.
 	for idx, valueAsDiag := range reflectDiags {


Here's the diag.WithPath() handling that can now be removed since the above now correctly passed the path information. 👍

bflad · 2022-09-23T14:05:03Z

internal/fwschemadata/data_get_at_path_test.go

@@ -290,7 +291,8 @@ func TestDataGetAtPath(t *testing.T) {
 					path.Root("bool"),
 					"Value Conversion Error",
 					"An unexpected error was encountered trying to build a value. This is always an error in the provider. Please report the following to the provider developer:\n\n"+
-						"unhandled unknown value",
+						"Received unknown value, however the target type cannot handle unknown values. Use the corresponding `types` package type or a custom type that handles unknown values.\n\n"+
+						"Path: bool\nTarget Type: *bool\nSuggested Type: types.BoolType",


Ah, okay. The suggested type here should be types.Bool for now, but stripping the Type suffix would only temporarily work for the framework types types and isn't a long term solution. The attr.Type interface doesn't give us a way to really know the correct/suggested attr.Value type and maybe it should. Technically, we could use the attr.Value returned from the ValueFromTerraform method if called with a null value of the TerraformType, but custom types don't have to support null values if they're always required 🙁 . I'll create an issue to consider adding something to that interface so we can recommend things correctly.

For now though, I'm not sure if there's okay enough messaging we can give here to say "use the value type associated with the type". We might be able to merge this in as-is right now and try to fix that up before the next release.

We could alter the wording in the diagnostic until attr: Consider Adding ValueType Method to Type Interface is addressed.

Perhaps something like:

Path: path.String() Target Type: target.Type() Suggested `types` Type: typ.String() (use the value type associated with the type e.g., use types.Object for types.ObjectType) Suggested Pointer Type: *target.Type()

bflad · 2022-09-23T14:08:04Z

internal/reflect/into.go

@@ -22,7 +23,7 @@ import (
 // in the tftypes.Value must have a corresponding property in the struct. Into
 // will be called for each struct field. Slices will have Into called for each
 // element.
-func Into(ctx context.Context, typ attr.Type, val tftypes.Value, target interface{}, opts Options) diag.Diagnostics {
+func Into(ctx context.Context, typ attr.Type, val tftypes.Value, target interface{}, opts Options, path path.Path) diag.Diagnostics {


Now that the path.Path information is available in this function, we should replace the diags.AddError() call below with diags.AddAttributeError() and consider adding Path: path.String()\nError: err.Error() to the details.

…iags from call to reflect.Into (#191)

bflad

Looks good to me 🚀 (let's follow up with #496)

apparentlymart · 2022-09-23T16:29:13Z

internal/reflect/into.go

@@ -109,11 +111,12 @@ func BuildValue(ctx context.Context, typ attr.Type, val tftypes.Value, target re
 		// all that's left to us now is to set it as an empty value or
 		// throw an error, depending on what's in opts
 		if !opts.UnhandledUnknownAsEmpty {


I'm a little concerned that this particular design may behave as a confusing trap for provider developers, for the reason I shared over in #191 (comment) .

Specifically: whether a value is known or not during planning is under the control of the module author, not the provider developer. Therefore I think it's technically invalid to decode any plan-time value into a type that cannot support unknowns, regardless of whether the value happens to actually be unknown in a particular request.

Do you think it would be reasonable to add (in another PR of course, since I see this one is already closed!) an additional flag to opts like RequireHandlingUnknown bool, and then if that's set then we'd always return an error if the target type can't handle unknowns, regardless of whether the given val happens to be unknown or not?

I would expect RequireUnknownHandling to be true for any call that happens while the framework is handling a PlanResourceChange request, one of the several "validate" RPCs, and ConfigureProvider, because those are all situations where the module author can compel the provider to deal with unknown values. This restriction would not be required for ApplyResourceChange, ReadResource, or ReadDataSource, because Terraform Core guarantees that everything will be known by the time those are called. Therefore this additional restriction would not affect the most common case where a provider is only implementing apply-time logic and letting the framework's own logic handle the validation and planning steps.

(A more accurate way to say it is: "config" and "proposed new value" can always potentially contain unknowns, but prior state should never.)

My motivation here is to let provider developers know during their initial development that they need to handle this case, rather than them "getting away with it" because their test cases happen to all have hard-coded known values and then only later getting bug reports from their users when they pass in an unknown value and get this error. I assume provider developers would rather know about this requirement early, rather than needing to rework their logic later only once an end-user discovers it.

I'd personally be worried about introducing an option, which if treated as opt-in, that developers won't opt into or know to opt into. Most of our current data handling documentation glosses over some of the available options and for most developers they won't encounter them until they are using the framework-defined types collection types, which usually support them in methods like As(), ElementsAs(), etc.

Admittedly, this is probably the most confusing aspect of provider development with the framework. Very regularly, we see developers struggle with these concepts in Discuss, etc. The associated website documentation is quite confusing where it tries to describe all the various rules. Our best advice is typically to use the framework-defined types types, but its so easy to reach for standard Go types, slices of provider-defined types in the case of lists/sets, and structs (generally without pointers) of provider-defined types in the case of objects. As you can imagine, these all run into the trouble you mention.

I wonder if it may be better if we took a more wholesale step back and re-evaluated the type system to ensure correctness, while still supporting some of the "nice-ities" of the current type system. We have been circling the idea of requiring function calls for interacting with values, rather than accessing/setting struct fields to cover #447. Combining that with an always-enabled rule like you mention in the internal reflection code, might do the trick?

I only mention combined these right now because we're running very low on time to make substantial changes to the framework before a first major version is released and its easier to consider the whole subsystem at once.

A quick sketch of these combined ideas might be:

Requiring all framework-defined types or custom provider-defined types that interact with schema-based data must always support null and unknown values.

Requiring the creation of framework-defined value types through function calls so there is no method for creating invalid values.

Requiring the access of framework-defined value types through method calls which can default to zero-values if null/unknown (NOTE: the attr.Value interface already requires IsNull() bool and IsUnknown() bool methods, so there's no need to access any sort of weird pointer or sentinel value representations)

// Creating framework-defined value types func NullBool() Bool func NullFloat64() Float64 func NullInt64() Int64 func NullList() List func NullMap() Map func NullNumber() Number func NullObject() Object func NullSet() Set func NullString() String func UnknownBool() Bool func UnknownFloat64() Float64 func UnknownInt64() Int64 func UnknownList() List func UnknownMap() Map func UnknownNumber() Number func UnknownObject() Object func UnknownSet() Set func UnknownString() String func BoolValue(bool) Bool func Float64Value(float64) Float64 func Int64Value(int64) Int64 func ListValue([]attr.Value) List func MapValue(map[string]attr.Value) Map func NumberValue(*big.Float) Number func ObjectValue(struct) Object func SetValue([]attr.Value) Set func StringValue(string) String // Access framework-defined value type values func (b Bool) BoolValue() b func (f Float64) Float64Value() float64 // could consider Float32Value(), etc. func (i Int64) Int64Value() int64 // could consider IntValue(), etc. func (n Number) BigFloatValue() *big.Float // could consider Float64Value() with errors, etc. func (s String) StringValue() string // refer to existing collection type methods, such as As() // provider-defined types tfsdk.Schema{ Attributes: map[string]tfsdk.Attribute{ "example: { // One of Computed, Optional, Required: true, doesn't matter Type: types.StringType, }, }, } type A struct { Example string `tfsdk:"example"` } type B struct { Example *string `tfsdk:"example"` } type B struct { Example types.String `tfsdk:"example"` } var ( a A b B c C d string e *string f types.String ) (tfsdk.Config).Get(ctx, &a) // Always errors (tfsdk.Config).Get(ctx, &b) // Always errors (tfsdk.Config).Get(ctx, &c) // No error for this condition (tfsdk.Config).GetAttribute(ctx, path.Root("example"), &d) // Always errors (tfsdk.Config).GetAttribute(ctx, path.Root("example"), &e) // Always errors (tfsdk.Config).GetAttribute(ctx, path.Root("example"), &f) // No error for this condition (tfsdk.Plan).Get(ctx, &a) // Always errors (tfsdk.Plan).Get(ctx, &b) // Always errors (tfsdk.Plan).Get(ctx, &c) // No error for this condition (tfsdk.Plan).GetAttribute(ctx, path.Root("example"), &d) // Always errors (tfsdk.Plan).GetAttribute(ctx, path.Root("example"), &e) // Always errors (tfsdk.Plan).GetAttribute(ctx, path.Root("example"), &f) // No error for this condition (tfsdk.State).Get(ctx, &a) // Always errors (tfsdk.State).Get(ctx, &b) // Always errors (tfsdk.State).Get(ctx, &c) // No error for this condition (tfsdk.State).GetAttribute(ctx, path.Root("example"), &d) // Always errors (tfsdk.State).GetAttribute(ctx, path.Root("example"), &e) // Always errors (tfsdk.State).GetAttribute(ctx, path.Root("example"), &f) // No error for this condition

I think we can still be lenient when setting values as long as they follow the current reflection implementation.

Does this align with what you would expect?

To be a little clearer, I was understanding this reflect.Into function as being an internal implementation detail of the higher level API for fetching values from the config, plan, and state, and so I was therefore assuming that the "options" here are always generated by the framework's own code rather than from provider code. If that isn't true then I would agree that the details of what I proposed don't make sense!

I defer to you on the API design details since I'm still very new to the framework design! I think the scenarios you wrote out here seem to make sense as far as I can mentally evaluate them, although I think it may be possible to make the tfsdk.State API be more permissive and fail only if the specific given value happens to be wrong, because:

tfsdk.State should never encounter unknown values, I think?

By the time we're dealing in state, the provider is able to control whether null values are valid or not and so can in principle use Required in the schema or validation and/or plan-time rules to ensure that they'll never need to deal with null in a particular argument.

In the above I'm making the assumption that tfsdk.State is only used to describe final state values, such as what would be passed in as the prior state to ReadResource and PlanResourceChange, and what would be returned from ApplyResourceChange. It would be trickier if we ever use tfsdk.State to represent a "proposed new state" or a "planned new state", because of course those can potentially contain unknown values if the configuration contains unknown values.

On the other hand, I think it could potentially be reasonable to argue that it's better to be consistent and have the same rules everywhere, so that there's the most opportunity for code reuse. Particularly with Go 1.18 type parameters it seems possible to write a generic type that can represent a possibly-null, possibly-unknown value of any Go type so that there doesn't need to be a custom type for every combination, which may lower the burden of using these enough that it's reasonable to just make them always required. 🤔

type ValueState int const ( ValueConcrete ValueState = iota ValueNull ValueUnknown ) // (I don't know what would be a good name for this type, // so this is just a placeholder.) type SomethingValue[T any] { Value T State ValueState }

On the other hand, I think it could potentially be reasonable to argue that it's better to be consistent and have the same rules everywhere, so that there's the most opportunity for code reuse.

This is the direction that feels right to me personally -- having inconsistent ways to interact with "schema-based data" between the various RPCs, attribute configurability, and type of data (config, plan, state) makes for harder comprehension, documentation, etc. for provider developers. We can certainly document Terraform's general implementation details there if developers want to potentially shortcut some logic (like checking for unknown values when there cannot be any), but I think there are diminishing returns by trying too hard to support Go standard library types or provider-defined slices/structs directly in the reflection when they can be incompatible outside the control of the provider developer.

I will say, it's been very nice to do things like this even in the current framework design:

type exampleResourceModel struct { ExampleList types.List `tfsdk:"example_list"` ID types.String `tfsdk:"id"` } func (r exampleResource) Create(ctx context.Context, req resource.CreateRequest, resp *resource.CreateResponse) { var data exampleResourceModel // Read plan data into model resp.Diagnostics.Append(req.Plan.Get(ctx, &data)...) if resp.Diagnostics.HasError() { return } // Potentially interact with infrastructure // Set any Computed/Unknown values data.ID = types.String{Value: "example"} // Write state data from model resp.Diagnostics.Append(resp.State.Set(ctx, &data)...) }

It's easy to document on our side and a fairly succinct implementation, code-wise, in my opinion.

I created #498 to followup on this. 👍 Thank you for raising this here.

github-actions · 2022-10-24T02:47:17Z

I'm going to lock this pull request because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

Improve diagnostics when types that cannot handle unknown or null are…

e83bf91

… used (#191)

bflad reviewed Sep 22, 2022

View reviewed changes

bendbennett added 3 commits September 23, 2022 06:45

Adding path.Path parameter to reflect.Into func (#191)

f70aec7

Refactoring diagnostics message structure (#191)

bdda9ef

Merge branch 'main' into bendbennett/issues-191

c950f34

bendbennett marked this pull request as ready for review September 23, 2022 08:22

bendbennett requested a review from a team as a code owner September 23, 2022 08:22

bflad added this to the v0.14.0 milestone Sep 23, 2022

bflad added tech-debt Issues tracking technical debt that we're carrying. reflection Issues and PRs about the reflection subsystem used to convert between attr.Values and Go values. labels Sep 23, 2022

bflad reviewed Sep 23, 2022

View reviewed changes

bflad mentioned this pull request Sep 23, 2022

attr: Consider Adding ValueType Method to Type Interface #496

Closed

bendbennett added 2 commits September 23, 2022 15:38

Replacing diag error with atttribute error (#191)

af8ae72

Removing usage of diag with path as path information is returned in d…

c60d94a

…iags from call to reflect.Into (#191)

bflad approved these changes Sep 23, 2022

View reviewed changes

bendbennett merged commit 123c67f into main Sep 23, 2022

bendbennett deleted the bendbennett/issues-191 branch September 23, 2022 15:16

bflad mentioned this pull request Sep 23, 2022

attr: Add ValueType method to Type interface #497

Merged

apparentlymart reviewed Sep 23, 2022

View reviewed changes

This was referenced Sep 23, 2022

Consider Always Returning Errors with Get/GetAttribute for Target Types Without Unknown Support #498

Open

types: Consider changing value types zero-values to null or unknown #447

Closed

github-actions bot locked as resolved and limited conversation to collaborators Oct 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve diagnostics when types that cannot handle unknown or null are used #495

Improve diagnostics when types that cannot handle unknown or null are used #495

bendbennett commented Sep 22, 2022

bflad Sep 22, 2022

bendbennett Sep 23, 2022

bflad Sep 22, 2022

bendbennett Sep 23, 2022

bflad left a comment

bflad Sep 23, 2022

bflad Sep 23, 2022

bendbennett Sep 23, 2022

bflad Sep 23, 2022

bflad left a comment

apparentlymart Sep 23, 2022 •

edited

Loading

bflad Sep 23, 2022 •

edited

Loading

apparentlymart Sep 23, 2022 •

edited

Loading

bflad Sep 23, 2022

bflad Sep 23, 2022

github-actions bot commented Oct 24, 2022

		// reflect.Into does not have path information for its Diagnostics.
		for idx, valueAsDiag := range reflectDiags {

Improve diagnostics when types that cannot handle unknown or null are used #495

Improve diagnostics when types that cannot handle unknown or null are used #495

Conversation

bendbennett commented Sep 22, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bflad left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bflad left a comment

Choose a reason for hiding this comment

apparentlymart Sep 23, 2022 • edited Loading

Choose a reason for hiding this comment

bflad Sep 23, 2022 • edited Loading

Choose a reason for hiding this comment

apparentlymart Sep 23, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Oct 24, 2022

apparentlymart Sep 23, 2022 •

edited

Loading

bflad Sep 23, 2022 •

edited

Loading

apparentlymart Sep 23, 2022 •

edited

Loading