Initial description of annotation collection #610

handrews · 2018-06-17T02:53:46Z

NOTE: This PR uses language added in PR #609, which was recently merged.

This covers the basics of annotation collection (#530), and its interaction
with assertions and applicators. It does not get into the exact
output structure for annotation results, which is intentional.
The goal is to establish the process, and then nail down the
format once the process is solid.

awwright · 2018-06-28T08:36:31Z

jsonschema-core.xml

+            <t>
+                Annotations are collected by keywords that explicitly define
+                annotation-collecting behavior.  Note that boolean schemas cannot
+                produce annotations as they do not make use of keywords.


Is this not obvious enough?

Eh, to you and me, yes, but I can see someone asking about the boolean schemas contributing to an annotation that indicates that a boolean schema was used. Or something. I think it's harmless to state explicitly, but let me know if you think it is confusing.

awwright · 2018-06-28T08:37:49Z

jsonschema-core.xml

+                A collected annotation MUST include the following information:
+                <list>
+                    <t>
+                        The name of the annotation, which MUST be identical to the keyword


How about simply "The name of the keyword that produced the annotation"

Do you mean

The name of the annotation, which MUST be the name of the keyword that produced it

because I would be fine with that.

Never mind, I see what you mean. Will update.

awwright · 2018-06-28T08:38:45Z

jsonschema-core.xml

+                </list>
+            </t>
+            <t>
+                If multiple schema locations attach values to the same instance location,


What's an "instance location"? Maybe just "instance"?

An instance is a document, an instance location may be the entire document, or may be some subset of the document which can be identified by a JSON Pointer. I use "instance location" a lot, and it is definitely not the same as "instance."

awwright · 2018-06-28T08:42:26Z

jsonschema-core.xml

+                For example, one application may consider a "description" annotation that is
+                in the same schema object as a "$ref" to override any "description" in the
+                reference's target schema.  A different application may prefer to concatenate
+                all "description" annotations based on whatever ordering it defines.


If we're going to specify sometimes annotations can override one another, there should be a deterministic algorithm defined. Consider how CSS ranks which property is highest priority.

Presenting all the generated values as equals is probably the most theoretically pure way, but I'm not sure how practical this is for schema authors. I don't think I'll have a solid opinion on this until we see implementations.

This approach comes from your repeated refusal to define how multiple values of default get used, among other things. Which initially bothered me, but I eventually came to agree with you. Your argument was that different applications may want to handle it differently, which made sense- at the time, I think I had proposed three different variations on default for different applications, which was ridiculous.

So that flexibility is what I am codifying here. Implementations should present enough information to applications so that they can choose their own usage. This is not CSS, and I do not see a one size fits all solution. Some keywords have obvious usage patterns, but those patterns are not the same for all keywords. And some keywords are not obvious. And some should arguably result in an application-level error as unresolvable.

When I have run this by other groups (such as OpenAPI) the flexibility has generally been seen as a positive. People have to get used to the idea, but once they do it seems to make sense to folks. I don't have a huge sample set here, but there's only so much begging and pleading I can do for feedback. Publishing a draft is the only thing that really provokes a lot of feedback.

Presenting all the generated values as equals is probably the most theoretically pure way, but I'm not sure how practical this is for schema authors.

They are not being presented as equals, they are being presented with their schema location, which allows applications to make their own distinctions.

I don't think I'll have a solid opinion on this until we see implementations.

The best way to get the broadest feedback from implementors is to avoid specifying a precedence. People will either make use of the flexibility, or we will learn that there is, actually, a clear typical usage, to the point that standardizing it will be beneficial. And then we can do that.

@awwright I'm going to add an example to make it more clear that we're not just punting on the question of precedence, but rather providing tools for applications to define their own in a clear way.

awwright · 2018-06-28T08:47:51Z

jsonschema-core.xml

+                    If any keyword in a schema object produces a false assertion
+                    result, then all annotations from all keywords in that schema
+                    object, and any of its subschemas or referenced schemas, MUST
+                    be discarded.


This could be shortened for clarity, I think.
"Annotations are only generated from instances that validate against the schema."

And care is needed so we don't confuse people implementing "not". The note about "not" is only (and should only be) non-normative.

This wording is very deliberate. It may need clarification, but simply saying "annotations are only generated from instances that validate against the schema" is insufficient. There is a process going on of walking both the schema and instance, and it matters when and where in the process the annotations are discarded. I don't think there is actually another process that makes sense, but we should be clear about it rather than making implementors each re-derive it.

I'm happy to handle the advice on "not" differently. We have not, to date, been all that rigorous about normative and non-normative, so I'd like to consider that question holistically rather than block this PR on it. I'm sure there are other things that have crept in that are non-normative. Probably a lot of Hyper-Schema since there are so many examples and "here's how to use it with HTTP" guidance in there. Feel free to file an issue on this, or we can just consider it during final review.

I can remove the "not" part entirely if you are concerned about leaving it in. I would rather do that than hold up the rest of this PR, and then come back to the question of "not".

@awwright I'm going to remove most of the "Annotations and Applicators" section to get rid of the details around not and similar cases, and try to clarify why I'm using the "must be discarded" language in the "Annotations and Assertions" section.

This covers the basics of annotation collection, and its interaction with assertions and applicators. It does not get into the exact output structure for annotation results, which is intentional. The goal is to establish the process, and then nail down the format once the process is solid.

handrews · 2018-06-30T20:57:58Z

@awwright I've made several updates and tried to incorporate most of your feedback. I've removed a number of potentially confusing details- hopefully people will understand how they follow from the simpler descriptions.

I also added an example of how an application might make use of multiple values. Which hopefully will make the point and usefulness more clear. If you still really want the spec to define behavior for every keyword, let's split that out into a new issue (in which case I will remove the controversial language from this PR so that we can merge the uncontroversial parts).

The two things I did not change are:

the line about boolean schemas. I agree that it should be obvious, but since boolean schemas produce assertion results despite not having keywords, I feel like it needs to be explicit that they do not produce annotation results. Even though I can't think of what they would possibly produce.
"instance location", which is terminology that, along with "schema location", I've been using extensively across multiple PRs. It appears in all three documents at this point. If you think there is a serious problem with it, please file a new issue and we will work it out. But it is not specific to this PR.

awwright · 2018-07-04T09:41:18Z

jsonschema-core.xml

+                A collected annotation MUST include the following information:
+                <list>
+                    <t>
+                        The name of the keyword that defines the annotation


Would "produces" be more accurate than "defines"?

Yeah, I'll change that and then commit, thanks!

* Simplify wording around annotation keywords and names * Remove potentially confusing details about the collection process, particularly regarding applicators such as "not" * Add an example to clarify why the handling of multiple annotation values is deferred to applications, and show how applications might make use of this flexibility.

vearutop · 2018-07-10T22:17:10Z

If validation error response is also powered by annotations, then boolean schema that produces the error should be in annotation, shouldn't it?

handrews added Type: Enhancement core annotation labels Jun 17, 2018

handrews added this to the draft-08 milestone Jun 17, 2018

handrews requested review from philsturgeon, gregsdennis and a team June 17, 2018 02:53

gregsdennis approved these changes Jun 17, 2018

View reviewed changes

dlax approved these changes Jun 18, 2018

View reviewed changes

philsturgeon approved these changes Jun 18, 2018

View reviewed changes

handrews mentioned this pull request Jun 20, 2018

Rename post-split "dependencies" to "schemaDependencies" #591

Closed

handrews changed the base branch from annot-results to master June 21, 2018 04:43

awwright reviewed Jun 28, 2018

View reviewed changes

handrews force-pushed the annot-collect branch from d8ea3c7 to 97d871b Compare June 30, 2018 20:50

handrews mentioned this pull request Jul 1, 2018

Standard schema for results, including errors #396

Closed

awwright approved these changes Jul 6, 2018

View reviewed changes

handrews force-pushed the annot-collect branch from 97d871b to 9aaa8b5 Compare July 7, 2018 21:17

handrews merged commit d605d04 into json-schema-org:master Jul 8, 2018

handrews deleted the annot-collect branch December 17, 2018 02:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial description of annotation collection #610

Initial description of annotation collection #610

handrews commented Jun 17, 2018 •

edited

Loading

awwright Jun 28, 2018

handrews Jun 29, 2018

awwright Jun 28, 2018

handrews Jun 29, 2018

handrews Jun 30, 2018

awwright Jun 28, 2018

handrews Jun 29, 2018

awwright Jun 28, 2018

handrews Jun 29, 2018

handrews Jun 30, 2018

awwright Jun 28, 2018

handrews Jun 29, 2018

handrews Jun 30, 2018

handrews commented Jun 30, 2018

awwright Jul 4, 2018

handrews Jul 7, 2018

vearutop commented Jul 10, 2018

Initial description of annotation collection #610

Initial description of annotation collection #610

Conversation

handrews commented Jun 17, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

handrews commented Jun 30, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vearutop commented Jul 10, 2018

handrews commented Jun 17, 2018 •

edited

Loading