-
Notifications
You must be signed in to change notification settings - Fork 39
enhancement: External validations for Operator SDK #98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 12 commits
290ea24
279a347
cc99c9e
2817e7e
5e1fe8b
57054b2
ef2e918
18067b3
793fa25
aab11a8
a6e0751
ae285c3
d7f7225
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||||
---|---|---|---|---|---|---|---|---|
@@ -0,0 +1,376 @@ | ||||||||
--- | ||||||||
title: sdk-external-and-pluggable-validations | ||||||||
authors: | ||||||||
- "@jmrodri" | ||||||||
reviewers: | ||||||||
- "@camilamacedo86" | ||||||||
- "@joelanford" | ||||||||
- "@bparees" | ||||||||
approvers: | ||||||||
- "@camilamacedo86" | ||||||||
- "@joelanford" | ||||||||
creation-date: 2021-10-01 | ||||||||
last-updated: 2021-10-01 | ||||||||
status: implementable | ||||||||
see-also: | ||||||||
- "/enhancements/this-other-neat-thing.md" | ||||||||
--- | ||||||||
|
||||||||
# SDK External and Pluggable Validations | ||||||||
|
||||||||
## Release Signoff Checklist | ||||||||
|
||||||||
- [X] Enhancement is `implementable` | ||||||||
- [X] Design details are appropriately documented from clear requirements | ||||||||
- [ ] Test plan is defined | ||||||||
- [ ] Graduation criteria for dev preview, tech preview, GA | ||||||||
|
||||||||
## Open Questions [optional] | ||||||||
|
||||||||
1. can we use scorecard to replace all of these validations? | ||||||||
* scorecard uses the cluster to run the tests, these validations typically | ||||||||
run locally or in a pipeline. They are also done before the expensive | ||||||||
operator tests are run. | ||||||||
1. what would it take to convert an existing validator to an executable format? | ||||||||
Simply add a `main.go` to wrap existing validators. | ||||||||
1. do we need to add `json` tags to [these structs][errors-go]? | ||||||||
|
||||||||
## Summary | ||||||||
|
||||||||
Today, validations used by Container Verification Pipeline (CVP) are compiled into | ||||||||
the `operator-sdk`. Any changes to the validation rules requires a release of | ||||||||
operator-framework/api followed by a release of `operator-sdk`. This proposal | ||||||||
attempts to design a way where the validations can be hosted in their own repos | ||||||||
as well as updated without requiring new releases of the `operator-sdk`. | ||||||||
|
||||||||
The validator is defined in the operator-framework/api, for example here is the | ||||||||
[`OperatorHubValidator`][operator-hub-validator]. In the `operator-sdk`, it is | ||||||||
"registered" as an optional validator. | ||||||||
|
||||||||
```go | ||||||||
var optionalValidators = validators{ | ||||||||
{ | ||||||||
Validator: apivalidation.OperatorHubValidator, | ||||||||
name: "operatorhub", | ||||||||
labels: map[string]string{ | ||||||||
nameKey: "operatorhub", | ||||||||
suiteKey: "operatorframework", | ||||||||
}, | ||||||||
desc: "OperatorHub.io metadata validation. ", | ||||||||
}, | ||||||||
... | ||||||||
``` | ||||||||
|
||||||||
When the validators are specified on the CLI, we call `run()` on the | ||||||||
[validators][optional-validators]. In that `run()` method we loop through the | ||||||||
validaors specified by `vals`. On each validator we invoke `Validate` with the | ||||||||
list of objects to validate. | ||||||||
|
||||||||
```go | ||||||||
... | ||||||||
for _, v := range vals { | ||||||||
if sel.Matches(labels.Set(v.labels)) { | ||||||||
results = append(results, v.Validate(objs...)...) | ||||||||
} | ||||||||
} | ||||||||
... | ||||||||
``` | ||||||||
|
||||||||
## Motivation | ||||||||
|
||||||||
Every time the business changes validation rules, it requires an update to the | ||||||||
operator-framework/api library. A release of said library, then it needs to get | ||||||||
included into the operator-sdk. Then a release of the SDK needs to be cut in | ||||||||
order for the new validation rule change to be usable. | ||||||||
|
||||||||
This process slows down the business by having to wait for this release process | ||||||||
to occur for what could be a very small rule change. It also causes downstream | ||||||||
rules to require an immediate upstream operator-sdk release, which may | ||||||||
otherwise be as far as 3 weeks away. It is difficult to explain to the community | ||||||||
why we need a new release for a CVP need. | ||||||||
|
||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
Having the validators external to the SDK will allow for vendor specific | ||||||||
validators to be created allowing for greater flexibility. | ||||||||
|
||||||||
### Goals | ||||||||
|
||||||||
* Allow validations to be updated when the business needs them to be | ||||||||
* Allow validations to release when the authors need them to be | ||||||||
* Do not require newer builds of the operator-sdk to get updated rules | ||||||||
* Allow validations to be hosted in their own repos | ||||||||
* Allow validations to be external to operator-sdk | ||||||||
|
||||||||
### Non-Goals | ||||||||
|
||||||||
* Migrate existing validators to the new format | ||||||||
|
||||||||
## Proposal | ||||||||
|
||||||||
### User Stories [optional] | ||||||||
|
||||||||
#### Story 1 | ||||||||
* as a CVP admin I would like to change the validation for XXX and have it | ||||||||
useable as soon as I can do a validator release. | ||||||||
|
||||||||
#### Story 2 | ||||||||
* as a validation author, I can write a validation for a bundle and use it | ||||||||
without having to get a new operator-sdk release | ||||||||
|
||||||||
#### Story 3 | ||||||||
* Users want to edit/add/remove a set of validation rules for Operator bundle validation. | ||||||||
|
||||||||
#### Story 4 | ||||||||
* Users can host a set of validation rules either locally or in remote. | ||||||||
|
||||||||
#### Story 5 | ||||||||
* Users can define the target set of validation rules for SDK's bundle validation command (either from local or remote source). | ||||||||
|
||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. By generating the bundle via SDK we need to check it to ensure that the spec is correct. In this way, all tests shipped under operator-framework/api must still be provided via SDK by default. I think we have the following uses cases that we still need to comply with:
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @camilamacedo86 I still think there is value for the compiled in checks. Though we may have to think of a better way to update those. |
||||||||
### Implementation Details/Notes/Constraints [optional] | ||||||||
|
||||||||
There are a few alternatives, but I think wrapping the validations in their | ||||||||
own executables makes sense. It aligns with the work we've done with | ||||||||
[Phase 2][phase-2] plugins. It allows the most flexibility in terms of | ||||||||
implementation. It also has a simple API - | ||||||||
input: bundle dir; output: ManifestResult JSON. | ||||||||
|
||||||||
Wrapping validations in their own executable simply means that there | ||||||||
needs to be something that the operator-sdk can run like a shell script | ||||||||
or a binary. | ||||||||
|
||||||||
The validation executable will need to accept a single input value which | ||||||||
is the bundle directory. The output will need to be a ManifestResult JSON. This | ||||||||
JSON will be parsed by the operator-sdk converted and output as a Result. | ||||||||
|
||||||||
For example, the existing validations could easily be wrapped with a `main.go` | ||||||||
file and compiled into a binary. They could be copied to their own repos for | ||||||||
easier release. Now you might be thinking that this means we'll have many | ||||||||
releases still, we actually would only have one (1) release which is the | ||||||||
validator itself. For go based validators this might mean a new binary, but you | ||||||||
could also release your validator in python and make it a single file or even a | ||||||||
shell script. | ||||||||
|
||||||||
From the operator-sdk's point of view, we don't care what you use | ||||||||
to create your validator only that we can run it and pass it a bundle directory. | ||||||||
|
||||||||
#### Validators | ||||||||
|
||||||||
##### Running validators | ||||||||
|
||||||||
The [existing validators](#default-validators-run-by-operator-sdk) operate | ||||||||
on a bundle. Each validator should be some executable that accepts a | ||||||||
single argument, the bundle root directory. So `operator-sdk` will pass them | ||||||||
the bundle directory. It will be the validator's responsibility to parse | ||||||||
the bundle to get what it needs. | ||||||||
|
||||||||
For example, the validator executable will be invoked from the | ||||||||
`operator-sdk` in the following manner: | ||||||||
|
||||||||
``` | ||||||||
/path/to/validator/executable /path/to/bundle | ||||||||
``` | ||||||||
|
||||||||
A concrete example might look like this: | ||||||||
|
||||||||
``` | ||||||||
./validator-poc /home/user/dev/gatekeeper-operator/bundle | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
``` | ||||||||
|
||||||||
The actual implementation is up to the author. Here we have an example of | ||||||||
the [`OperatorHubValidator`][validator-poc1] as a Go binary. | ||||||||
|
||||||||
The validators can be written in any language but there must be an executable | ||||||||
entry point that accepts a single bundle path as a CLI argument. For example, | ||||||||
you can write your validator in python but you would want to make the main | ||||||||
python file executable or supply a shell script to be invoked. | ||||||||
|
||||||||
##### Validator results | ||||||||
|
||||||||
As stated earlier, each validator should be some executable that accepts a | ||||||||
single argument, the bundle root directory. Th validators should also output | ||||||||
ManifestResult JSON to stdout. | ||||||||
|
||||||||
Because the [existing validators](#default-validators-run-by-operator-sdk) | ||||||||
currently return a [ManifestResult][manifest-result], it seems logical that we | ||||||||
use the same object as JSON for the output. | ||||||||
|
||||||||
The validator executable should exit non-zero ONLY if the entrypoint failed to | ||||||||
run NOT if the bundle validation fails. | ||||||||
|
||||||||
For example, let's say the validator is given a path to the gatekeeper bundle. | ||||||||
The validator should validate the given bundle and output the ManifestResult JSON. | ||||||||
Here is an example of a run: | ||||||||
|
||||||||
```json | ||||||||
{ | ||||||||
"Name": "gatekeeper-operator.v0.2.0-rc.3", | ||||||||
"Errors": null, | ||||||||
"Warnings": [ | ||||||||
{ | ||||||||
"Type": "CSVFileNotValid", | ||||||||
"Level": "Warning", | ||||||||
"Field": "", | ||||||||
"BadValue": "", | ||||||||
"Detail": "(gatekeeper-operator.v0.2.0-rc.3) csv.Spec.minKubeVersion is not informed. It is recommended you provide this information. Otherwise, it would mean that your operator project can be distributed and installed in any cluster version available, which is not necessarily the case for all projects." | ||||||||
} | ||||||||
] | ||||||||
} | ||||||||
``` | ||||||||
Comment on lines
+204
to
+218
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Are we going to expose this as a go type that validator implementers will use? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @ryantking yes, we are going to expose it. There is a link to it in the EP: https://github.com/operator-framework/api/blob/master/pkg/validation/errors/error.go#L9-L16 That's already accessible from SDK. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Let me know if I need to call that out better. |
||||||||
|
||||||||
The above JSON will be read by the `operator-sdk` during `bundle validate` | ||||||||
command and output the results as it does today. The example below shows what | ||||||||
`operator-sdk bundle validate` would printout if given the ManifestResult from | ||||||||
above. | ||||||||
|
||||||||
```json | ||||||||
{ | ||||||||
"passed": true, | ||||||||
"outputs": [ | ||||||||
{ | ||||||||
"type": "warning", | ||||||||
"message": "Warning: Value : (gatekeeper-operator.v0.2.0-rc.3) csv.Spec.minKubeVersion is not informed. It is recommended you provide this information. Otherwise, it would mean that your operator project can be distributed and installed in any cluster version available, which is not necessarily the case for all projects." | ||||||||
} | ||||||||
] | ||||||||
} | ||||||||
``` | ||||||||
|
||||||||
Allowing the validators to output ManifestResult should make it easier to | ||||||||
transition existing validators to the external format with minimal code. | ||||||||
|
||||||||
##### Migrating existing validator to executable | ||||||||
|
||||||||
In the short to near term, you create a new `main.go` for each of the validators. | ||||||||
Then import the validator code from operator-framework/api. The `main.go` would | ||||||||
take in one (1) argument, the bundle directory. | ||||||||
|
||||||||
Since the [existing validators](#default-validators-run-by-operator-sdk) already | ||||||||
output `ManifestResult`, it's easiest if we simply print that out as JSON to | ||||||||
stdout. | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
An example POC that takes an existing validator and outputs `ManifestResult` can | ||||||||
be found at [validator-poc][validator-poc1] | ||||||||
|
||||||||
Another example of a migration can be find at [ocp-olm-catalog-validator][camila-poc]. | ||||||||
This particular example does NOT yet output `ManifestResult` format. | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we could also move the CLI implementation from SDK to the https://github.com/operator-framework/apia and export it as a lib, so that:
So that, we can ensure a standard and a centralised code to be kept maintained. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @camilamacedo86 |
||||||||
|
||||||||
|
||||||||
#### CLI | ||||||||
|
||||||||
The big question at hand is how do we indicate to the `operator-sdk` CLI that | ||||||||
we want to run an external validator? Today, the `bundle validate` command takes | ||||||||
in a few flags, here we will discuss how each might need to be changed to work | ||||||||
with external validators. | ||||||||
|
||||||||
``` | ||||||||
Usage: | ||||||||
operator-sdk bundle validate [flags] | ||||||||
|
||||||||
... | ||||||||
|
||||||||
Flags: | ||||||||
--alpha-select-external string Selector to select external validators to run. It should be set to a Unix path list ("/path/to/e1.sh:/path/to/e2") | ||||||||
-h, --help help for validate | ||||||||
-b, --image-builder string Tool to pull and unpack bundle images. Only used when validating a bundle image. One of: [docker, podman, none] (default "docker") | ||||||||
--list-optional List all optional validators available. When set, no validators will be run | ||||||||
--optional-values --optional-values=k8s-version=1.22 Inform a []string map of key=values which can be used by the validator. e.g. to check the operator bundle against an Kubernetes version that it is intended to be distributed use --optional-values=k8s-version=1.22 (default []) | ||||||||
-o, --output string Result format for results. One of: [text, json-alpha1]. Note: output format types containing "alphaX" are subject to change and not covered by guarantees of stable APIs. (default "text") | ||||||||
--select-optional string Label selector to select optional validators to run. Run this command with '--list-optional' to list available optional validators | ||||||||
|
||||||||
Global Flags: | ||||||||
--plugins strings plugin keys to be used for this subcommand execution | ||||||||
--verbose Enable verbose logging | ||||||||
``` | ||||||||
|
||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Currently, we do not have many external validators. However, in the future, if it grows we might face complexities such as: To test your project against the criteria to publish in the index X:
So, shows that would be very nice if we could just copy and paste a config and run just 1 command that would check the bundle against the common criteria + all external validators. See that we download OPM CLI and use it (https://github.com/operator-framework/operator-sdk/blob/master/testdata/go/v3/memcached-operator/Makefile#L183-L198). Then, if we could make SDK download and use all external validators configured then, we maybe make it easier / possible SDK only uses this config with a bin provided by operator-framework/api instead of importing it to execute the checks. That could address the main motivations of this PE. (avoid SDK releases only to ship new versions of the of/api bundle validators implementation) Then, if a user requires a new version of the default checks (of/api) we could only update the config for its upper version as we do in the case of OPM in the makefile target. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This config is going down the route of "discoverability". |
||||||||
* *--help* works as is | ||||||||
* *--image-builder* works as is | ||||||||
* *--list-optional* would need to be updated to locate external validators. | ||||||||
* *--optional-values* would continue to be passed to the validators | ||||||||
* *--output* indicates how we want to output the results. | ||||||||
* *--select-optional* works as is | ||||||||
* *-- alpha-select-external* added; takes in the location of the executable to run as | ||||||||
the validator, i.e. `/path/to/validator/the-executable:/path/to/another` | ||||||||
|
||||||||
### Risks and Mitigations | ||||||||
|
||||||||
There is little risk, if this does not pan out we keep going on the current path | ||||||||
of compiling them in. | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
Another possible risk is users of an airgapped environment will have to install | ||||||||
these external validator executables. We could supply a mechanism for | ||||||||
distributing these external validators as an image. For this first release, we | ||||||||
will ignore the distribution solutions leaving it to the user to copy the | ||||||||
validators onto the system running `operator-sdk`. | ||||||||
|
||||||||
## Design Details | ||||||||
|
||||||||
### Test Plan | ||||||||
|
||||||||
- unit tests for the feature will be added in the implementation | ||||||||
- QE would need to create custom validations to be called by the new `--alpha-select-external` flag. | ||||||||
|
||||||||
### Graduation Criteria | ||||||||
|
||||||||
This feature will come out as alpha accessible via the `--alpha-select-external` | ||||||||
flag to the `bundle validate` command. We will accept feedback on possible | ||||||||
changes to the feature. | ||||||||
|
||||||||
After a period of time, we can graduate to a GA with a more permanent flag, i.e. | ||||||||
`--select-external`. | ||||||||
|
||||||||
### Version Skew Strategy | ||||||||
|
||||||||
* The version of the validators can change however the validator author sees fit. | ||||||||
* The API or contract between `operator-sdk` and validators will be | ||||||||
* input to validator: bundle directory | ||||||||
* output from validator: `ManifestResult` JSON | ||||||||
|
||||||||
## Implementation History | ||||||||
|
||||||||
N/A | ||||||||
|
||||||||
## Drawbacks | ||||||||
|
||||||||
Validators would have to be their own executables which could result in a | ||||||||
compilation step being needed depending on the language used to implement them. | ||||||||
|
||||||||
## Alternatives | ||||||||
|
||||||||
* put validations in their own images | ||||||||
* need to define "API" contract what is the entrypoint and what parameters do | ||||||||
we give it | ||||||||
* pro: | ||||||||
* already have a precendence for running things from images | ||||||||
* familiar tech | ||||||||
* con: | ||||||||
* would need a cluster to run these validations | ||||||||
* authors would have to create binaries of their validations anyway | ||||||||
|
||||||||
* use a language like JavaScript or CUE to define all validations | ||||||||
* validations could be run from a git repo, i.e. operator-sdk could pull it | ||||||||
and then evaluate it | ||||||||
* pro: | ||||||||
* simpler delivery, expose via a gitrepo and done | ||||||||
* con: | ||||||||
* all existing validations would have to be re-written in a new language | ||||||||
structure which could introduce new bugs | ||||||||
* unproven technology | ||||||||
* would have to write the engine to know how to execute these | ||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. this is a repeat and I don't think it needs to be added as a con to the JavaScript or CUE alternative. These would be equally fast. |
||||||||
|
||||||||
* use scorecard to do the validations | ||||||||
* create validations written in scorecard as custom tests | ||||||||
* pro: | ||||||||
* infrastructure required to run is already built within scorecard | ||||||||
* con: | ||||||||
* would need a cluster to run these validations | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
## Infrastructure Needed [optional] | ||||||||
jmrodri marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
N/A | ||||||||
|
||||||||
[phase-2]: https://github.com/kubernetes-sigs/kubebuilder/blob/master/designs/extensible-cli-and-scaffolding-plugins-phase-2.md | ||||||||
[manifest-result]: https://github.com/operator-framework/api/blob/master/pkg/validation/errors/error.go#L9-L16 | ||||||||
[validator-poc1]: https://github.com/jmrodri/validator-poc/tree/poc1-manifestresults | ||||||||
[camila-poc]: https://github.com/camilamacedo86/ocp-olm-catalog-validator | ||||||||
[errors-go]: https://github.com/operator-framework/api/blob/master/pkg/validation/errors/error.go#L9-L16 | ||||||||
[operator-hub-validator]: https://github.com/operator-framework/api/blob/master/pkg/validation/internal/operatorhub.go | ||||||||
[optional-validator]: https://github.com/operator-framework/operator-sdk/blob/master/internal/cmd/operator-sdk/bundle/validate/optional.go#L130-L156 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Scorecard uses kuttl, which supports a mocked control plane. Not sure if you want to couple validation to the scorecard, but that is an option to avoid needing a live cluster if you do want scorecard to handle this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ryantking Most of the validations are pretty static. We could revisit this idea of enhancing scorecard for this purpose in the future.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ryantking,
The Scorecard tests require to be executed with a cluster and have a high cost to be executed. Its initial idea would also allow functional tests and the users to create their own custom tests.
See that for example, we check all bundles using SDK in the Audit tool (https://github.com/operator-framework/audit), by using the scorecard we need like 4 hours when doing exactly the same with the static validators it takes 1 hour.