Introduce a mechanism for gathering test artifacts during individual test failures #2442

timflannagan · 2021-11-10T17:39:29Z

Update the e2e test suite and add a mechanism for gathering test
artifacts when an individual test fails. Currently, container logs are
gathered when deprovisioning upstream kind clusters, but we lack the
fine-grained ability to diagnose test failures further.

Note: a test is treated as failed when the CurrentGinkgoTestDescription.Failed
field is set. Test artifacts are only gathered when the base
$ARTIFACTS_DIR environment variable has been specified.

Add a collect-ci-artifacts.sh bash script, responsible for gathering OLM
native resources for an individual testing namespace. This bash script
will be called when tearing down the generated testing namespace for
relevant e2e packages. Currently, the artifact gathering process is
restricted to only a single namespace - longer term, it might be
possible to instead migrate towards collecting resources that all share
a similar label selector, and utilizing that label selector to handle
multi-namespace testing scenarios.
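
As a rough illustration of the longer-term label-selector idea, the sketch below gathers each resource type across all namespaces in one pass. It is not part of this PR. The function name `collect_by_label`, the `KUBECTL` override, and the example label `e2e-test-id` are assumptions made for illustration. The resource types are the ones that appear in the sample artifact listing later in this thread.

```bash
# Hypothetical sketch: gather OLM resources across namespaces by label selector.
# KUBECTL can be overridden (e.g. KUBECTL=echo) to preview the commands.
KUBECTL="${KUBECTL:-kubectl}"

collect_by_label() {
    selector="$1"   # e.g. "e2e-test-id=some-test" (assumed label, not set by this PR)
    out_dir="$2"
    mkdir -p "${out_dir}" || return 1
    for resource in clusterserviceversions installplans operatorgroups subscriptions; do
        # One file per resource type, covering every namespace that matches.
        ${KUBECTL} get "${resource}" --all-namespaces -l "${selector}" -o yaml \
            > "${out_dir}/${resource}.yaml" \
            || echo "failed to collect ${resource}" >&2
    done
}
```

This only helps if every resource a test creates carries the shared label, which the current suite does not guarantee.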

Introduce another helper function in test/e2e/util_test.go that
gathers test artifacts (i.e. calls the newly introduced script) when
the test case has failed and, in either case, deletes the namespace.

Signed-off-by: timflannagan [email protected]

Reviewer Checklist

Implementation matches the proposed design, or proposal is updated to match implementation
Sufficient unit test coverage
Sufficient end-to-end test coverage
Docs updated or added to /doc
Commit messages sensible and descriptive

timflannagan · 2021-11-10T17:39:54Z

test/e2e/ctx/ctx.go

+	}
+
+	// compiled test binary running e2e tests is run from the root ./bin directory
+	cmd := exec.Command("../test/e2e/collect-ci-artifacts.sh")


Note: we likely need to stat this file rather than rely on a path that is relative to the working directory.
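
One way to picture that check, written in shell for brevity (in ctx.go the equivalent would presumably be a file stat before the `exec.Command` call): resolve the script against a known root and fail early with a clear message. `E2E_ROOT` and `resolve_collect_script` are hypothetical names used only for this sketch.

```bash
# Sketch: locate the collection script relative to a known root instead of the
# current working directory. E2E_ROOT is a hypothetical variable.
resolve_collect_script() {
    root="${E2E_ROOT:-.}"
    script="${root}/test/e2e/collect-ci-artifacts.sh"
    if [ ! -f "${script}" ]; then
        echo "artifact script not found: ${script}" >&2
        return 1
    fi
    if [ ! -x "${script}" ]; then
        echo "artifact script is not executable: ${script}" >&2
        return 1
    fi
    # Print the resolved path so the caller can invoke it.
    echo "${script}"
}
```

A caller could then run `script="$(resolve_collect_script)" && "${script}"`, which skips collection with an explicit error rather than an opaque exec failure.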

timflannagan · 2021-11-10T17:40:52Z

test/e2e/util_test.go

+	if currentTest.Failed {
+		log("collecting the %s namespace artifacts as the '%s' test case failed", ns.GetName(), currentTest.TestText)
+		if err := ctx.Ctx().DumpNamespaceArtifacts(ns.GetName()); err != nil {
+			log("failed to collect namespace artifacts: %v", err)


We avoid assertions like Expect(err).To(BeNil()) here because we want the namespace to be deleted in any case, regardless of whether this method fails to gather artifacts.
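
The control flow described here (collection is best-effort, deletion always happens) can be sketched as a shell analogue. This is not the Go helper from the PR. `teardown_namespace`, `collect_artifacts`, and `delete_namespace` are hypothetical names, and the last two are hooks the caller is assumed to define.

```bash
# Shell analogue of the teardown flow: collection is best-effort, deletion is not.
# collect_artifacts and delete_namespace are hypothetical hooks defined by the caller.
teardown_namespace() {
    test_failed="$1"   # "true" when the test case failed
    namespace="$2"
    if [ "${test_failed}" = "true" ]; then
        # Log and continue: a collection error must not block cleanup.
        collect_artifacts "${namespace}" \
            || echo "failed to collect namespace artifacts for ${namespace}" >&2
    fi
    # Reached on every path, whether or not collection ran or succeeded.
    delete_namespace "${namespace}"
}
```

Asserting on the collection error would abort the helper before the delete step, which is the situation the comment above is trying to avoid.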

timflannagan · 2021-11-10T17:42:41Z

test/e2e/collect-ci-artifacts.sh

+echo "Storing the test artifact output in the ${TEST_ARTIFACTS_DIR} directory"
+for command in "${commands[@]}"; do
+    echo "Collecting ${command} output..."
+    COMMAND_OUTPUT_FILE=${TEST_ARTIFACTS_DIR}/${command// /_}


This output file likely needs some work - here's an example of an e2e run locally:

    $ make e2e-local ARTIFACTS_DIR=/tmp/artifacts
    ...
    $ tree /tmp/artifacts/catalog-e2e-2jq84/
    /tmp/artifacts/catalog-e2e-2jq84/
    ├── get_clusterserviceversions_-o_yaml
    ├── get_events_--sort-by_.lastTimestamp
    ├── get_installplans_-o_yaml
    ├── get_operatorgroups_-o_yaml
    ├── get_pods_-o_wide
    └── get_subscriptions_-o_yaml
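
A possible tidier naming scheme, offered only as a sketch: name each file after the resource and pick an extension from the requested output format. `output_file_name` is a hypothetical helper. The command strings it expects (for example `get pods -o wide`) are inferred from the file names above, not copied from the script.

```bash
# Sketch: derive a readable file name from a kubectl command string.
output_file_name() {
    command="$1"
    # The resource is the second word: "get pods -o wide" -> "pods".
    resource="$(echo "${command}" | awk '{print $2}')"
    case "${command}" in
        *"-o yaml"*) echo "${resource}.yaml" ;;
        *)           echo "${resource}.txt" ;;
    esac
}
```

With this helper, `get subscriptions -o yaml` maps to `subscriptions.yaml` and `get pods -o wide` maps to `pods.txt`. Two commands that target the same resource with the same output format would collide, so a real version would need a tie-breaker.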

timflannagan · 2021-11-10T17:44:09Z

test/e2e/ctx/ctx.go

+	}
+	ctx.Logf("collecting logs in the %s artifacts directory", ctx.artifactsDir)
+
+	logDir := filepath.Join(ctx.artifactsDir, namespace)


I'm open to suggestions on how best to dump individual test failure artifacts. The current implementation creates a directory named after whatever namespace was generated for that test. It isn't immediately clear how to map that directory to an individual test failure without looking at the overall CI logs produced during a run (the generated namespace is logged there).
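
One option, sketched under assumptions: fold a slug of the test description into the directory name so the mapping is visible without the CI logs. `slugify` and `artifact_dir_for` are hypothetical helpers. In the Go code the test text would presumably come from `currentTest.TestText`.

```bash
# Sketch: build an artifact directory name from the test text and namespace.
slugify() {
    # Lowercase, collapse runs of non-alphanumerics into "-", trim edge dashes.
    printf '%s' "$1" \
        | tr '[:upper:]' '[:lower:]' \
        | sed -e 's/[^a-z0-9]\{1,\}/-/g' -e 's/^-//' -e 's/-$//'
}

artifact_dir_for() {
    base="$1"        # e.g. the value of $ARTIFACTS_DIR
    namespace="$2"   # generated test namespace
    test_text="$3"   # human-readable test description
    echo "${base}/$(slugify "${test_text}")_${namespace}"
}
```

Keeping the namespace as a suffix preserves the link to the CI logs, while the slug prefix groups directories by test when listed alphabetically. Long test descriptions may need truncating to stay within path length limits.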

timflannagan · 2021-11-10T17:52:44Z

test/e2e/collect-ci-artifacts.sh

+for command in "${commands[@]}"; do
+    echo "Collecting ${command} output..."
+    COMMAND_OUTPUT_FILE=${TEST_ARTIFACTS_DIR}/${command// /_}
+    kubectl -n ${TEST_NAMESPACE} ${command} >> "${COMMAND_OUTPUT_FILE}"


Note: this script also never deletes "empty" files. Any file that contains an empty List is still created and kept in this directory:

    apiVersion: v1
    items: []
    kind: List
    metadata:
      resourceVersion: ""
      selfLink: ""
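
A small post-processing pass could drop those files. The sketch below assumes the YAML layout shown above, where an empty List contains a literal `items: []` line, and it also removes zero-byte files. `prune_empty_lists` is a hypothetical name and is not part of the script in this PR.

```bash
# Sketch: remove artifact files that hold no resources.
prune_empty_lists() {
    dir="$1"
    for file in "${dir}"/*; do
        [ -f "${file}" ] || continue
        # Drop zero-byte files and YAML Lists serialized with "items: []".
        if [ ! -s "${file}" ] || grep -q '^items: \[\]$' "${file}"; then
            rm -f "${file}"
        fi
    done
}
```

Checking before writing would avoid creating the file at all, but a cleanup pass keeps the collection loop itself unchanged.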

kevinrizza · 2021-11-10T22:48:59Z

/approve

anik120

/lgtm

openshift-ci · 2021-11-11T14:24:21Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: anik120, kevinrizza, timflannagan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [kevinrizza]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci bot requested review from ankitathomas and joelanford November 10, 2021 17:39

timflannagan added 2 commits November 10, 2021 16:04

test/e2e: Refactor the bundle e2e tests to avoid using global test na…

95ba037

…mespace Signed-off-by: timflannagan <[email protected]>

timflannagan force-pushed the test/collect-failed-test-artifacts branch from 29d4377 to 95ba037 Compare November 10, 2021 21:04

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 10, 2021

timflannagan changed the title ~~Introduce mechanism for gathering test artifacts during individual test failures~~ Introduce a mechanism for gathering test artifacts during individual test failures Nov 10, 2021

anik120 approved these changes Nov 11, 2021

openshift-ci bot assigned anik120 Nov 11, 2021

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 11, 2021

openshift-merge-robot merged commit 4cc28bb into operator-framework:master Nov 11, 2021

timflannagan deleted the test/collect-failed-test-artifacts branch November 11, 2021 14:26
