-
Notifications
You must be signed in to change notification settings - Fork 430
OCPBUGS-45496: Prevent undesired MOSBs from building #4739
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCPBUGS-45496: Prevent undesired MOSBs from building #4739
Conversation
@RishabhSaini: This pull request references Jira Issue OCPBUGS-45496, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
7c9cf9b
to
9303fc7
Compare
/jira refresh |
@RishabhSaini: This pull request references Jira Issue OCPBUGS-45496, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
Requesting review from QA contact: In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
@RishabhSaini: This pull request references Jira Issue OCPBUGS-45496, which is valid. 3 validation(s) were run on this bug
Requesting review from QA contact: In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
/test unit |
/retest-required |
LGTM |
9303fc7
to
a67ba70
Compare
osbuildcontroller_test: Unit testing for cascading failure In OCPBUGS-45496, here are the steps which lead to the failure of the creation of an MOSB upon the addition of a new MC: 1) An MC with erroneous contents not caught by the API validations creates a rendered-MC which triggers a MOSB build and fails as expected with an error 2) This erroneous MOSB keeps getting added to the rate limited worker queue till it hits the max retries. Then it is forgotten from the queue and is subjected to a backoff time to get added back again 3) In the meantime if the erroneous MC is deleted and a new valid MC is added targetting the same MCP, a valid MOSB build starts 4) When the erroneous mosb enters the queue again and sees that there already exists another not successfull MOSB in build, it cancels all other builds. Hence the valid MOSB is cancelled and the erroneous MOSB is re-triggerred again. Since it will never be able to start the build and fail again the steps 1 and 2 keep happening. Any new MC will fail to successfully create a MOSB and trigger a build. Thus the solution is to check upon the MOSB sync whether the MCP and rendered-MC that the MOSB targets even exists anymore. If not we don't need to build it.
a67ba70
to
aba18f3
Compare
/test unit |
/test e2e-gcp-op |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: RishabhSaini, umohnani8 The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@RishabhSaini: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
/test e2e-gcp-op |
52b26e7
into
openshift:master
@RishabhSaini: Jira Issue OCPBUGS-45496: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-45496 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
[ART PR BUILD NOTIFIER] Distgit: ose-machine-config-operator |
OCPBUGS-45496: Prevent undesired MOSBs from building
OCPBUGS-45496: Prevent undesired MOSBs from building
OCPBUGS-45496: Prevent undesired MOSBs from building
- What I did
reconciler: only build mosb's currently targeted by the MCP
In OCPBUGS-45496, here are the steps which lead to the failure of the creation of an MOSB upon the addition of a new MC:
and fails as expected with an error
queue till it hits the max retries. Then it is forgotten from the
queue and is subjected to a backoff time to get added back again
added targetting the same MCP, a valid MOSB build starts
already exists another not successfull MOSB in build, it cancels all
other builds.
Hence the valid MOSB is cancelled and the erroneous MOSB is re-triggerred again. Since it will never be able to start the build and fail again the steps 2 and 4 keep happening.
Any new MC will fail to successfully create a MOSB and trigger a build.
Thus the solution is to check upon the MOSB sync whether the MCP and rendered-MC that the MOSB targets even exists anymore. If not we don't need to build it.
- How to verify it
- Description for the changelog