🌱 [WIP] make topology upgrade sequential #6652


Conversation

ykakarap (Contributor)

What this PR does / why we need it:

This PR fixes the Kubernetes version upgrade propagation logic in managed topologies so that upgrades are always rolled out sequentially: the control plane finishes its upgrade before the new version is propagated to the MachineDeployments.
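
To make the intended ordering concrete, here is a minimal, self-contained sketch of sequential propagation. All identifiers (`state`, `desiredVersions`, `cpUpgrading`, and so on) are hypothetical simplifications for illustration, not the PR's actual code:

```go
package main

import "fmt"

// state is a hypothetical, simplified view of a managed topology.
type state struct {
	topologyVersion string   // version set in Cluster.spec.topology.version
	cpVersion       string   // current control plane version
	cpUpgrading     bool     // is the control plane still rolling out?
	mdVersions      []string // current MachineDeployment versions
}

// desiredVersions returns the versions to write on this reconcile.
// The control plane is upgraded first; MachineDeployments only pick up
// the new version once the control plane has finished upgrading.
func desiredVersions(s state) (string, []string) {
	mds := make([]string, len(s.mdVersions))
	copy(mds, s.mdVersions)

	// While the control plane is upgrading, or has not yet reached the
	// topology version, hold every MachineDeployment at its current version.
	if s.cpUpgrading || s.cpVersion != s.topologyVersion {
		return s.topologyVersion, mds
	}
	// Control plane is stable at the target version: propagate it.
	for i := range mds {
		mds[i] = s.topologyVersion
	}
	return s.topologyVersion, mds
}

func main() {
	cp, mds := desiredVersions(state{
		topologyVersion: "v1.24.0",
		cpVersion:       "v1.23.0",
		cpUpgrading:     true,
		mdVersions:      []string{"v1.23.0", "v1.23.0"},
	})
	fmt.Println(cp, mds) // v1.24.0 [v1.23.0 v1.23.0]: MDs wait for the CP
}
```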

TODO:

  • Add unit tests
  • Check to see if any of the TopologyReconciled condition's messaging needs to be adjusted to reflect the new possible state

Which issue(s) this PR fixes (optional, in `fixes #<issue_number>(, fixes #<issue_number>, ...)` format; will close the issue(s) when the PR gets merged):
Fixes #6651

@k8s-ci-robot added the cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.) and size/M (Denotes a PR that changes 30-99 lines, ignoring generated files.) labels on Jun 15, 2022
@k8s-ci-robot (Contributor)

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign fabriziopandini after the PR has been reviewed.
You can assign the PR to them by writing /assign @fabriziopandini in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot (Contributor)

@ykakarap: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name | Commit | Details | Required | Rerun command
pull-cluster-api-test-main | 9d9b1f2 | link | true | /test pull-cluster-api-test-main

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@@ -611,6 +626,7 @@ func computeMachineDeploymentVersion(s *scope.Scope, desiredControlPlaneState *s
// Check if we are about to upgrade the control plane. In that case, do not upgrade the machine deployment yet.
// Wait for the new upgrade operation on the control plane to finish before picking up the new version for the
// machine deployment.
// TODO: We probably don't need this check anymore.
Member

I think you might be right. But because it's very easy to miss an edge case, I would prefer keeping this logic as a failsafe, so that we never, under any circumstances, trigger an MD rollout while the CP is still upgrading.

I also think the current code makes that property easy to verify directly, versus having to infer it from the logic above for every edge case we can think of.
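
Below is a minimal sketch of the failsafe being discussed, under the assumption that the check simply compares the control plane's current and desired versions. The function name `machineDeploymentVersion` and its parameters are hypothetical, not the signature of `computeMachineDeploymentVersion` in the actual code:

```go
package main

import "fmt"

// machineDeploymentVersion applies the failsafe: hold the MD at its
// current version whenever the control plane is mid-upgrade or about
// to be upgraded, regardless of what earlier sequencing logic concluded.
func machineDeploymentVersion(currentMD, currentCP, desiredCP string, cpUpgrading bool) string {
	if cpUpgrading || currentCP != desiredCP {
		return currentMD // pick up the new version on a later reconcile
	}
	return desiredCP
}

func main() {
	// CP is about to move to v1.24.0, so the MD stays on v1.23.0.
	fmt.Println(machineDeploymentVersion("v1.23.0", "v1.23.0", "v1.24.0", false))
}
```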

@sbueringer (Member)

Logic looks good to me

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label (Denotes an issue or PR has remained open with no activity and has become stale.) on Sep 13, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label (Denotes an issue or PR that has aged beyond stale and will be auto-closed.) and removed the lifecycle/stale label on Oct 13, 2022
@ykakarap (Contributor, Author)

ykakarap commented Nov 2, 2022

/lifecycle frozen

@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

  • Reopen this PR with /reopen
  • Mark this PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

@k8s-ci-robot (Contributor)

@k8s-triage-robot: Closed this PR.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Successfully merging this pull request may close these issues:

Prevent upgrades of managed topologies while previous upgrade is not yet completed