🌱 Add automated machine management section to docs tasks #6421


Merged

Conversation


@enxebre enxebre commented Apr 18, 2022

What this PR does / why we need it:
Add automated worker machine management section to docs tasks

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #

@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Apr 18, 2022
@enxebre enxebre force-pushed the automated-machine-management-docs branch 4 times, most recently from 57b275b to acfbabe Compare April 18, 2022 15:48

@fabriziopandini fabriziopandini left a comment


thanks, @enxebre for improving our docs!

- [Automated worker Machine management](./tasks/automated-worker-machine-management/index.md)
- [Scaling](./tasks/automated-worker-machine-management/scaling.md)
- [Auto scaling](./tasks/automated-worker-machine-management/auto-scaling.md)
- [Health checking](./tasks/automated-worker-machine-management/healthchecking.md)
Member

Health checking is both for control plane and worker nodes...
What about calling this new section "Automated machine management" and explicitly calling out when a paragraph applies only to a subset of machines


@chrischdi chrischdi left a comment


Only small formatting nits 👍


Machines can be owned by scalable resources i.e. MachineSet and MachineDeployments.

You can scale MachineSets and MachineDeployments in or out by expressing intent via .spec.replicas or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.

Suggested change
You can scale MachineSets and MachineDeployments in or out by expressing intent via .spec.replicas or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.
You can scale MachineSets and MachineDeployments in or out by expressing intent via `.spec.replicas` or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.

Formatting to follow the other book entries/markdowns
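The scaling flow in the quoted doc line can be sketched as a manifest. This is a hedged, minimal MachineDeployment example assuming the Cluster API v1beta1 schema — the names (`foo`, `my-cluster`, the template refs) and the Kubernetes version are illustrative, not taken from the PR:

```yaml
# Minimal MachineDeployment sketch (Cluster API v1beta1 schema assumed).
# Changing .spec.replicas expresses intent to scale worker Machines in or out.
apiVersion: cluster.x-k8s.io/v1beta1
kind: MachineDeployment
metadata:
  name: foo                # hypothetical name
spec:
  clusterName: my-cluster  # hypothetical Cluster
  replicas: 5              # desired number of worker Machines
  selector:
    matchLabels: {}
  template:
    spec:
      clusterName: my-cluster
      version: v1.24.0     # illustrative Kubernetes version
      bootstrap:
        configRef:
          apiVersion: bootstrap.cluster.x-k8s.io/v1beta1
          kind: KubeadmConfigTemplate
          name: foo        # hypothetical template name
      infrastructureRef:
        apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
        kind: DockerMachineTemplate
        name: foo          # hypothetical template name
```

The same intent can be expressed through the scale subresource, e.g. `kubectl scale machinedeployment foo --replicas=5`.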

You can scale MachineSets and MachineDeployments in or out by expressing intent via .spec.replicas or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.

When you delete a Machine directly or by scaling down, the same process takes place in the same order:
- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a .spec.nodeDrainTimeout.

Suggested change
- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a .spec.nodeDrainTimeout.
- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a `.spec.nodeDrainTimeout.`

Formatting to follow the other book entries/markdowns


When you delete a Machine directly or by scaling down, the same process takes place in the same order:
- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a .spec.nodeDrainTimeout.
- CAPI uses default [kubectl draining implementation](https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/) with –ignore-daemonsets=true. If you needed to ensure DaemonSets eviction you'd need to do so manually by also adding proper taints to avoid rescheduling.

Suggested change
- CAPI uses default [kubectl draining implementation](https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/) with –ignore-daemonsets=true. If you needed to ensure DaemonSets eviction you'd need to do so manually by also adding proper taints to avoid rescheduling.
- CAPI uses default [kubectl draining implementation](https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/) with `--ignore-daemonsets=true`. If you needed to ensure DaemonSets eviction you'd need to do so manually by also adding proper taints to avoid rescheduling.

Formatting to follow the other book entries/markdowns

- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a .spec.nodeDrainTimeout.
- CAPI uses default [kubectl draining implementation](https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/) with –ignore-daemonsets=true. If you needed to ensure DaemonSets eviction you'd need to do so manually by also adding proper taints to avoid rescheduling.
- The infrastructure backing that Node will try to be deleted indefinitely.
- Only when the infrastructure is gone, the Node will try to be deleted indefinitely unless you specify spec.nodeDeletionTimeout.

Suggested change
- Only when the infrastructure is gone, the Node will try to be deleted indefinitely unless you specify spec.nodeDeletionTimeout.
- Only when the infrastructure is gone, the Node will try to be deleted indefinitely unless you specify `.spec.nodeDeletionTimeout`.
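The drain and Node-deletion timeouts discussed in these suggestions map to two fields on the Machine spec. A minimal hedged sketch assuming the Cluster API v1beta1 `Machine` schema — all names and durations here are illustrative:

```yaml
# Sketch: bounding the deletion sequence for a single Machine (v1beta1 schema assumed).
apiVersion: cluster.x-k8s.io/v1beta1
kind: Machine
metadata:
  name: worker-0           # hypothetical name
spec:
  clusterName: my-cluster  # hypothetical Cluster
  version: v1.24.0         # illustrative Kubernetes version
  nodeDrainTimeout: 5m     # give up on draining after 5 minutes instead of waiting indefinitely
  nodeDeletionTimeout: 10s # give up waiting for Node object deletion after 10 seconds
  bootstrap:
    dataSecretName: worker-0-bootstrap  # hypothetical pre-created bootstrap data Secret
  infrastructureRef:
    apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
    kind: DockerMachine
    name: worker-0         # hypothetical infra Machine
```

When set on a MachineDeployment instead, the same fields go under `.spec.template.spec`.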


@furkatgofurov7 furkatgofurov7 left a comment


Thanks, this looks good, only a small nit that could be fixed:

@@ -0,0 +1,13 @@
# Scaling Nodes

You can add or remove compute capacity for your cluster workloads by creating or removing Machines. A Machine express intent to have a Node with a defined form factor.

Suggested change
You can add or remove compute capacity for your cluster workloads by creating or removing Machines. A Machine express intent to have a Node with a defined form factor.
You can add or remove compute capacity for your cluster workloads by creating or removing Machines. A Machine expresses intent to have a Node with a defined form factor.

@enxebre enxebre force-pushed the automated-machine-management-docs branch from acfbabe to be84108 Compare April 21, 2022 20:15

enxebre commented Apr 21, 2022

Addressed all comments PTAL @fabriziopandini @furkatgofurov7 @chrischdi.

@enxebre enxebre force-pushed the automated-machine-management-docs branch from be84108 to f35d524 Compare April 21, 2022 20:23
@enxebre enxebre changed the title 🌱 Add automated worker machine management section to docs tasks 🌱 Add automated machine management section to docs tasks Apr 21, 2022
@furkatgofurov7

/retest


Machines can be owned by scalable resources i.e. MachineSet and MachineDeployments.

You can scale MachineSets and MachineDeployments in or out by expressing intent via `.spec.replicas or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.

Suggested change
You can scale MachineSets and MachineDeployments in or out by expressing intent via `.spec.replicas or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.
You can scale MachineSets and MachineDeployments in or out by expressing intent via `.spec.replicas` or updating the scale subresource e.g `kubectl scale machinedeployment foo --replicas=5`.

@enxebre enxebre force-pushed the automated-machine-management-docs branch from f35d524 to b253ea8 Compare April 22, 2022 18:43

@furkatgofurov7 furkatgofurov7 left a comment


Thanks @enxebre, LGTM! lint GHA seems flaky though

@furkatgofurov7

/retest

@enxebre enxebre force-pushed the automated-machine-management-docs branch from b253ea8 to f067caa Compare April 25, 2022 14:10

enxebre commented Apr 25, 2022

lint GHA seems flaky though

rebased to pick #6436

@sbueringer

Nice improvement, thx!!
/lgtm

1 similar comment from @sbueringer

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 27, 2022

@killianmuldoon killianmuldoon left a comment


Looks good - just a couple of nits really. Definitely a useful subsection to have here.

- The Node backed by that Machine will try to be drained indefinitely and will wait for any volume to be detached from the Node unless you specify a `.spec.nodeDrainTimeout`.
- CAPI uses default [kubectl draining implementation](https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/) with `--ignore-daemonsets=true`. If you needed to ensure DaemonSets eviction you'd need to do so manually by also adding proper taints to avoid rescheduling.
- The infrastructure backing that Node will try to be deleted indefinitely.
- Only when the infrastructure is gone, the Node will try to be deleted indefinitely unless you specify `.spec.nodeDeletionTimeout`.
Contributor

Maybe add a note saying this won't work with a Topology managed Cluster with a ref to the ClusterClass doc?

Member

We are slowly adding all the flags to topology managed clusters, let's aim for feature parity

@sbueringer sbueringer May 24, 2022

I think it's fine if we don't have a note here if it's supported with ClusterClass or not. (especially given that Nabarun will implement #6450 (talked to him last week about it))

@enxebre enxebre force-pushed the automated-machine-management-docs branch from f067caa to 1f8321f Compare May 26, 2022 09:10
@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 26, 2022
@k8s-ci-robot k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels May 26, 2022
@enxebre enxebre force-pushed the automated-machine-management-docs branch from 1f8321f to f9546b8 Compare May 26, 2022 09:11

enxebre commented May 26, 2022

PTAL @sbueringer @killianmuldoon

@killianmuldoon

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 26, 2022
@sbueringer

Thank you!

/lgtm
/approve

@k8s-ci-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sbueringer

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 27, 2022
@k8s-ci-robot k8s-ci-robot merged commit f7e1205 into kubernetes-sigs:main May 27, 2022
@k8s-ci-robot k8s-ci-robot added this to the v1.2 milestone May 27, 2022