Properly exit when cert-watcher can no longer watch required file #1719

ethernoy · 2021-11-16T10:38:31Z

When the TLS assets are no longer available (missing drive for example) after a webhook server is started with such TLS assets, the cert-watcher throws the following error:

controller-runtime/pkg/certwatcher/certwatcher.go

Line 144 in 4d10a06

log.Error(err, "error re-watching file")

{"level":"error","ts":1636596243.6302137,"logger":"controller-runtime.certwatcher","msg":"error re-watching file","error":"no such file or directory","stacktrace":"github.com/go-logr/zapr.(*zapLogger).Error\n\t/go/src/github.com/open-policy-agent/gatekeeper/vendor/github.com/go-logr/zapr/zapr.go:132\nsigs.k8s.io/controller-runtime/pkg/log.(*DelegatingLogger).Error\n\t/go/src/github.com/open-policy-agent/gatekeeper/vendor/sigs.k8s.io/controller-runtime/pkg/log/deleg.go:144\nsigs.k8s.io/controller-runtime/pkg/webhook/internal/certwatcher.(*CertWatcher).handleEvent\n\t/go/src/github.com/open-policy-agent/gatekeeper/vendor/sigs.k8s.io/controller-runtime/pkg/webhook/internal/certwatcher/certwatcher.go:144\nsigs.k8s.io/controller-runtime/pkg/webhook/internal/certwatcher.(*CertWatcher).Watch\n\t/go/src/github.com/open-policy-agent/gatekeeper/vendor/sigs.k8s.io/controller-runtime/pkg/webhook/internal/certwatcher/certwatcher.go:102"}

After throwing the following error, the cert-watcher simply stops monitoring the path without further action. The last valid certificate persists in currentCert even after the path becomes available again. I wonder if it is better if the cert-watcher can either:

call os.exit after the path missing error occurs
keeps monitoring the path even if it is missing

Happy to do a PR if possible.

The text was updated successfully, but these errors were encountered:

k8s-triage-robot · 2022-02-14T11:33:25Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2022-03-16T12:27:38Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

ethernoy · 2022-03-20T10:01:27Z

/remove-lifecycle rotten

k8s-triage-robot · 2022-06-18T10:55:19Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

hzxuzhonghu · 2022-07-07T09:38:22Z

This is one blog explaining k8s fsnotfy, looks cert-watcher can not handle secret mounted in container.

k8s-triage-robot · 2022-08-06T10:22:06Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2022-09-05T10:32:30Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen
Mark this issue or PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2022-09-05T10:32:33Z

@k8s-triage-robot: Closing this issue.

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen

Mark this issue or PR as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

ichekrygin · 2024-04-01T22:11:08Z

/reopen

k8s-ci-robot · 2024-04-01T22:11:12Z

@ichekrygin: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

sbueringer · 2024-04-08T18:59:34Z

To be honest, I would be really surprised if the certwatcher "can not handle secret mounted in container.". As far as I'm aware that is the standard case with controller-runtime and folks have running this in production for a very long time (also in combination with certificates managed by cert-manager that are rotated every few weeks).

Maybe I'm missing something. Definitely fine to investigate further and if there is evidence that we have a problem fix it.

/remove-lifecycle rotten
/reopen

k8s-ci-robot · 2024-04-08T18:59:38Z

@sbueringer: Reopened this issue.

In response to this:

To be honest, I would be really surprised if the certwatcher "can not handle secret mounted in container.". As far as I'm aware that is the standard case with controller-runtime and folks have running this in production for a very long time (also in combination with certificates managed by cert-manager that are rotated every few weeks).

Maybe I'm missing something. Definitely fine to investigate further and if there is evidence that we have a problem fix it.

/remove-lifecycle rotten
/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-triage-robot · 2024-07-07T19:14:44Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2024-08-06T19:35:45Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle rotten
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2024-09-05T20:02:33Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen
Mark this issue as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

k8s-ci-robot · 2024-09-05T20:02:38Z

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen

Mark this issue as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

sbueringer · 2025-01-02T14:18:56Z

Just to surface it here. With CR v0.20 we'll likely going to have a cert-watcher implementation that uses fsnotify + additionally regularly reads the files from disk (default: every 10s)

xref: #3050

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 14, 2022

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 16, 2022

k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Mar 20, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 18, 2022

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Aug 6, 2022

k8s-ci-robot closed this as completed Sep 5, 2022

k8s-ci-robot reopened this Apr 8, 2024

k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Apr 8, 2024

dimityrmirchev mentioned this issue Apr 25, 2024

Evaluate the use of CertWatcher to replace internal implementation gardener/gardener-discovery-server#9

Closed

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 7, 2024

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Aug 6, 2024

k8s-ci-robot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Properly exit when cert-watcher can no longer watch required file #1719

Properly exit when cert-watcher can no longer watch required file #1719

ethernoy commented Nov 16, 2021

k8s-triage-robot commented Feb 14, 2022

k8s-triage-robot commented Mar 16, 2022

ethernoy commented Mar 20, 2022

k8s-triage-robot commented Jun 18, 2022

hzxuzhonghu commented Jul 7, 2022

k8s-triage-robot commented Aug 6, 2022

k8s-triage-robot commented Sep 5, 2022

k8s-ci-robot commented Sep 5, 2022

ichekrygin commented Apr 1, 2024

k8s-ci-robot commented Apr 1, 2024

sbueringer commented Apr 8, 2024

k8s-ci-robot commented Apr 8, 2024

k8s-triage-robot commented Jul 7, 2024

k8s-triage-robot commented Aug 6, 2024

k8s-triage-robot commented Sep 5, 2024

k8s-ci-robot commented Sep 5, 2024

sbueringer commented Jan 2, 2025 •

edited

Loading

Properly exit when cert-watcher can no longer watch required file #1719

Properly exit when cert-watcher can no longer watch required file #1719

Comments

ethernoy commented Nov 16, 2021

k8s-triage-robot commented Feb 14, 2022

k8s-triage-robot commented Mar 16, 2022

ethernoy commented Mar 20, 2022

k8s-triage-robot commented Jun 18, 2022

hzxuzhonghu commented Jul 7, 2022

k8s-triage-robot commented Aug 6, 2022

k8s-triage-robot commented Sep 5, 2022

k8s-ci-robot commented Sep 5, 2022

ichekrygin commented Apr 1, 2024

k8s-ci-robot commented Apr 1, 2024

sbueringer commented Apr 8, 2024

k8s-ci-robot commented Apr 8, 2024

k8s-triage-robot commented Jul 7, 2024

k8s-triage-robot commented Aug 6, 2024

k8s-triage-robot commented Sep 5, 2024

k8s-ci-robot commented Sep 5, 2024

sbueringer commented Jan 2, 2025 • edited Loading

sbueringer commented Jan 2, 2025 •

edited

Loading