This repository was archived by the owner on Mar 13, 2022. It is now read-only.

Check if the cached temp conf file still exists before using it. #38

Closed

Conversation

@amitsrivastava

On some systems, older unused temp files get cleaned up regularly, so we
cannot rely on the availability of the temp files in long-running
processes.

@codecov-io

Codecov Report

Merging #38 into master will not change coverage.
The diff coverage is 100%.


@@           Coverage Diff           @@
##           master      #38   +/-   ##
=======================================
  Coverage   93.36%   93.36%           
=======================================
  Files          11       11           
  Lines         829      829           
=======================================
  Hits          774      774           
  Misses         55       55
Impacted Files           Coverage   Δ
config/kube_config.py    91.86%     <100%> (ø) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9213876...3d91dcd.

@@ -49,7 +49,7 @@ def _create_temp_file_with_content(content):
     # Because we may change context several times, try to remember files we
     # created and reuse them at a small memory cost.
     content_key = str(content)
-    if content_key in _temp_files:
+    if content_key in _temp_files and os.path.isfile(_temp_files[content_key]):
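
For context, a minimal sketch of the patched helper with the existence check in place (the mkstemp/atexit surroundings here are assumptions based on kube_config.py, not part of this diff):

    import atexit
    import os
    import tempfile

    _temp_files = {}

    def _cleanup_temp_files():
        for temp_file in _temp_files.values():
            try:
                os.remove(temp_file)
            except OSError:
                pass

    def _create_temp_file_with_content(content):
        if len(_temp_files) == 0:
            atexit.register(_cleanup_temp_files)
        # Because we may change context several times, try to remember files we
        # created and reuse them at a small memory cost.
        content_key = str(content)
        # The patched check: reuse the cached path only if it still exists.
        if content_key in _temp_files and os.path.isfile(_temp_files[content_key]):
            return _temp_files[content_key]
        fd, name = tempfile.mkstemp()
        os.close(fd)
        with open(name, 'wb') as f:
            f.write(content.encode() if isinstance(content, str) else content)
        _temp_files[content_key] = name
        return name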

@arunmk commented Nov 4, 2017


There is still a small window for a race here:

  1. after the isfile check, the file may be deleted (before line 53)
  2. the file may be deleted after the function returns, but before it is consumed by the caller.

There should be a more generic solution to this, or the tracking of the file should be delegated to the caller. Possible solutions include:

  1. 'semantically locking' the file so that cleanup will not remove it: not ideal, as the caller should clean up
  2. modifying the system so that temporary files are not cleared: to be done by the caller.

Hence I think the responsibility of ensuring that the file exists should be delegated to the caller.
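
To make the window concrete, a sketch of where the deletion can still slip in (the get_cached helper is hypothetical, standing in for the cached lookup in kube_config):

    import os

    _temp_files = {}  # content_key -> path, as in kube_config

    def get_cached(content_key):
        if content_key in _temp_files and os.path.isfile(_temp_files[content_key]):
            # (1) a cleaner such as tmpwatch may delete the file right here,
            #     after the isfile check but before the return
            return _temp_files[content_key]
        return None

    # (2) even after a path is returned, the file may be deleted before
    #     the caller actually opens it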

@amitsrivastava (Author)

That's absolutely correct @arunmk. But what you suggest is a much bigger change. The temp file names are passed all over, beyond the scope of this module (kube_config), so the changes would have to go all the way up the call chain.

@amitsrivastava (Author)

Currently, the situation is such that once you get into this bad state, you can never get out without restarting the Python process. With this change, most such cases are handled, and even the one you pointed out can only cause a one-time exception.


If we use $TMPDIR, I don't see a way out that's better than the solution you have here, @amitsrivastava. Could you also add an API documentation change that says something to the effect of: 'The file that the API emits may not exist due to the cleanup policy of $TMPDIR. In that case, the pattern is to retry the API until the file exists.' That will clarify the usage of the API.
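
A minimal sketch of that caller-side retry pattern (the wrapper name, retry count, and delay are illustrative assumptions, not a proposed API):

    import os
    import time

    def config_file_with_retry(content, attempts=3, delay=0.1):
        # _create_temp_file_with_content is the kube_config helper patched above
        for _ in range(attempts):
            path = _create_temp_file_with_content(content)
            if os.path.isfile(path):
                return path
            time.sleep(delay)  # file vanished between creation and use; retry
        raise RuntimeError("temp conf file keeps disappearing; check $TMPDIR cleanup")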

(Member)

Temporary files can be safely deleted only if they have not been accessed by any process for a long time; for example, tmpwatch (popular on Red Hat) works this way. Because of this, I suggest updating the mtime to prevent the files from being deleted. It also removes the race condition.

    # assumes `import errno` and `import os` at module level
    if content_key in _temp_files:
        try:
            os.utime(_temp_files[content_key], None)
            return _temp_files[content_key]  # file exists and mtime has been updated
        except OSError as err:
            if err.errno == errno.ENOENT:
                # oops, the file disappeared and has to be recreated
                pass
            else:
                raise

How does it look?


@amitsrivastava @tomplus it looks like the /run directory is better for such purposes. From http://www.h-online.com/open/news/item/Linux-distributions-to-include-run-directory-1219006.html:

"On the Fedora project's developer list, systemd developer Lennart Poettering has announced the introduction of a /run directory in the root directory and provided detailed background explanations. Similar to the existing /var/run/ directory, the new directory is designed to allow applications to store the data they require in order to operate. This includes process IDs, socket information, lock files and other data which is required at run-time but can't be stored in /tmp/ because programs such as tmpwatch could potentially delete it from there."

So, if feasible, the /var/run or /run directory may be a better location for these temporary files.
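
A sketch of what that could look like (using XDG_RUNTIME_DIR, which typically points at /run/user/<uid> on systemd systems; the fallback behaviour is an assumption, not library code):

    import os
    import tempfile

    # Prefer a runtime dir that tmpwatch-style cleaners leave alone;
    # tempfile falls back to $TMPDIR when dir is None.
    runtime_dir = os.environ.get("XDG_RUNTIME_DIR")
    fd, name = tempfile.mkstemp(dir=runtime_dir)
    with os.fdopen(fd, "wb") as f:
        f.write(b"kubeconfig content placeholder")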

(Contributor)

Is keeping the temp file open a bad idea?


Hi, any update on this issue? It looks like it was not merged. How should we address this issue?


I am facing the same issue as well; what would be the best way to work around it?

(Contributor)

I chose to store the temp files in another directory.

@fejta-bot

Unknown CLA label state. Rechecking for CLA labels.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/check-cla

@k8s-ci-robot added the cncf-cla: yes label Apr 20, 2019
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Jul 20, 2019
@tbarrella

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label Jul 26, 2019
@scottilee

Hey @amitsrivastava, any update on this? I can get someone to review it afterwards.

Since this PR has been open for a while, if it is no longer valid please close it.

@arunmk commented Sep 27, 2019

@scottilee @amitsrivastava I can take this forward and create a patch if you are fine with it.

@pigletfly

Any updates on this?

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Jan 22, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Feb 21, 2020
@pigletfly

/remove-lifecycle stale

@pigletfly

/remove-lifecycle rotten

@k8s-ci-robot removed the lifecycle/rotten label Feb 21, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label May 21, 2020
@arunmk commented May 21, 2020

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label May 21, 2020
@arunmk commented May 21, 2020

/remove-lifecycle rotten

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Oct 8, 2020
@arunmk commented Oct 8, 2020

/remove-lifecycle stale

@k8s-ci-robot removed the lifecycle/stale label Oct 8, 2020
@ceroloy commented Oct 9, 2020

I see several PRs linked here as related, and some of them are marked as merged. Is this issue resolved? If so, can you please share which version fixes it?

Thank you

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Jul 11, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Aug 10, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue or PR with /reopen
  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

@k8s-ci-robot (Contributor)

@k8s-triage-robot: Closed this PR.

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue or PR with /reopen
  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Labels: cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.), lifecycle/rotten (Denotes an issue or PR that has aged beyond stale and will be auto-closed.)