Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: catch exceptions from updating the status #2752

Merged
merged 1 commit into from
Apr 4, 2025

Conversation

shawkins
Copy link
Collaborator

@shawkins shawkins commented Apr 3, 2025

There are two possible handlings here:

  • catch and log the error from the status patch, so that the original retry handling is preserved
  • log the original exception, and let the error from the status patch take over as it does now

I opted for the former, but I'm open to the latter.

closes: #2751

@openshift-ci openshift-ci bot requested review from csviri and metacosm April 3, 2025 15:56
@csviri
Copy link
Collaborator

csviri commented Apr 4, 2025

log the original exception, and let the error from the status patch take over as it does now

This makes more sense to me, considering the following situation having a patch from error handler that fails and not to retry raises some concerns of consistency. This way might be at least eventual consistent in general. But not retrying lead to an unexpected state: reconciliation failed and even status is not updated.
What do you think?

@shawkins
Copy link
Collaborator Author

shawkins commented Apr 4, 2025

This makes more sense to me, considering the following situation having a patch from error handler that fails and not to retry raises some concerns of consistency.

Why would you not retry? Wouldn't that mean the core reconciliation action has no retry or retires left?

An independent retry for patching the status doesn't make as much sense for me. Since there is already retry logic built in the client, the most likely causes for the patching to fail would be:

  • a 409 - in which case I'd expect another event to come in
  • the api server is down for longer than the client retry, which I don't think requires special handling just for patching the status

@csviri
Copy link
Collaborator

csviri commented Apr 4, 2025

Ahh yes, so if there is explicit no retry we could say, we don't retry even if patch fails. But again, I guess some might want to patch status for a reason, but you probably right so we might be good enough with logging in this case.

other opinions? @metacosm @xstefank

Copy link
Collaborator

@xstefank xstefank left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It makes sense to me like this. If there are requests to patch the status, we could put this under config.

@metacosm
Copy link
Collaborator

metacosm commented Apr 4, 2025

Ideally, it would make sense to always try to patch the status but it might not be possible, especially if it already failed at least once…

@csviri csviri merged commit 9482c3e into operator-framework:main Apr 4, 2025
25 checks passed
@csviri
Copy link
Collaborator

csviri commented Apr 4, 2025

Thank you @shawkins !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Error handling if status patching fails
4 participants