-
Notifications
You must be signed in to change notification settings - Fork 602
[release-2.6] 🐛: elbv2: wait for LB active state instead of resolving DNS name #5226
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[release-2.6] 🐛: elbv2: wait for LB active state instead of resolving DNS name #5226
Conversation
Using DNS name resolution as a way to check the load balancer is working can cause problems that are dependent on the host running CAPA. In some systems, the DNS resolution can fail with very large TTLs cached DNS responses, causing very long provisioning times. Instead of DNS resolution, let's use the AWS API to check for the load balancer "active" state. Waiting for resolvable DNS names should be left for the clients to do.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/approve
/test pull-cluster-api-provider-aws-e2e |
@damdo: The specified target(s) for
The following commands are available to trigger optional jobs:
Use
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
/test pull-cluster-api-provider-aws-e2e-release-2-6 |
1 similar comment
/test pull-cluster-api-provider-aws-e2e-release-2-6 |
@richardcase it seems the deprecated AMIs issue needs to be solved for other branches as well? |
/test pull-cluster-api-provider-aws-e2e-release-2-6 |
Looking at pull-cluster-api-provider-aws-e2e-release-2-6 failure:
It looks like the failure may correspond to missing creds? cluster-api-provider-aws/test/e2e/shared/aws.go Lines 865 to 868 in 79faf4c
|
@patrickdillon That is a flake that happens sometimes, we haven't nailed it down. /test pull-cluster-api-provider-aws-e2e-release-2-6 /assign @damdo |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: AndiDog, nrb The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test pull-cluster-api-provider-aws-e2e-release-2-6 |
The e2e tests are failing because the @richardcase what do we suggest here? Switch to 22.04 also for release-2.6 or republish old images for this? We might also need a better story around automated cleanup, we don't really want to not be able to run tests on older releases anymore. |
I'd say we need to support tests on previous release branches for a short while. May be 3 release branches? We could update the tests to use newer versions of k8s/ubuntu? But how much effort should we but into fixing e2e on an old branch? |
@richardcase Yeah I agree with the current CAPA staffing we might not be able to keep up with tests on releases too far back. This is n-1 w.r.t. the current n (latest), which is 2.7, so I think it would be well within reasonable ranges that we can define. |
Shall we just remove the e2e tests from all branches except:
Then i think when 2.8 is released we can keep it as:
I agree with you, with the number of active contributors to CAPA we need to pick where we put our effort. Perhaps we can jump on a call with @nrb and we can come to a consensus. |
I'm +1 to dropping the 2.6 tests. While it would be nice to have an updated, LTS OS for older branches, I don't think it's a valuable use of time or effort to keep these branches testing right now. |
Yes not an ideal situation but we need to be pragmatic with it, we can't maintain CI for older releases without more staffing. |
I guess as far as this PR now concerned we can LGTM and merge as is. |
Considering the above conversation and having @nrb @richardcase and @fiunchinho's green light, I'll proceed with merging this. /lgtm |
/test pull-cluster-api-provider-aws-test-release-2-6 |
@k8s-infra-cherrypick-robot: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
/test pull-cluster-api-provider-aws-test-release-2-6 |
@r4f4 it might need rebasing |
2361956
into
kubernetes-sigs:release-2.6
It merged! Thanks for the help. |
This is an automated cherry-pick of #5093
/assign nrb