-
Notifications
You must be signed in to change notification settings - Fork 25.2k
Ensure Test Cluster in Azure Discovery Tests Closes #43057
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
* Use `internalCluster().close()` to force all nodes (and not just the datanodes) to shut down even if one fails to shut down in time * More than 10s timeout on node shutdown, if we have a slow run 10s might not suffice in an IT * Force closing httpServer to get cleaner logs if nodes still hang on shut down * Closes elastic#43048
Pinging @elastic/es-distributed |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
Jenkins run elasticsearch-ci/2 |
It appears the increased timeout somehow breaks an ML test?? I'm looking into that. Update: yea the ML tests are in fact dependent on the node shutdown running into the interrupt mentioned below within 10s, so I can't just up that here without messing with those tests. |
@henningandersen I looked into the code some more and I'm not so convinced that upping the timeout here will help (closing a node waits for 10s then interrupts the node's threadpool and waits another 10s so we shouldn't hang here really). I'd still merge this if it's ok with you just to get a clean stack trace for where exactly things are stuck (currently, we're not stopping all the nodes and neither do we stop the web server on failure so it's hard to tell). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Let's get this merged
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
* Use `internalCluster().close()` to force all nodes (and not just the datanodes) to shut down even if one fails to shut down in time * Force closing httpServer to get cleaner logs if nodes still hang on shut down * Relates elastic#43048
* Use `internalCluster().close()` to force all nodes (and not just the datanodes) to shut down even if one fails to shut down in time * Force closing httpServer to get cleaner logs if nodes still hang on shut down * Relates #43048
internalCluster().close()
to force all nodes (and not just the datanodes) to shut down even if one fails to shut down in time