Skip to content

Unassign persistent tasks when the executor node dies #37609

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 2 commits into from

Conversation

davidkyle
Copy link
Member

PersistentTasksClusterService decides if a task should be reassigned by checking there is a node in the cluster with the same Id instead of comparing the ephemeral Id or the actual DiscoveryNode.

https://github.com/elastic/elasticsearch/blob/master/server/src/main/java/org/elasticsearch/persistent/PersistentTasksClusterService.java#L389

PersistentTasksClusterService runs on the master node. If a node exits and re-enters the cluster while there isn't a master node PersistentTasksClusterService will not be active to witness the change and unassign the task. When the node returns and PersistentTasksClusterService is running the assignment appears valid as the node is in the cluster but its departure was missed. This happens during the rolling upgrade tests where there are 3 nodes and minimum master nodes is set to 3.

This change is much larger than it should be because of a rename. Deassociate isn't a word in British or American english dictionaries, the options are disassociate or dissociate.

6.7 only because Zen2 although the rename could be forward ported.

@davidkyle davidkyle added :Distributed Coordination/Task Management Issues for anything around the Tasks API - both persistent and node level. v6.7.0 labels Jan 18, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-distributed

@davidkyle
Copy link
Member Author

Closing in favour of #37656

@davidkyle davidkyle closed this Jan 21, 2019
@davidkyle davidkyle deleted the ptasks-deassociate-nodes branch January 21, 2019 14:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Coordination/Task Management Issues for anything around the Tasks API - both persistent and node level.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants