Skip to content

Detect deadlocks when running hot_threads #42790

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
ywelsch opened this issue Jun 3, 2019 · 5 comments
Open

Detect deadlocks when running hot_threads #42790

ywelsch opened this issue Jun 3, 2019 · 5 comments
Labels
:Core/Infra/Core Core issues without another label >feature high hanging fruit Team:Core/Infra Meta label for core/infra team triaged Issue has been looked at, and is being left open

Comments

@ywelsch
Copy link
Contributor

ywelsch commented Jun 3, 2019

A node that has deadlocked threads can exhibit all kinds of issues, which can be difficult to explain if not looking for this exact cause (see #41418). It would be convenient if hot_threads, which already provides a lot of useful information about the threads, would also provide information on whether there's a deadlock on the respective node.

Funnily enough, our code base already has a DeadlockAnalyzer (introduced by Shay a very long time ago, but deactivated very early on), so this would make use of this class again, which has been laying dormant for 8 years.

@ywelsch ywelsch added >feature discuss :Core/Infra/Core Core issues without another label labels Jun 3, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-core-infra

@jasontedor jasontedor removed the discuss label Jun 6, 2019
@jasontedor jasontedor self-assigned this Jun 6, 2019
@jasontedor
Copy link
Member

We discussed this during our weekly fix-it meeting. We think this is a worthwhile addition although we are doubtful about overloading hot_threads for this purpose. Instead, we discussed the possibility of adding a dedicated threads API that would give basic reporting about threads (stacktraces, etc.) and could include deadlock detection, basically bring jstack into Elasticsearch.

@rjernst rjernst added the Team:Core/Infra Meta label for core/infra team label May 4, 2020
@rjernst rjernst added the needs:triage Requires assignment of a team area label label Dec 3, 2020
@williamrandolph
Copy link
Contributor

I would like to raise this ticket for discussion with the Core/Infra team to decide if a dedicated threads API is "high hanging fruit" or if it is something that should be on our roadmap.

@williamrandolph williamrandolph added team-discuss and removed needs:triage Requires assignment of a team area label labels Jan 6, 2021
@williamrandolph
Copy link
Contributor

We discussed this today in core/infra. We don't think this will get on the roadmap in the short term, but we still see the benefit in doing it, so I've put the "high hanging fruit" tag on it.

@jasontedor jasontedor removed their assignment Sep 7, 2021
@mosche mosche added the triaged Issue has been looked at, and is being left open label May 9, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-core-infra (Team:Core/Infra)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Core/Infra/Core Core issues without another label >feature high hanging fruit Team:Core/Infra Meta label for core/infra team triaged Issue has been looked at, and is being left open
Projects
None yet
Development

No branches or pull requests

7 participants