Elasticsearch version (bin/elasticsearch --version): 7.5.1
Plugins installed: []
JVM version (java -version): bundled JDK
OS version (uname -a if on a Unix-like system):
Linux 4.14.81.bm.20-amd64 #1 SMP Debian 4.14.81.bm.20 Sat Mar 14 10:14:04 UTC 2020 x86_64 GNU/Linux
Description of the problem including expected versus actual behavior:
In our benchmarks we found that increasing the number of shards has a deleterious effect on search performance.
Environment:
10 machines (96 cores, 1 TB memory, 16 TB NVMe SSD each), four ES nodes per machine
Index size: 75 GB
Document count: 0.64 billion
The indices are time series, split by day; a typical search spans 8~12 daily indices.
| shard num | pct95 (ms) | qps  |
|-----------|------------|------|
| 1         | 700        | 4000 |
| 10        | 10000      | 800  |
Using jstack, flame graphs, and Arthas (a performance tool from Alibaba), we found that many threads, including search, transport_worker, and http_server_worker threads, were blocked on CloseableThreadLocal's lock.
Following the stack traces into the code, the CloseableThreadLocal belongs to ThreadContext, and ThreadContext is globally unique, so all threads are affected. The more shards there are, the more intense the lock contention becomes, and performance degrades severely.
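For reference, here is a simplified sketch of the locking pattern the stack traces point at (illustrative only, not the exact Lucene source). Every set() and every periodic purge() synchronizes on the same shared map, so one shared instance serializes all threads that touch it:

```java
import java.io.Closeable;
import java.lang.ref.WeakReference;
import java.util.Iterator;
import java.util.Map;
import java.util.WeakHashMap;

// Simplified sketch of the CloseableThreadLocal locking pattern.
public class SketchThreadLocal<T> implements Closeable {
    private final ThreadLocal<WeakReference<T>> local = new ThreadLocal<>();
    // One map shared by every thread; all mutation happens under its monitor.
    private final Map<Thread, T> hardRefs = new WeakHashMap<>();

    public T get() {
        WeakReference<T> ref = local.get();
        return ref == null ? null : ref.get();
    }

    public void set(T value) {
        local.set(new WeakReference<>(value));
        synchronized (hardRefs) {          // <-- the contended lock
            hardRefs.put(Thread.currentThread(), value);
        }
    }

    // In the real class a countdown in get()/set() triggers this periodically.
    // It iterates the whole map while holding the lock, stalling every
    // concurrent set() in every other thread.
    void purge() {
        synchronized (hardRefs) {
            for (Iterator<Thread> it = hardRefs.keySet().iterator(); it.hasNext();) {
                if (!it.next().isAlive()) {
                    it.remove();
                }
            }
        }
    }

    @Override
    public void close() {
        synchronized (hardRefs) {
            hardRefs.clear();
        }
        local.remove();
    }
}
```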
I think CloseableThreadLocal solves a GC problem, but it introduces a lot of lock contention. In high-concurrency scenarios especially, it does more harm than good, so this code needs to be redesigned.
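One possible direction for such a redesign (purely a hypothetical sketch, not a change that exists upstream): replace the synchronized WeakHashMap with a ConcurrentHashMap, so set() and purge() no longer serialize behind one monitor. The trade-off is that Thread keys are held strongly until purge() runs, since ConcurrentHashMap has no weak keys:

```java
import java.io.Closeable;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical lock-free variant of the sketch above.
public class ConcurrentSketchThreadLocal<T> implements Closeable {
    private final ThreadLocal<T> local = new ThreadLocal<>();
    // Lock-free map: concurrent set() calls no longer contend on a monitor.
    private final Map<Thread, T> hardRefs = new ConcurrentHashMap<>();

    public T get() {
        return local.get();
    }

    public void set(T value) {
        local.set(value);
        hardRefs.put(Thread.currentThread(), value); // no global lock
    }

    // Purge entries for dead threads without blocking writers.
    public void purge() {
        hardRefs.keySet().removeIf(t -> !t.isAlive());
    }

    @Override
    public void close() {
        hardRefs.clear();
        local.remove();
    }
}
```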
Steps to reproduce:
Please include a minimal but complete recreation of the problem, including
(e.g.) index creation, mappings, settings, query etc. The easier you make for
us to reproduce it, the more likely that somebody will take the time to look at it.
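Since the contention is in the CloseableThreadLocal pattern itself, a hypothetical stand-alone micro-benchmark (no Elasticsearch cluster needed, using the SketchThreadLocal class from the sketch above) can demonstrate it; with enough threads, most wall time is spent blocked in set():

```java
// Hypothetical micro-benchmark for the lock contention in isolation.
public class ContentionBench {
    public static void main(String[] args) throws InterruptedException {
        final SketchThreadLocal<byte[]> shared = new SketchThreadLocal<>();
        final int threads = 96;          // matches the 96-core machines above
        final int iterations = 1_000_000;
        Thread[] workers = new Thread[threads];
        long start = System.nanoTime();
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int n = 0; n < iterations; n++) {
                    shared.set(new byte[16]); // every call takes the shared lock
                    shared.get();
                }
            });
            workers[i].start();
        }
        for (Thread w : workers) {
            w.join();
        }
        System.out.printf("elapsed: %d ms%n", (System.nanoTime() - start) / 1_000_000);
    }
}
```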
Provide logs (if relevant):