[CI] AssertionError in ShardSearchStats.onFetchPhase #70968
Labels
:Search/Search
Search-related issues that do not fall into other categories
Team:Search
Meta label for search team
>test-failure
Triaged test failures from CI
Comments
Pinging @elastic/es-search (Team:Search)
dnhatn added a commit that referenced this issue on Apr 13, 2021:
A CounterMetric is used to track the number of completed and outstanding items, for example, the number of executed refreshes, the currently used memory by indexing, the current pending search requests. In all cases, the current count of CounterMetric is always non-negative. However, as this metric is implemented using a LongAdder, the returned count is NOT an atomic snapshot; invocation in the absence of concurrent updates returns an accurate result, but concurrent updates that occur while the sum is being calculated might not be incorporated. We can replace LongAdder with AtomicLong, but this commit chooses to continue using LongAdder but returns 0 when the sum value is negative. Relates #52411 Closes #70968
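The behaviour described in the commit message can be illustrated with a minimal sketch. `ClampedCounter` is a hypothetical stand-in written for this note, not the actual `CounterMetric` source, and the real fix may differ in detail (for example, it might retry the read before falling back to 0). The sketch only shows the idea of keeping `LongAdder` for cheap concurrent updates while never reporting a negative count.

```java
import java.util.concurrent.atomic.LongAdder;

/**
 * Hypothetical illustration only; this is not the Elasticsearch CounterMetric class.
 * Tracks a value that is logically never negative (e.g. in-flight requests).
 */
final class ClampedCounter {
    // LongAdder spreads updates over several internal cells to reduce contention.
    private final LongAdder adder = new LongAdder();

    void inc() {
        adder.increment();
    }

    void dec() {
        adder.decrement();
    }

    long count() {
        // sum() reads the cells one after another and is not an atomic snapshot.
        // Under concurrent updates it can observe a decrement without the
        // increment that preceded it, so the raw sum may be transiently negative.
        // Clamp to 0 so callers never see a negative count.
        return Math.max(0L, adder.sum());
    }
}
```

The trade-off in this sketch is that a transiently inconsistent read is reported as 0 rather than as an exact value. Switching to `AtomicLong`, the alternative mentioned in the commit message, would give exact reads at the cost of more contention on hot counters.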
dnhatn added five more commits that referenced this issue on Apr 14, 2021 (two of them to dnhatn/elasticsearch), each carrying the same commit message as above.
The following assertion tripped in the core search code while doing a search in an ML test on the 7.x branch:
elasticsearch/server/src/main/java/org/elasticsearch/index/search/stats/ShardSearchStats.java
Line 112 in 6343c18
Build scan:
https://gradle-enterprise.elastic.co/s/innofaaydb3fc
Repro line:
Reproduces locally?:
No
Applicable branches:
master, 7.x
Failure history:
https://build-stats.elastic.co/app/kibana#/discover?_g=(refreshInterval:(pause:!t,value:0),time:(from:now-90d,mode:quick,to:now))&_a=(columns:!(_source),index:e58bf320-7efd-11e8-bf69-63c8ef516157,interval:auto,query:(language:lucene,query:'%22Uncaught%20exception%20in%20thread%22%20AND%20ShardSearchStats.java%20AND%20112'),sort:!(time,desc))
Same thing happened in a PR build on 24th February, but in BasicDistributedJobsIT.testMaxConcurrentJobAllocations instead of MlDistributedFailureIT.testJobRelocationIsMemoryAware.
Both tests that have triggered this are internal cluster tests. Maybe there's something special about internal cluster tests that violates the expected invariants of search stats?
Failure excerpt: