[CI] Various tests in ShrinkIndexIT fail with "expected at least one master-eligible node left" #44164

droberts195 · 2019-07-10T12:17:13Z

This failure has occurred many times in the last 36 hours: https://build-stats.elastic.co/app/kibana#/discover?_g=(refreshInterval:(pause:!t,value:0),time:(from:now-30d,mode:relative,to:now))&_a=(columns:!(test,build-id,build_url,branch),filters:!(),index:e58bf320-7efd-11e8-bf69-63c8ef516157,interval:auto,query:(language:lucene,query:'class:%20%22org.elasticsearch.action.admin.indices.create.ShrinkIndexIT%22'),sort:!(time,desc))

Often the failing test is ShrinkIndexIT.testCreateShrinkWithIndexSort; see, for example, https://gradle.com/s/fglguuelmqobe

More recently a failure of ShrinkIndexIT.testShrinkCommitsMergeOnIdle was seen but with the same error message: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+pull-request-1/2405/testReport/junit/org.elasticsearch.action.admin.indices.create/ShrinkIndexIT/testShrinkCommitsMergeOnIdle/

In both cases the error message is something like:

java.lang.AssertionError: expected at least one master-eligible node left in {node_sc1=org.elasticsearch.test.InternalTestCluster$NodeAndClient@2de106c0}

A REPRO command for 7.x is:

./gradlew :server:integTest --tests "org.elasticsearch.action.admin.indices.create.ShrinkIndexIT.testCreateShrinkWithIndexSort" -Dtests.seed=50AF68DFDA0BACBE -Dtests.security.manager=true -Dtests.locale=zh-TW -Dtests.timezone=America/North_Dakota/Center -Dcompiler.java=12 -Druntime.java=8

A REPRO command for master is:

./gradlew :server:integTest --tests "org.elasticsearch.action.admin.indices.create.ShrinkIndexIT.testShrinkCommitsMergeOnIdle" -Dtests.seed=C57BD4CF13C2F399 -Dtests.security.manager=true -Dtests.locale=fo -Dtests.timezone=Europe/Zagreb -Dcompiler.java=12 -Druntime.java=11

Neither of these reproduces locally for me.

I will mute the suite.


elasticmachine · 2019-07-10T12:17:15Z

Pinging @elastic/es-distributed


droberts195 · 2019-07-10T12:23:08Z

Muted on master in 2f4905f and on 7.x in cad804d


original-brownbear · 2019-07-10T16:36:08Z

I can easily reproduce this on master with -Dtests.seed=C57BD4CF13C2F399 by running the test suite on repeat in IDEA (for by class). Trying to track this down now.

bizybot · 2019-07-11T05:43:11Z

Muted on 7.3 in 52e72db
as it failed in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+7.3+multijob-unix-compatibility/os=amazon/17/testReport/junit/org.elasticsearch.action.admin.indices.create/ShrinkIndexIT/testCreateShrinkIndex/

Fix ShrinkIndexIT

* Move this test suite to cluster scope. Currently, `testShrinkThenSplitWithFailedNode` stops a random node, which sometimes turns out to be the only shared master node, so the cluster reset fails because no shared master node survived.
* Closes #44164

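The failure mode described in the commit message can be illustrated with a toy model. The sketch below is not Elasticsearch code: `SharedMasterSketch`, `Node`, `stopRandomNode`, `hasMasterEligibleNode`, `countFailingSeeds`, and the node names are all hypothetical and exist only for illustration. It assumes a cluster with a single master-eligible node and shows that stopping a uniformly random node leaves no master-eligible node for some seeds and not others, which is consistent with the failure being seed-dependent.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Random;

// Toy model only: none of these types are Elasticsearch classes.
final class SharedMasterSketch {

    /** Hypothetical stand-in for a node in a test cluster. */
    static final class Node {
        final String name;
        final boolean masterEligible;

        Node(String name, boolean masterEligible) {
            this.name = name;
            this.masterEligible = masterEligible;
        }
    }

    /** Returns a copy of the cluster with one randomly chosen node removed. */
    static List<Node> stopRandomNode(List<Node> nodes, Random random) {
        List<Node> remaining = new ArrayList<>(nodes);
        remaining.remove(random.nextInt(remaining.size()));
        return remaining;
    }

    /** Checks the condition named in the error message: is a master-eligible node left? */
    static boolean hasMasterEligibleNode(List<Node> nodes) {
        for (Node node : nodes) {
            if (node.masterEligible) {
                return true;
            }
        }
        return false;
    }

    /** Counts the seeds in [0, seeds) for which the random stop removes every master-eligible node. */
    static int countFailingSeeds(List<Node> cluster, int seeds) {
        int failures = 0;
        for (long seed = 0; seed < seeds; seed++) {
            List<Node> remaining = stopRandomNode(cluster, new Random(seed));
            if (!hasMasterEligibleNode(remaining)) {
                failures++;
            }
        }
        return failures;
    }

    public static void main(String[] args) {
        // Assumed layout: one master-eligible node plus two others (names are made up).
        List<Node> cluster = Arrays.asList(
            new Node("master_only", true),
            new Node("data_1", false),
            new Node("data_2", false));
        int seeds = 1000;
        System.out.println(countFailingSeeds(cluster, seeds) + " of " + seeds
            + " seeds stop the only master-eligible node");
    }
}
```

In this model, a cluster whose nodes are all master-eligible never hits the condition as long as at least two nodes exist, which is one way to see why a node stopped by one test should not be the only master shared with later tests.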

droberts195 added the >test-failure and :Distributed Indexing/Distributed labels Jul 10, 2019

droberts195 added a commit that referenced this issue Jul 10, 2019

[TEST] Mute ShrinkIndexIT

cad804d

Due to #44164

droberts195 added a commit that referenced this issue Jul 10, 2019

[TEST] Mute ShrinkIndexIT

2f4905f

Due to #44164

original-brownbear self-assigned this Jul 10, 2019

original-brownbear mentioned this issue Jul 11, 2019

Fix ShrinkIndexIT #44214

Merged

original-brownbear closed this as completed in #44214 Jul 11, 2019

original-brownbear mentioned this issue Jul 11, 2019

Fix ShrinkIndexIT (#44214) #44223

Merged
