[CI] Failure in org.elasticsearch.cluster.routing.AllocationIdIT.testFailedRecoveryOnAllocateStalePrimaryRequiresAnotherAllocateStalePrimary #66893

Closed
original-brownbear opened this issue Dec 30, 2020 · 1 comment · Fixed by #67179
Labels:
- :Distributed Coordination/Allocation - All issues relating to the decision making around placing a shard (both master logic & on the nodes)
- Team:Distributed (Obsolete) - Meta label for distributed team (obsolete); replaced by Distributed Indexing/Coordination
- >test-failure - Triaged test failures from CI

Comments

original-brownbear (Member) commented:

This just failed on 7.x here: https://gradle-enterprise.elastic.co/s/rhyopb5hm3gu2/tests/:server:internalClusterTest/org.elasticsearch.cluster.routing.AllocationIdIT/testFailedRecoveryOnAllocateStalePrimaryRequiresAnotherAllocateStalePrimary

  2> REPRODUCE WITH: ./gradlew ':server:internalClusterTest' --tests "org.elasticsearch.cluster.routing.AllocationIdIT.testFailedRecoveryOnAllocateStalePrimaryRequiresAnotherAllocateStalePrimary" -Dtests.seed=491F1EE43B42DEC9 -Dtests.security.manager=true -Dbuild.snapshot=false -Dtests.jvm.argline="-Dbuild.snapshot=false" -Dtests.locale=he-IL -Dtests.timezone=Asia/Damascus -Druntime.java=8
  2> java.lang.AssertionError: timed out waiting for yellow state
        at __randomizedtesting.SeedInfo.seed([491F1EE43B42DEC9:C099E915F71965EB]:0)
        at org.junit.Assert.fail(Assert.java:88)
        at org.elasticsearch.test.ESIntegTestCase.ensureColor(ESIntegTestCase.java:953)
        at org.elasticsearch.test.ESIntegTestCase.ensureYellow(ESIntegTestCase.java:913)

Does not reproduce locally.

original-brownbear added the >test-failure and :Distributed Coordination/Allocation labels on Dec 30, 2020
elasticmachine added the Team:Distributed (Obsolete) label on Dec 30, 2020
elasticmachine (Collaborator) commented:

Pinging @elastic/es-distributed (Team:Distributed)

dnhatn self-assigned this on Jan 5, 2021
dnhatn added a commit that referenced this issue on Jan 13, 2021:

This test failed on WindowsFS: we failed to remove the corrupted file
while it was still open (held for a short window by the ListShardStore
action), and the pending deletes were cleared when we restarted that
node, so the file was never removed.

This commit fixes the issue by shutting the node down before removing
the corrupted file, so that nothing can still be accessing the file.

Closes #66893
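To make the race concrete, here is a minimal, self-contained Java sketch (hypothetical model code, not Elasticsearch internals) of the semantics described in the commit message: on Windows a file that is still open cannot be deleted, so the delete is deferred to a pending-deletes list, and that list does not survive a node restart. The `Node`, `delete`, `shutDown`, and `restart` names are invented for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Toy model of the race: an open file (e.g. held briefly by the
// ListShardStore action) cannot be deleted on Windows, so the delete is
// queued as "pending" -- and pending deletes are cleared on restart.
class Node {
    final Set<String> filesOnDisk = new HashSet<>();
    final Set<String> openFiles = new HashSet<>();
    final List<String> pendingDeletes = new ArrayList<>();

    boolean delete(String file) {
        if (openFiles.contains(file)) {
            // Windows semantics: cannot delete a file that is open;
            // defer the delete instead.
            pendingDeletes.add(file);
            return false;
        }
        return filesOnDisk.remove(file);
    }

    void shutDown() {
        openFiles.clear();          // shutting down releases all file handles
    }

    void restart() {
        pendingDeletes.clear();     // pending deletes do not survive a restart
        openFiles.clear();
    }
}

class StaleDeleteDemo {
    public static void main(String[] args) {
        // Buggy order: delete while the file is still open, then restart.
        Node buggy = new Node();
        buggy.filesOnDisk.add("corrupted");
        buggy.openFiles.add("corrupted");
        buggy.delete("corrupted");  // deferred to pending deletes
        buggy.restart();            // pending deletes cleared -> file leaks
        System.out.println("buggy: corrupted file still on disk = "
                + buggy.filesOnDisk.contains("corrupted"));

        // Fixed order (as in the commit): shut the node down first.
        Node fixed = new Node();
        fixed.filesOnDisk.add("corrupted");
        fixed.openFiles.add("corrupted");
        fixed.shutDown();           // releases the handle
        fixed.delete("corrupted");  // delete now succeeds
        System.out.println("fixed: corrupted file still on disk = "
                + fixed.filesOnDisk.contains("corrupted"));
    }
}
```

In the buggy ordering the corrupted file survives the restart (`true`); with the shutdown-first ordering it is removed (`false`), which is the reordering the fix applies to the test.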
dnhatn added four more commits that referenced this issue on Jan 13, 2021, each with the same commit message.