Add option to take currently relocating shards' sizes into account #7785


Merged
1 commit merged into elastic:master from dtd-user-current-relocations on Sep 19, 2014

Conversation

dakrone
Member

@dakrone dakrone commented Sep 18, 2014

When using the DiskThresholdDecider, it's possible that shards could
already be marked as relocating to the node being evaluated. This commit
adds a new setting `cluster.routing.allocation.disk.include_relocations`
which adds the size of the shards currently being relocated to this node
to the node's used disk space.

This new option defaults to `true`. However, it's possible to
over-estimate the usage for a node if the relocation is already
partially complete. For instance:

A node with a 10gb shard that's 45% of the way through a relocation
would be treated as using 10gb + (0.45 * 10gb) = 14.5gb of disk for that
shard before examining the watermarks to see if a new shard can be
allocated.
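The arithmetic in that example can be sketched as a tiny stand-alone method (a hypothetical simplification for illustration only; `RelocationEstimate` and `expectedUsedBytes` are invented names, not the actual DiskThresholdDecider code):

```java
// Hypothetical sketch of the estimate described above; not the real
// DiskThresholdDecider implementation.
public class RelocationEstimate {

    // Bytes the decider would account for on the target node: the node's
    // actual used bytes (which already include any partially copied data)
    // plus the full size of each shard currently relocating to it.
    static long expectedUsedBytes(long actualUsedBytes, long[] incomingShardSizes) {
        long total = actualUsedBytes;
        for (long size : incomingShardSizes) {
            total += size; // full shard size, even if the copy is underway
        }
        return total;
    }

    public static void main(String[] args) {
        long gb = 1L << 30;
        // 10gb shard, 45% relocated: 4.5gb is already on disk and counted
        // in actual usage, yet the full 10gb is added on top of it.
        long alreadyCopied = 45 * 10 * gb / 100;            // 4.5gb
        long estimate = expectedUsedBytes(alreadyCopied, new long[] { 10 * gb });
        System.out.println(estimate == 29 * gb / 2);        // 14.5gb accounted
    }
}
```

This is the over-estimate the commit message describes: the 4.5gb already copied is effectively counted twice until the relocation completes.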

Fixes #7753
Relates to #6168

@gibrown
Contributor

gibrown commented Sep 18, 2014

Because running out of disk space is such a hard failure condition, I'd suggest that this should be the default behavior. Temporarily over-estimating seems safer to me.

Great improvement though. Thanks!

@grantr

grantr commented Sep 18, 2014

I also favor safety over accuracy by default. @dakrone what would the example situation that you mention ultimately resolve to? Would the same shards still be allocated, but slower than usual?
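As a rough, hypothetical illustration of the trade-off under discussion (invented numbers and method names, not the actual decider logic): counting a relocation in full can temporarily push a node over a watermark that its real usage would pass.

```java
// Hypothetical illustration of the safety-vs-accuracy trade-off discussed
// above; numbers and names are invented, not taken from Elasticsearch.
public class WatermarkSketch {

    // Would a new shard of newShardBytes fit under a watermark of
    // maxUsedFraction, given the (possibly over-estimated) used bytes?
    static boolean underWatermark(long usedBytes, long totalBytes,
                                  long newShardBytes, double maxUsedFraction) {
        return (double) (usedBytes + newShardBytes) / totalBytes <= maxUsedFraction;
    }

    public static void main(String[] args) {
        long gb = 1L << 30;
        long total = 100 * gb;
        long actuallyUsed = 78 * gb;                  // real on-disk usage
        long withRelocation = actuallyUsed + 10 * gb; // full incoming shard added
        long newShard = 8 * gb;

        // Against real usage the new shard would fit under a 90% watermark...
        System.out.println(underWatermark(actuallyUsed, total, newShard, 0.90));   // true
        // ...but with the relocation counted it is refused for now.
        System.out.println(underWatermark(withRelocation, total, newShard, 0.90)); // false
    }
}
```

Under such a scheme the allocation would presumably just be retried and succeed once the relocation finishes and the node's disk-usage information is refreshed, at the cost of a temporary refusal.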

@@ -649,6 +654,107 @@ public void freeDiskPercentageAfterShardAssignedUnitTest() {
assertThat(after, equalTo(19.0));
}

@Test
@TestLogging("cluster.routing.allocation.decider:TRACE")
Contributor


is trace logging required here?

@s1monw
Contributor

s1monw commented Sep 19, 2014

I left a minor comment. I think we should move to true as the default; I completely agree that the safer option is preferable. Other than that, LGTM.

@s1monw s1monw removed the review label Sep 19, 2014
@dakrone dakrone force-pushed the dtd-user-current-relocations branch 2 times, most recently from be53e08 to 4185566 on September 19, 2014 at 10:35
@dakrone dakrone merged commit 4185566 into elastic:master Sep 19, 2014
@dakrone dakrone deleted the dtd-user-current-relocations branch September 19, 2014 13:35
@lcawl lcawl added the :Distributed Indexing/Distributed label and removed the :Allocation label on Feb 13, 2018
Labels
:Distributed Indexing/Distributed, >enhancement, v1.5.0, v2.0.0-beta1
Development

Successfully merging this pull request may close these issues.

Disk decider can allocate more data than the node can handle
6 participants