Expand docs on disk-based shard allocation #65668


Merged

Conversation

DaveCTurner
Contributor

Today we document the settings used to control rebalancing and
disk-based shard allocation, but there isn't really any discussion of
what these processes do, so it's hard to know what adjustments, if any,
to make.

This commit adds some words to help folk understand this area better.

@DaveCTurner DaveCTurner added >docs General docs changes :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v8.0.0 v7.11.0 v7.10.2 labels Dec 1, 2020
@DaveCTurner DaveCTurner requested a review from jrodewig December 1, 2020 11:11
@elasticmachine elasticmachine added Team:Docs Meta label for docs team Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. labels Dec 1, 2020
@elasticmachine
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

@elasticmachine
Collaborator

Pinging @elastic/es-docs (Team:Docs)


@jrodewig jrodewig left a comment

LGTM overall.

I left some comments but nothing I would consider blocking.

Bigger thought:
I wonder if we should talk more about balanced data tiers rather than balanced clusters. While users can disable them, data tiers seem like a part of our default experience now.
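For context, the data tiers mentioned here are steered by the `_tier_preference` index setting (available as of 7.10); a minimal illustrative fragment, with example values rather than anything taken from this PR:

```yaml
# Illustrative index settings (e.g. in an index template), shown as YAML.
# `_tier_preference` asks the allocator to place the index's shards on the
# first tier in the list that has available nodes.
index:
  routing:
    allocation:
      include:
        _tier_preference: "data_hot,data_warm,data_content"
```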

Comment on lines 6 to 9
The <<shards-rebalancing-settings,balance>> of the cluster depends only on the
number of shards on each node and the indices to which those shards belong. It
considers neither the sizes of these shards nor the available disk space on
each node, for the following reasons:
@jrodewig jrodewig Dec 1, 2020
I wonder if mixing in the concept of balance here is more confusing. We may want to just start with your "The disk-based shard allocator..." paragraph. The gist of this text seems adequately covered there and in the later admonition about unequal disk usage.

@DaveCTurner DaveCTurner (Contributor, Author)
Hmm, good point about leading with the later paragraph. I'll think about removing this vs moving it elsewhere. It's a common source of confusion that "balanced" doesn't mean "equal disk usage"; I think we need to spell out why that isn't the case. But there's no need to lead with this.

Comment on lines 100 to 107
A cluster is _balanced_ when it has an equal number of shards on each node
without having a concentration of shards from any index on any node. {es} runs
an automatic process called _rebalancing_ which moves shards between the nodes
in your cluster in order to improve its balance. Rebalancing obeys all other
shard allocation rules such as <<cluster-shard-allocation-filtering,allocation
filtering>> and <<forced-awareness,forced awareness>> which may prevent it from
completely balancing the cluster. In that case, rebalancing strives to achieve
the most balanced cluster possible within the rules you have configured.
Contributor

Feels like we should mention data tiers here. In many cases, I imagine the tiers, rather than the cluster, will be balanced.
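The rebalancing behaviour described in the quoted paragraph is tuned through the `cluster.routing.*` settings; a sketch of the relevant knobs, with the values shown being the documented defaults rather than recommendations:

```yaml
# elasticsearch.yml (or dynamic cluster settings) – defaults shown for illustration.
cluster:
  routing:
    rebalance:
      enable: all                          # allow rebalancing for all shard types
    allocation:
      allow_rebalance: indices_all_active  # only rebalance once all shards are active
      cluster_concurrent_rebalance: 2      # max concurrent rebalancing relocations
      balance:
        shard: 0.45      # weight of the per-node total shard count
        index: 0.55      # weight of the per-node, per-index shard count
        threshold: 1.0   # minimum improvement required before moving a shard
```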

Comment on lines 40 to 45
Shard movements triggered by the disk-based shard allocator must also satisfy
all other shard allocation rules such as
<<cluster-shard-allocation-filtering,allocation filtering>> and
<<forced-awareness,forced awareness>>. If these rules are too strict then they
can also prevent the shard movements needed to keep the nodes' disk usage under
control.
Contributor

Another opportunity to mention data tiers.
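The disk-based shard allocator discussed in this thread is governed by the disk watermark settings; an illustrative fragment using the documented defaults:

```yaml
# elasticsearch.yml – disk-based shard allocation thresholds (documented defaults).
cluster:
  routing:
    allocation:
      disk:
        threshold_enabled: true
        watermark:
          low: "85%"          # stop allocating new shards to a node above this usage
          high: "90%"         # start moving shards off a node above this usage
          flood_stage: "95%"  # block writes to indices with a shard on such a node
```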

@DaveCTurner DaveCTurner merged commit aa4ab0b into elastic:master Dec 7, 2020
@DaveCTurner DaveCTurner deleted the 2020-12-01-disk-allocator-docs branch December 7, 2020 14:51
DaveCTurner added a commit that referenced this pull request Dec 7, 2020
DaveCTurner added a commit that referenced this pull request Dec 7, 2020