More Efficient Ordering of Shard Upload Execution (#42791) #46588

original-brownbear · 2019-09-11T10:10:57Z

Inspired by #39657 and resolving the issue raised in that PR:

The problem currently is that we are uploading all segments in a shard sequentially per shard and only parallelize snapshotting across shards but not within shards.
This PR adjusts the logic towards parallel segment uploads:

Instead of starting all shard uploads in parallel and without any order, run segment uploads shard by shard.
Only acquire index commit when it's actually required (after determining the current state of a shard in the repository, not before that).
- @DaveCTurner pointed out that going sequentially in determining the segments to upload shard by shard increases the timespan between index commits. This is true for the case pf having more snapshot threads than shards to be snapshotted. On the other hand if there's more shards to be snapshotted on a node than snapshot threads it's the other way around. Currently, in this situation once the snapshot thread pool is fully occupied a shard will have to wait for another shard to completely finish its snapshot before taking the index commit (which could be many minutes!) while now the time span between the index commits is likely never more than a few seconds apart (biggest difference between two index commit times is basically equal to the time it takes all the shard folder listing operations to finish)
Release index commit asap (with this change we generally way reduce the amount of time we hold on to an index commit in cases of uploading multiple shards with multiple segments)

Backport of #42791 and #46208

* Change the upload order of of snapshots to work file by file in parallel on the snapshot pool instead of merely shard-by-shard * Inspired by elastic#39657

Aborts and failures were handled in a somewhat unfortunate way in elastic#42791: Since the tasks for all files are generated before uploading they are all executed when a snapshot is aborted and lead to a massive number of failures added to the original aborted exception. In the case of failures the situation was not very reasonable as well. If one blob fails uploading the snapshot logic would upload all the remaining files as well and then fail (when previously it would just fail all following files). I fixed both of the above issues, by just short-circuiting all remaining tasks for a shard in case of an exception in any one upload.

elasticmachine · 2019-09-11T10:10:58Z

Pinging @elastic/es-distributed

original-brownbear added 2 commits September 11, 2019 12:07

More Efficient Ordering of Shard Upload Execution (elastic#42791)

f9a39ed

* Change the upload order of of snapshots to work file by file in parallel on the snapshot pool instead of merely shard-by-shard * Inspired by elastic#39657

original-brownbear added :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs backport labels Sep 11, 2019

original-brownbear changed the title ~~42791 7.x~~ More Efficient Ordering of Shard Upload Execution (#42791) Sep 11, 2019

original-brownbear merged commit 41633cb into elastic:7.x Sep 11, 2019

original-brownbear deleted the 42791-7.x branch September 11, 2019 11:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

More Efficient Ordering of Shard Upload Execution (#42791) #46588

More Efficient Ordering of Shard Upload Execution (#42791) #46588

Uh oh!

original-brownbear commented Sep 11, 2019

Uh oh!

elasticmachine commented Sep 11, 2019

Uh oh!

Uh oh!

More Efficient Ordering of Shard Upload Execution (#42791) #46588

More Efficient Ordering of Shard Upload Execution (#42791) #46588

Uh oh!

Conversation

original-brownbear commented Sep 11, 2019

Uh oh!

elasticmachine commented Sep 11, 2019

Uh oh!

Uh oh!