
Allow Parallel Snapshot Restore And Delete #51608


Merged

original-brownbear merged 5 commits into elastic:master from original-brownbear:41463 on Jan 30, 2020

Conversation

@original-brownbear original-brownbear (Member) commented Jan 29, 2020

There is no reason not to allow deletes in parallel with restores as long as they deal with different snapshots. A delete will not remove any files belonging to the snapshot being restored if it differs from the deleted snapshot, because those files are still referenced by the restoring snapshot. Also, the restore uses the snap-${uuid}.dat metadata in the shard folders to determine which files to restore, so concurrently modifying shard metadata isn't an issue either. Loading RepositoryData concurrently with modifying it is also safe nowadays, since the repository generation is tracked in the cluster state.

Closes #41463

I'll open a follow-up to this one for concurrent snapshot + restore; that will work for the same reasons concurrent delete + restore works.
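
For illustration, a minimal sketch (integration-test style, assuming an ESIntegTestCase subclass with a repository "test-repo" already registered and the same index snapshotted into "snapshot-1" and "snapshot-2"; all names are hypothetical and not taken from this PR, and assertAcked is the usual ElasticsearchAssertions helper) of the behaviour this change allows:

// kick off a restore of one snapshot without waiting for it to complete
client().admin().cluster().prepareRestoreSnapshot("test-repo", "snapshot-1")
    .setRenamePattern("(.+)").setRenameReplacement("restored-$1")
    .setWaitForCompletion(false)
    .get();

// while that restore may still be running, delete the other snapshot; before this
// change the delete would have been rejected with a ConcurrentSnapshotExecutionException
assertAcked(client().admin().cluster().prepareDeleteSnapshot("test-repo", "snapshot-2").get());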

@elasticmachine (Collaborator)

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

@@ -152,58 +152,4 @@ public void testSnapshottingWithInProgressDeletionNotAllowed() throws Exception
client().admin().cluster().prepareCreateSnapshot(repo, snapshot2).setWaitForCompletion(true).get();
assertEquals(1, client().admin().cluster().prepareGetSnapshots(repo).setSnapshots("_all").get().getSnapshots(repo).size());
}

public void testRestoreWithInProgressDeletionsNotAllowed() throws Exception {
original-brownbear (Member, Author) commented:

Removing this one instead of adjusting it: it's entirely redundant with the new resiliency test I added, which proves we don't get a deadlock in this code either and is much more useful for debugging.
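
To give a feel for it, a rough sketch (not the test from this PR; it only reuses the StepListener/continueOrDie style of SnapshotResiliencyTests visible in the diff below, and all listener and variable names are hypothetical) of how such a deterministic check can chain a restore of one snapshot with a delete of another so that both run concurrently:

final StepListener<RestoreSnapshotResponse> restoreListener = new StepListener<>();
final StepListener<AcknowledgedResponse> deleteListener = new StepListener<>();

continueOrDie(createSecondSnapshotListener, createSnapshotResponse -> {
    // start restoring the first snapshot into a renamed index ...
    client().admin().cluster().prepareRestoreSnapshot(repoName, firstSnapshot)
        .setRenamePattern(index).setRenameReplacement(restoredIndex)
        .setWaitForCompletion(true).execute(restoreListener);
    // ... and, without waiting for the restore, delete the second snapshot
    client().admin().cluster().prepareDeleteSnapshot(repoName, secondSnapshot).execute(deleteListener);
});

// both listeners completing once the deterministic task queue has drained, rather than
// one of them waiting on the other forever, is what rules out a deadlock
continueOrDie(restoreListener, restoreResponse ->
    assertEquals(0, restoreResponse.getRestoreInfo().failedShards()));
continueOrDie(deleteListener, acknowledgedResponse -> assertTrue(acknowledgedResponse.isAcknowledged()));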

final StepListener<CreateSnapshotResponse> createSnapshotResponseStepListener = new StepListener<>();

continueOrDie(createRepoAndIndex(repoName, index, shards),
createIndexResponse -> client().admin().cluster().prepareCreateSnapshot(repoName, snapshotName)
Member (reviewer) commented:

Should we also index some docs (mostly to generate more snapshot files) before and in-between snapshots, and then run a query?

original-brownbear (Member, Author) commented:

Good idea. Right now I don't expect it to make a difference, but it's good to have some checks on this :) I pushed e8fe17e

Member (reviewer) commented:

I don't expect it either but I prefer to have some shard files around :) Thanks
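
A rough sketch of what indexing documents between the snapshots and verifying them with a query can look like in that test style (hypothetical names again, reusing the StepListener/continueOrDie helpers; this is not the code of commit e8fe17e):

// index some documents so the second snapshot contains additional shard files
final BulkRequest bulkRequest = new BulkRequest().setRefreshPolicy(WriteRequest.RefreshPolicy.IMMEDIATE);
for (int i = 0; i < documents; ++i) {
    bulkRequest.add(new IndexRequest(index).source(Collections.singletonMap("field", "value-" + i)));
}
final StepListener<BulkResponse> bulkResponseListener = new StepListener<>();
client().bulk(bulkRequest, bulkResponseListener);

// ... snapshot, restore and delete as before, then check the restored index ...

final StepListener<SearchResponse> searchResponseListener = new StepListener<>();
continueOrDie(restoreListener, restoreResponse -> client().search(
    new SearchRequest(restoredIndex).source(new SearchSourceBuilder().size(0).trackTotalHits(true)),
    searchResponseListener));
continueOrDie(searchResponseListener, searchResponse ->
    assertEquals(documents, Objects.requireNonNull(searchResponse.getHits().getTotalHits()).value));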

@tlrx tlrx (Member) left a comment:

LGTM

@ywelsch ywelsch (Contributor) left a comment:

LGTM

@original-brownbear original-brownbear (Member, Author)

Thanks Tanguy + Yannick!

@original-brownbear original-brownbear merged commit 2854f5c into elastic:master Jan 30, 2020
@original-brownbear original-brownbear deleted the 41463 branch January 30, 2020 12:11
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Jan 30, 2020
original-brownbear added a commit that referenced this pull request Jan 30, 2020
@original-brownbear original-brownbear restored the 41463 branch August 6, 2020 18:23

Successfully merging this pull request may close these issues.

Allow a snapshot to be deleted if a restore for a different snapshot is in progress
5 participants