Make Parsing SnapshotInfo more Efficient #74005

original-brownbear · 2021-06-10T12:52:51Z

Flatting the logic for parsing SnapshotInfo to go field by field like we do for RepositoryData
which is both easier to read and also faster (mostly when moving to batch multiple of these blobs into one
and doing on-the-fly filtering in an upcoming PR where the approach allows for more tricks).
Also, optimized/deduplicated the logic for parsing out (mostly/often) empty lists in the deserialization code
and used the new utility in a few more spots as well to save empty lists.

Lastly, fixed the at times very deeply nested Collections.unmodifiableList( chains that the way the duplicate constructors for x-content parsing and normal construction would cause.

Flatting the logic for parsing `SnapshotInfo` to go field by field like we do for `RepositoryData` which is both easier to read and also faster (mostly when moving to batch multiple of these blobs into one and doing on-the-fly filtering in an upcoming PR where the approach allows for more tricks). Also, simplified/deduplicated parsing out (mostly/often) empty lists in the deserialization code and used the new utility in a few more spots as well to save empty lists.

elasticmachine · 2021-06-10T12:52:54Z

Pinging @elastic/es-distributed (Team:Distributed)

DaveCTurner

I left a couple of comments. Don't really follow why this is more efficient but it's certainly neater.

...r/src/main/java/org/elasticsearch/index/snapshots/blobstore/BlobStoreIndexShardSnapshot.java

DaveCTurner · 2021-06-10T15:35:59Z

server/src/main/java/org/elasticsearch/snapshots/SnapshotInfo.java

-        this.indices = Collections.unmodifiableList(Objects.requireNonNull(indices));
-        this.dataStreams = Collections.unmodifiableList(Objects.requireNonNull(dataStreams));
-        this.featureStates = Collections.unmodifiableList(Objects.requireNonNull(featureStates));
+        this.indices = List.copyOf(indices);


Do these changes translate to 7.x ok? I guess we'll end up using org.elasticsearch.core.List#copyOf which does a complete copy every time...

Sort of ... I think we should fix the copy of behavior in 7.x if it's inefficient in this spot but still nice to not have 5x deep nesting on this list in any case I guess :) I'll look into a 7.x fix of that method later/next-week :)

DaveCTurner

LGTM

original-brownbear · 2021-06-10T18:54:36Z

Thanks David!

Flatting the logic for parsing `SnapshotInfo` to go field by field like we do for `RepositoryData` which is both easier to read and also faster (mostly when moving to batch multiple of these blobs into one and doing on-the-fly filtering in an upcoming PR where the approach allows for more tricks). Also, simplified/deduplicated parsing out (mostly/often) empty lists in the deserialization code and used the new utility in a few more spots as well to save empty lists.

original-brownbear added :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >refactoring v8.0.0 v7.14.0 labels Jun 10, 2021

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Jun 10, 2021

original-brownbear requested review from tlrx and DaveCTurner June 10, 2021 13:44

DaveCTurner reviewed Jun 10, 2021

View reviewed changes

original-brownbear requested a review from DaveCTurner June 10, 2021 15:49

DaveCTurner approved these changes Jun 10, 2021

View reviewed changes

original-brownbear merged commit d4e6e4c into elastic:master Jun 10, 2021

original-brownbear deleted the improve-snapshot-info-parsing branch June 10, 2021 18:54

original-brownbear added the backport pending label Jun 10, 2021

original-brownbear mentioned this pull request Jun 13, 2021

Make Parsing SnapshotInfo more Efficient (#74005) #74047

Merged

original-brownbear removed the backport pending label Jun 13, 2021

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

original-brownbear restored the improve-snapshot-info-parsing branch April 18, 2023 20:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make Parsing SnapshotInfo more Efficient #74005

Make Parsing SnapshotInfo more Efficient #74005

Uh oh!

original-brownbear commented Jun 10, 2021

Uh oh!

elasticmachine commented Jun 10, 2021

Uh oh!

DaveCTurner left a comment

Uh oh!

Uh oh!

DaveCTurner Jun 10, 2021

Uh oh!

original-brownbear Jun 10, 2021

Uh oh!

DaveCTurner left a comment

Uh oh!

original-brownbear commented Jun 10, 2021

Uh oh!

Uh oh!

Make Parsing SnapshotInfo more Efficient #74005

Make Parsing SnapshotInfo more Efficient #74005

Uh oh!

Conversation

original-brownbear commented Jun 10, 2021

Uh oh!

elasticmachine commented Jun 10, 2021

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

DaveCTurner Jun 10, 2021

Choose a reason for hiding this comment

Uh oh!

original-brownbear Jun 10, 2021

Choose a reason for hiding this comment

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear commented Jun 10, 2021

Uh oh!

Uh oh!