Abort non-fully consumed S3 input streams instead of draining #62167
Conversation
Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)
Sorry for the delay here @tlrx, I'm taking a look today :)
Take your time, no hurry :)
Looks good, I wonder if we can make this change a lot smaller though by just not bothering with the object metadata in the way suggested inline?
```diff
-    private InputStream currentStream;
+    private S3ObjectInputStream currentStream;
+    private long currentStreamLastOffset;
```
See my comment below, I think this potentially isn't necessary.
```java
            return metadata.getContentLength();
        } catch (Exception e) {
            assert false : e;
            return Long.MAX_VALUE - 1L; // assume a large stream so that the underlying stream is aborted on closing, unless eof is reached
```
Same here, maybe we should just use our own `end` and `start` offsets, or use `metadata.getContentLength()` if we don't have an `end`, instead of going through the indirection of the SDK header parsing here? That seems a lot more straightforward to me and doesn't require us to be scared of random exceptions from SDK misbehavior?
Also, then we could just make our life real easy. If `eof` is set to `true` or `start + currentOffset == currentStreamLastOffset` -> close, else abort. No need to even get the length from the metadata because we'd read any open-ended stream of unknown length till EOF anyway?
> No need to even get the length from the metadata because we'd read any open-ended stream of unknown length till EOF anyway?

That's what I'm trying to avoid here; whether we know the exact range or not, the S3 endpoint should return the content length, and we can use it to know if all bytes were really consumed. The exceptional case here should never happen, and in that case we set an extra large `end`, which should force the stream to be aborted before closing anyway.
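For illustration, here is a minimal sketch of the length computation being discussed, assuming the SDK has parsed the `Content-Range` header via `ObjectMetadata.getContentRange()` for ranged GETs; this approximates the approach described above, not the PR's exact code:

```java
// Sketch only: names approximate the snippets quoted in this review.
private long getStreamLength(final S3Object object) {
    final ObjectMetadata metadata = object.getObjectMetadata();
    try {
        // For ranged GETs the SDK exposes the parsed Content-Range header
        final Long[] range = metadata.getContentRange();
        if (range != null) {
            return range[1] - range[0] + 1L; // number of bytes in the returned range
        }
        return metadata.getContentLength(); // full-object GET
    } catch (Exception e) {
        assert false : e;
        // assume a large stream so that the underlying stream is aborted on closing, unless eof is reached
        return Long.MAX_VALUE - 1L;
    }
}
```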
Fair point, I guess I still don't really like that we redundantly store the lengths and offsets here to some degree, but I also just noticed we do the same in the GCS stream as well. I suppose this approach is the safest for now :) => let's go with it then.
```java
     * suppressing all thrown exceptions.
     */
    private void maybeAbort(S3ObjectInputStream stream) {
        if (eof) {
```
`if (eof || start + currentOffset == currentStreamLastOffset) {` and drop the conditional from the `try {`, since both cases mean the same to us?
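A minimal sketch of `maybeAbort` with this suggestion applied (field names taken from the snippets above; not necessarily the exact code that was merged):

```java
/**
 * Aborts the {@link S3ObjectInputStream} if it wasn't fully consumed when this method is called,
 * suppressing all thrown exceptions.
 */
private void maybeAbort(S3ObjectInputStream stream) {
    if (eof || start + currentOffset == currentStreamLastOffset) {
        return; // all requested bytes were consumed, a plain close() is cheap
    }
    try {
        stream.abort(); // drop the connection instead of draining the remaining bytes
    } catch (Exception e) {
        // suppressed, as per the method contract quoted above
    }
}
```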
LGTM, let's do it your way :) Thanks Tanguy!
@elasticmachine update branch
Thanks Armin!
Today when an `S3RetryingInputStream` is closed, the remaining bytes that were not consumed are drained right before closing the underlying stream. In some contexts it might be more efficient not to consume the remaining bytes and to just drop the connection. This is for example the case with snapshot backed indices prewarming, where there is no point in reading potentially large blobs if we know that the cache file we want to write the content of the blob to has already been evicted. Draining all bytes here takes a slot in the prewarming thread pool for nothing.
Regular snapshot restores could also benefit from dropping the connection instead of draining bytes when the restore is aborted. As of today, the restore of a file continues even if the restore operation was aborted, taking a slot in the snapshot thread pool. By throwing an appropriate exception and aborting the S3 input stream we could quickly stop the download and free up the slot in the snapshot thread pool (this could be done in a follow-up PR).
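For a rough picture of the behaviour change, a hedged sketch of `close()` (assuming the field names used in the review snippets above; not the PR's literal code):

```java
@Override
public void close() throws IOException {
    maybeAbort(currentStream); // aborts the HTTP connection if not all bytes were consumed
    currentStream.close();     // after full consumption a plain close lets the connection be reused
}
```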