
Deserialize BlobStore Metadata Files in a Streaming Manner #73149


Conversation

original-brownbear (Member)

We were reading the full file contents up-front here because of the complexity of verifying the footer otherwise. This commit moves the logic for reading metadata blobs (which can become quite sizable in some cases, and there are plans for larger aggregate meta blobs as well) to a streaming approach by doing the footer verification manually, since Lucene's utility methods don't allow for verification on top of a stream.

A possible follow-up to this would be to fix the write side the same way and get rid of the need to fully buffer blobs before writing there as well.
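In essence, the streaming read hands bytes to the consumer while always withholding the trailing 16 bytes that could be the footer, checksums each consumed buffer window as it refills, and only inspects the retained footer once the wrapped stream is exhausted. Below is a minimal standalone sketch of that idea, not the actual ChecksumBlobStoreFormat code; the class name, the 16-byte footer length and the big-endian trailing checksum layout are assumptions here, loosely mirroring Lucene's codec footer.

import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;
import java.util.zip.CRC32;

class FooterVerifyingInputStream extends InputStream {

    private static final int FOOTER_LENGTH = 16; // assumed footer size, mirroring CodecUtil.footerLength()

    private final InputStream in;
    private final CRC32 crc32 = new CRC32();
    private final byte[] buffer = new byte[8192];
    private int bufferPos;
    private int bufferCount;

    FooterVerifyingInputStream(InputStream in) {
        this.in = in;
    }

    @Override
    public int read() throws IOException {
        if (availableBodyBytes() <= 0) {
            return -1; // only the (possible) footer is left, never hand it to the consumer
        }
        return buffer[bufferPos++] & 0xFF;
    }

    // Returns how many bytes ahead of the potential footer are available, refilling the buffer if needed.
    private int availableBodyBytes() throws IOException {
        if (bufferCount == 0) {
            // first call: fill the whole buffer
            bufferCount = readFully(buffer, 0, buffer.length);
        } else if (bufferPos == bufferCount - FOOTER_LENGTH) {
            // everything but the trailing 16 bytes was consumed: checksum the consumed window,
            // slide the footer candidate to the front and read more from the wrapped stream
            crc32.update(buffer, 0, bufferPos);
            System.arraycopy(buffer, bufferPos, buffer, 0, FOOTER_LENGTH);
            bufferCount = FOOTER_LENGTH + readFully(buffer, FOOTER_LENGTH, buffer.length - FOOTER_LENGTH);
            bufferPos = 0;
        }
        return bufferCount - bufferPos - FOOTER_LENGTH;
    }

    // Drains any remaining body bytes, then compares the computed CRC-32 against the big-endian
    // checksum assumed to sit in the last 8 bytes of the footer.
    void verifyFooter() throws IOException {
        while (availableBodyBytes() > 0) {
            bufferPos = bufferCount - FOOTER_LENGTH; // skip unread body bytes; the refill above checksums them
        }
        // only the 16 footer-candidate bytes remain at the front of the buffer now
        crc32.update(buffer, bufferPos, 8); // assumed: the stored checksum also covers the footer's first 8 bytes
        final long expected = ByteBuffer.wrap(buffer, bufferPos + 8, 8).getLong();
        if (expected != crc32.getValue()) {
            throw new IOException("footer checksum mismatch: expected " + expected + " but got " + crc32.getValue());
        }
    }

    private int readFully(byte[] b, int off, int len) throws IOException {
        int total = 0;
        while (total < len) {
            final int read = in.read(b, off + total, len - total);
            if (read < 0) {
                break;
            }
            total += read;
        }
        return total;
    }
}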

elasticmachine added the Team:Distributed (Obsolete) label (meta label for the distributed team, obsolete; replaced by Distributed Indexing/Coordination) on May 17, 2021
elasticmachine (Collaborator)

Pinging @elastic/es-distributed (Team:Distributed)

fail("Should have failed due to corruption");
} catch (ElasticsearchCorruptionException ex) {
assertThat(ex.getMessage(), containsString("test-path"));
original-brownbear (Member Author)

No need to include the path here and complicate the code IMO; we already wrap and/or log the path of whatever failed to deserialize upstream anyway, which is how we get insight into where other IOExceptions happened.

original-brownbear (Member Author)

Jenkins run elasticsearch-ci/part-2

DaveCTurner (Contributor) left a comment

Seems nicer indeed. I left some ideas mainly on comments, assertions and tests.

@@ -98,10 +97,10 @@ public void testBlobStoreOperations() throws IOException {
MockBigArrays.NON_RECYCLING_INSTANCE);

// Assert that all checksum blobs can be read
-        assertEquals(checksumSMILE.read(blobContainer, "check-smile", xContentRegistry(), MockBigArrays.NON_RECYCLING_INSTANCE).getText(),
+        assertEquals(checksumSMILE.read(blobContainer, "check-smile", xContentRegistry()).getText(),
DaveCTurner (Contributor)

I think these tests only cover very small blobs so we're not really exercising the corners of the refilling logic. Let's have some blobs that are up to a few times larger than the buffer size too.

We also apparently only use a few different read sizes and only call the one-byte read() for reading the header.

IMO DeserializeMetaBlobInputStream could reasonably be a top-level class with some more focussed tests.

original-brownbear (Member Author)

> I think these tests only cover very small blobs so we're not really exercising the corners of the refilling logic. Let's have some blobs that are up to a few times larger than the buffer size too.

++ I adjusted the test to use some larger blobs now.

> We also apparently only use a few different read sizes and only call the one-byte read() for reading the header.

Right, as far as I can tell it's just single-byte reads while reading the header, 4k reads for the decompressing stream's buffer, and exactly 8000 bytes at a time from the Jackson SMILE parser at the moment. Testing with random sizes of up to 3x8k should run into all mathematically possible corner cases now, I think. At least I was able to run 100k+ iterations of the new tests without failure.
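For illustration, the kind of randomized sizing described above could look like the sketch below, exercised against the illustrative FooterVerifyingInputStream from earlier (so it is self-contained and not the real BlobStoreFormatTests code): blob bodies of random length up to 3x the 8k buffer, read back byte by byte so the buffer-refill path is crossed at varying offsets.

import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.util.Arrays;
import java.util.Random;
import java.util.zip.CRC32;

public class RandomBlobSizeCheck {

    public static void main(String[] args) throws IOException {
        final Random random = new Random();
        for (int iteration = 0; iteration < 1000; iteration++) {
            // blob bodies of random length, up to roughly 3x the 8k buffer size
            final int bodyLength = 1 + random.nextInt(3 * 8192);
            final byte[] body = new byte[bodyLength];
            random.nextBytes(body);

            // append the assumed 16-byte footer: 8 header bytes plus a big-endian CRC-32
            // over everything written before the checksum itself
            final CRC32 crc = new CRC32();
            final byte[] footerHead = new byte[8];
            crc.update(body, 0, body.length);
            crc.update(footerHead, 0, footerHead.length);
            final ByteArrayOutputStream blob = new ByteArrayOutputStream();
            blob.write(body);
            blob.write(footerHead);
            blob.write(ByteBuffer.allocate(8).putLong(crc.getValue()).array());

            // read the body back byte by byte to cross the refill boundary at different offsets
            final FooterVerifyingInputStream in =
                new FooterVerifyingInputStream(new ByteArrayInputStream(blob.toByteArray()));
            final ByteArrayOutputStream readBack = new ByteArrayOutputStream();
            int b;
            while ((b = in.read()) != -1) {
                readBack.write(b);
            }
            in.verifyFooter(); // must pass for an uncorrupted blob
            if (Arrays.equals(body, readBack.toByteArray()) == false) {
                throw new AssertionError("round-trip mismatch for body of length " + bodyLength);
            }
        }
    }
}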

DaveCTurner (Contributor), May 18, 2021

I'm still concerned about coverage, for instance we apparently never hit the interesting cases in read():

diff --git a/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java b/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
index 7d43b3d715e..02b8aba2fd0 100644
--- a/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
+++ b/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
@@ -154,9 +154,11 @@ public final class ChecksumBlobStoreFormat<T extends ToXContent> {
         @Override
         public int read() throws IOException {
             if (buffered() <= 0) {
+                assert bufferCount == 0;
                 fill();
             }
             if (buffered() <= 0) {
+                assert false;
                 return -1;
             }
             return buffer[bufferPos++];

Fine for now but the scope for future bugs worries me.

I think I'd be semi-happy if we consolidated the logic that tries to make sure some bytes are available, something like this (with a suitable renaming too):

diff --git a/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java b/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
index 7d43b3d715e..8b3738ab141 100644
--- a/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
+++ b/server/src/main/java/org/elasticsearch/repositories/blobstore/ChecksumBlobStoreFormat.java
@@ -153,10 +153,7 @@ public final class ChecksumBlobStoreFormat<T extends ToXContent> {

         @Override
         public int read() throws IOException {
-            if (buffered() <= 0) {
-                fill();
-            }
-            if (buffered() <= 0) {
+            if (getAvailable() <= 0) {
                 return -1;
             }
             return buffer[bufferPos++];
@@ -187,10 +184,7 @@ public final class ChecksumBlobStoreFormat<T extends ToXContent> {
         }

         private int doRead(byte[] b, int off, int len) throws IOException {
-            if (buffered() <= 0) {
-                fill();
-            }
-            final int available = buffered();
+            final int available = getAvailable();
             if (available < 0) {
                 return -1;
             }
@@ -237,24 +231,26 @@ public final class ChecksumBlobStoreFormat<T extends ToXContent> {
             return CompressorFactory.COMPRESSOR.isCompressed(new BytesArray(buffer, bufferPos, bufferCount - bufferPos));
         }

-        private int buffered() {
-            // bytes in the buffer minus 16 bytes that could be the footer
-            return bufferCount - bufferPos - CodecUtil.footerLength();
-        }
-
-        private void fill() throws IOException {
+        /**
+         * @return the number of bytes available in the buffer, possibly refilling the buffer if needed
+         */
+        private int getAvailable() throws IOException {
+            final int footerLen = CodecUtil.footerLength();
             if (bufferCount == 0) {
+                // first read, fill the buffer
+                assert bufferPos == 0;
                 bufferCount = Streams.readFully(in, buffer, 0, buffer.length);
-            } else {
+            } else if (bufferPos == bufferCount - footerLen) {
                 // crc and discard all but the last 16 bytes in the buffer that might be the footer bytes
-                final int footerLen = CodecUtil.footerLength();
                 assert bufferCount >= footerLen;
-                assert bufferPos == bufferCount - footerLen;
                 crc32.update(buffer, 0, bufferPos);
                 System.arraycopy(buffer, bufferPos, buffer, 0, footerLen);
                 bufferCount = footerLen + Streams.readFully(in, buffer, footerLen, buffer.length - footerLen);
                 bufferPos = 0;
             }
+
+            // bytes in the buffer minus 16 bytes that could be the footer
+            return bufferCount - bufferPos - footerLen;
         }
     }

original-brownbear (Member Author), May 18, 2021

++ applied that change, hope semi-happy is ok in the short run :)

original-brownbear (Member Author)

Thanks @DaveCTurner :) All suggestions applied, with the exception of extracting the stream to a top-level class. It seemed that, for now, just testing a wider range of blob sizes gives us the same coverage with less noise.

tlrx (Member) left a comment

LGTM

        boolean nextBytesCompressed() {
            // we already have bytes buffered here because we verify the blob's header (far less than the 8k buffer size) before calling
            // this method
            return CompressorFactory.COMPRESSOR.isCompressed(new BytesArray(buffer, bufferPos, bufferCount - bufferPos));
tlrx (Member)

Maybe assert that bufferPos > 0?
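For illustration, the suggested assertion would slot into the quoted method roughly like this (a sketch, not necessarily what was committed):

        boolean nextBytesCompressed() {
            // we already have bytes buffered here because we verify the blob's header (far less than
            // the 8k buffer size) before calling this method
            assert bufferPos > 0 : "header should have been consumed before checking for compression";
            return CompressorFactory.COMPRESSOR.isCompressed(new BytesArray(buffer, bufferPos, bufferCount - bufferPos));
        }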

DaveCTurner (Contributor) left a comment

I think we have a problem in BlobStoreFormatTests#testBlobCorruption, since we now start reading the blob before checking it's valid so we get all sorts of other exceptions too.

original-brownbear (Member Author)

> I think we have a problem in BlobStoreFormatTests#testBlobCorruption, since we now start reading the blob before checking it's valid so we get all sorts of other exceptions too.

Urgh, nice find :) I partly fixed this by expanding the try-catch scope to include the header check and partly by just expecting more exceptions in the test. I didn't want to blanket catch-and-rethrow the ones I only added in the test, because there's no "proof" of corruption with those and they may be the result of random bugs in our parsing (e.g. when we moved to no longer allowing duplicate fields and would have started throwing all kinds of exceptions).

DaveCTurner (Contributor) left a comment

👍 LGTM thanks for the extra iterations

DaveCTurner (Contributor) left a comment

Hmm, actually no, I've had a change of heart. I think throwing these random exceptions on corruption will cause pain, since they're indistinguishable from actual corruption. Can we make it so that we read to the end in case of an exception and report whether the checksum was broken in any case?

original-brownbear (Member Author)

@DaveCTurner fair point :)

I pushed 9c7fb6e to always verify the footer.
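One way such "always verify the footer" logic can be structured, sketched here against the illustrative FooterVerifyingInputStream from earlier rather than the actual commit (StreamingBlobReader and Parser are hypothetical names): if deserialization throws, drain the rest of the blob and check the checksum anyway, and only surface the parse failure when the footer turns out to be intact.

import java.io.IOException;
import java.io.InputStream;

final class StreamingBlobReader {

    interface Parser<T> {
        T parse(InputStream in) throws IOException;
    }

    static <T> T readAndVerify(FooterVerifyingInputStream in, Parser<T> parser) throws IOException {
        final T result;
        try {
            result = parser.parse(in);
        } catch (IOException | RuntimeException parseFailure) {
            try {
                in.verifyFooter(); // drains to the end of the blob and checks the checksum anyway
            } catch (IOException corruption) {
                corruption.addSuppressed(parseFailure);
                throw corruption; // surface the corruption rather than the downstream parse error
            }
            throw parseFailure; // footer was intact, so the parse failure is the real problem
        }
        in.verifyFooter(); // happy path: parsed cleanly, but the footer must still check out
        return result;
    }
}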

DaveCTurner (Contributor) left a comment

Thanks, yes that's better IMO. LGTM now.

original-brownbear (Member Author)

Thanks David & Tanguy!

original-brownbear merged commit 6dd2a2a into elastic:master on May 18, 2021
original-brownbear deleted the efficient-reading-blob-metadata branch on May 18, 2021 12:33
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Jun 13, 2021
…3149)

original-brownbear added a commit that referenced this pull request Jun 14, 2021
…74050)

original-brownbear restored the efficient-reading-blob-metadata branch on April 18, 2023 20:50
Labels
:Distributed Coordination/Snapshot/Restore (anything directly related to the `_snapshot/*` APIs), >non-issue, Team:Distributed (Obsolete), v7.14.0, v8.0.0-alpha1