Use sequential access of stored fields in CCR #68961

dnhatn · 2021-02-13T19:32:04Z

This commit re-introduces the sequential access of stored fields in CCR. Unlike the previous change, we apply this optimization only when we are accessing 10+ consecutive document ids.

I ran a CCR benchmark, and this change increased the indexing throughput on the leader by 30%.
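
For readers skimming the description, here is a minimal sketch of the gating rule as I read it: switch to the sequential stored fields reader only when a batch holds at least 10 document ids and each id is exactly one greater than the previous one. The class name, method name, and `MIN_CONSECUTIVE_DOCS` constant are hypothetical and used for illustration only; this is not the code in `LuceneChangesSnapshot`.

```java
// Illustrative sketch only; names are hypothetical, not taken from the PR.
final class SequentialReaderDecision {
    // Assumed threshold, mirroring the "10+ consecutive document ids" wording above.
    static final int MIN_CONSECUTIVE_DOCS = 10;

    // Returns true when the batch is large enough and every doc id
    // is exactly one greater than the id before it.
    static boolean useSequentialReader(int[] docIds) {
        if (docIds.length < MIN_CONSECUTIVE_DOCS) {
            return false; // small batches keep the default random-access reader
        }
        for (int i = 0; i < docIds.length - 1; i++) {
            if (docIds[i] + 1 != docIds[i + 1]) {
                return false; // a gap or an out-of-order id disables the optimization
            }
        }
        return true;
    }
}
```

Under this reading, a batch of ids 100..109 qualifies, while a batch of three ids, or a batch of ten ids with a single gap, does not.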

dnhatn · 2021-02-16T02:10:41Z

Below is a benchmark from the eventdata dataset (append-no-conflicts challenge).

                                                       Metric |    Baseline |   Contender |     Diff |   Unit
-------------------------------------------------------------:|------------:|------------:|---------:|-------
                   Cumulative indexing time of primary shards |     105.252 |     72.2137 | -33.0379 |    min
            Min cumulative indexing time across primary shard |     20.5952 |      14.192 | -6.40315 |    min
         Median cumulative indexing time across primary shard |     21.0276 |     14.4028 | -6.62472 |    min
            Max cumulative indexing time across primary shard |     21.3481 |     14.6981 | -6.65002 |    min
          Cumulative indexing throttle time of primary shards |           0 |           0 |        0 |    min
   Min cumulative indexing throttle time across primary shard |           0 |           0 |        0 |    min
Median cumulative indexing throttle time across primary shard |           0 |           0 |        0 |    min
   Max cumulative indexing throttle time across primary shard |           0 |           0 |        0 |    min
                      Cumulative merge time of primary shards |     7.17612 |      85.695 |  78.5189 |    min
                     Cumulative merge count of primary shards |           8 |        2299 |     2291 |       
               Min cumulative merge time across primary shard |   0.0409833 |     16.5366 |  16.4956 |    min
            Median cumulative merge time across primary shard |     2.19387 |     17.0846 |  14.8908 |    min
               Max cumulative merge time across primary shard |     2.60822 |     17.7988 |  15.1906 |    min
             Cumulative merge throttle time of primary shards |     1.16685 |     4.76085 |    3.594 |    min
      Min cumulative merge throttle time across primary shard |           0 |    0.599183 |  0.59918 |    min
   Median cumulative merge throttle time across primary shard |     0.35825 |     0.68505 |   0.3268 |    min
      Max cumulative merge throttle time across primary shard |    0.415783 |     1.56858 |   1.1528 |    min
                    Cumulative refresh time of primary shards |     2.22127 |     33.6916 |  31.4704 |    min
                   Cumulative refresh count of primary shards |          81 |       20283 |    20202 |       
             Min cumulative refresh time across primary shard |    0.318417 |     6.66377 |  6.34535 |    min
          Median cumulative refresh time across primary shard |    0.437233 |     6.72832 |  6.29108 |    min
             Max cumulative refresh time across primary shard |    0.553483 |     6.84258 |   6.2891 |    min
                      Cumulative flush time of primary shards |     11.5855 |    0.503367 | -11.0822 |    min
                     Cumulative flush count of primary shards |          31 |          30 |       -1 |       
               Min cumulative flush time across primary shard |      1.9158 |   0.0930833 | -1.82272 |    min
            Median cumulative flush time across primary shard |     2.27915 |      0.0968 | -2.18235 |    min
               Max cumulative flush time across primary shard |      2.9223 |      0.1139 |  -2.8084 |    min
                                      Total Young Gen GC time |      38.794 |       5.519 |  -33.275 |      s
                                     Total Young Gen GC count |        1534 |         220 |    -1314 |       
                                        Total Old Gen GC time |           0 |           0 |        0 |      s
                                       Total Old Gen GC count |           0 |           0 |        0 |       
                                                   Store size |     5.80673 |     5.84045 |  0.03372 |     GB
                                                Translog size | 2.56114e-07 | 2.56114e-07 |        0 |     GB
                                       Heap used for segments |    0.613789 |    0.642971 |  0.02918 |     MB
                                     Heap used for doc values |   0.0842857 |   0.0664864 |  -0.0178 |     MB
                                          Heap used for terms |    0.451263 |    0.490997 |  0.03973 |     MB
                                          Heap used for norms |           0 |           0 |        0 |     MB
                                         Heap used for points |           0 |           0 |        0 |     MB
                                  Heap used for stored fields |   0.0782394 |   0.0854874 |  0.00725 |     MB
                                                Segment count |         159 |         173 |       14 |       
                                               Min Throughput |     10617.9 |     12356.9 |     1739 | docs/s
                                              Mean Throughput |     11439.8 |     15415.8 |  3975.98 | docs/s
                                            Median Throughput |     11465.1 |     15678.8 |   4213.7 | docs/s
                                               Max Throughput |     11703.4 |     16059.6 |  4356.13 | docs/s
                                      50th percentile latency |      3163.5 |     2352.58 | -810.925 |     ms
                                      90th percentile latency |     3954.71 |     2734.01 |  -1220.7 |     ms
                                      99th percentile latency |     6229.13 |      4212.4 | -2016.74 |     ms
                                    99.9th percentile latency |     27692.5 |     7813.23 | -19879.3 |     ms
                                     100th percentile latency |     36693.7 |     8577.53 | -28116.2 |     ms
                                 50th percentile service time |      3163.5 |     2352.58 | -810.925 |     ms
                                 90th percentile service time |     3954.71 |     2734.01 |  -1220.7 |     ms
                                 99th percentile service time |     6229.13 |      4212.4 | -2016.74 |     ms
                               99.9th percentile service time |     27692.5 |     7813.23 | -19879.3 |     ms
                                100th percentile service time |     36693.7 |     8577.53 | -28116.2 |     ms
                                                   error rate |           0 |           0 |        0 |      %

elasticmachine · 2021-02-16T02:11:52Z

Pinging @elastic/es-distributed (Team:Distributed)

romseygeek · 2021-02-16T10:33:14Z

server/src/test/java/org/elasticsearch/index/engine/LuceneChangesSnapshotTests.java

+                }
+                assertFalse(snapshot.useSequentialStoredFieldsReader());
+            }
+            // disable optimization for non-sequential accesses


This comment doesn't seem to correspond with the test below?

Good catch. I fixed it in 3eb853d.

jimczi

I left one comment, LGTM otherwise

jimczi · 2021-02-16T15:09:59Z

server/src/main/java/org/elasticsearch/index/engine/LuceneChangesSnapshot.java

+        }
+    }
+
+    private static boolean hasSequentialAccess(ScoreDoc[] scoreDocs) {


You can avoid the loop entirely like in FetchPhase#hasSequentialDocs

This is a bit different. Here we can access documents out of order, whereas the fetch phase always accesses them in ascending order.
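
To make the difference concrete, here is a small sketch that compares an endpoints-only shortcut (in the spirit of the `FetchPhase#hasSequentialDocs` suggestion) with a check of every adjacent pair. It works on plain `int` arrays rather than `ScoreDoc[]`, and the class and method names are hypothetical; it is an illustration, not the code in this PR or in `FetchPhase`.

```java
// Illustrative sketch only; names are hypothetical and operate on plain int arrays.
final class SequentialAccessSketch {
    // Endpoints-only shortcut: sound only when ids are unique and sorted ascending.
    static boolean endpointsCheck(int[] docIds) {
        return docIds.length > 0
            && docIds[docIds.length - 1] - docIds[0] == docIds.length - 1;
    }

    // Checks every adjacent pair, so it also rejects out-of-order input.
    static boolean adjacentPairsCheck(int[] docIds) {
        for (int i = 0; i < docIds.length - 1; i++) {
            if (docIds[i] + 1 != docIds[i + 1]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        int[] ascending = {5, 6, 7, 8};
        int[] outOfOrder = {5, 7, 6, 8};
        System.out.println(endpointsCheck(ascending));      // true
        System.out.println(adjacentPairsCheck(ascending));  // true
        System.out.println(endpointsCheck(outOfOrder));     // true (false positive)
        System.out.println(adjacentPairsCheck(outOfOrder)); // false
    }
}
```

With out-of-order ids such as 5, 7, 6, 8, the endpoints still differ by `length - 1`, so the shortcut reports a sequential run even though the access pattern is not sequential. That is why the loop is kept when ordering is not guaranteed.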

dnhatn · 2021-02-16T16:10:21Z

run elasticsearch-ci/2

dnhatn · 2021-02-16T17:57:08Z

@romseygeek @jimczi Thanks for the review.

We can't enable the sequential access optimization for stored fields of changes snapshots used in peer recoveries because they are accessed by multiple threads. Relates to #68961

dnhatn force-pushed the ccr-optimize-stored-fields branch 3 times, most recently from e7ae7e4 to 3d2dcde Compare February 15, 2021 22:05

Use sequential access stored fields in CCR

68053ef

dnhatn force-pushed the ccr-optimize-stored-fields branch from 3d2dcde to 68053ef Compare February 15, 2021 22:06

Merge branch 'master' into ccr-optimize-stored-fields

52b2382

dnhatn requested review from jimczi and romseygeek February 16, 2021 02:10

dnhatn added :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features >enhancement v7.12.0 v8.0.0 labels Feb 16, 2021

dnhatn marked this pull request as ready for review February 16, 2021 02:11

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Feb 16, 2021

romseygeek reviewed Feb 16, 2021

dnhatn added 2 commits February 16, 2021 10:00

Merge branch 'master' into ccr-optimize-stored-fields

4ca74cd

fix comments

3eb853d

jimczi approved these changes Feb 16, 2021

dnhatn requested a review from romseygeek February 16, 2021 16:09

dnhatn merged commit dad0aea into elastic:master Feb 16, 2021

dnhatn deleted the ccr-optimize-stored-fields branch February 16, 2021 17:57

dnhatn mentioned this pull request Feb 16, 2021

Use sequential access of stored fields in CCR #69083

Merged

dnhatn mentioned this pull request Feb 20, 2021

Read CCR changes from translog when possible #68790

Closed

original-brownbear mentioned this pull request Feb 22, 2021

[CI] SearchableSnapshotsIntegTests.testCreateAndRestorePartialSearchableSnapshot fails #69336

Closed

dnhatn mentioned this pull request Feb 22, 2021

Disable stored fields access optimization in recovery #69385

Merged

dnhatn mentioned this pull request Feb 22, 2021

Disable stored fields access optimization in recovery #69398

Merged

dnhatn mentioned this pull request Feb 22, 2021

Disable stored fields access optimization in recovery #69399

Merged

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
