Skip to content

Commit 6ca49ce

Browse files
authored
Merge pull request elastic#29 from jimczi/challenges/index_sorting
Add index sorting challenges to nightly Closes elastic#27
2 parents 1399045 + 7eb7e65 commit 6ca49ce

File tree

4 files changed

+20
-7
lines changed

4 files changed

+20
-7
lines changed

external/pages/adhoc/geonames/index.html

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -103,6 +103,10 @@ <h3>Overview</h3>
103103
<li><strong>Append Fast</strong>: Indexes the whole document corpus using a setup that will lead to a larger indexing
104104
throughput than the default settings. Document ids are unique so all index operations are append only.
105105
</li>
106+
<li><strong>Append Fast Sorted</strong>: Indexes the whole document corpus in an index sorted by country_code field in ascending order
107+
and using a setup that will lead to a larger indexing throughput than the default settings.
108+
Document ids are unique so all index operations are append only.
109+
</li>
106110
<li><strong>Id Conflicts</strong>: Indexes the whole document corpus using a setup that will lead to a larger indexing
107111
throughput than the default settings. Rally will produce duplicate ids in 25% of all documents (not configurable) so we
108112
can simulate a scenario with appends most of the time and some updates in between.

external/pages/adhoc/nyc_taxis/index.html

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -94,10 +94,12 @@ <h3>Overview</h3>
9494
it to evaluate the performance of Elasticsearch for structured data. We run the following variation (which we call
9595
"challenge" in Rally):</p>
9696
<ul>
97-
<li><strong>Append</strong>: Indexes the whole document corpus using Elasticsearch default settings. We only adjust the
98-
number of replicas as we benchmark a single node cluster and Rally will only start the benchmark if the cluster turns
99-
green. Document ids are unique so all index operations are append only. After that a couple of queries are run in
100-
parallel by multiple clients.
97+
<li><strong>Append</strong>: Indexes the whole document corpus using a setup that will lead to a larger indexing throughput than the default settings
98+
and produce a smaller index (higher compression rate). Document ids are unique so all index operations are append only.
99+
</li>
100+
<li><strong>Append Sorted</strong>: Indexes the whole document corpus in an index sorted by pickup_datetime field in descending order (most recent first)
101+
and using a setup that will lead to a larger indexing throughput than the default settings and produce a smaller index (higher compression rate).
102+
Document ids are unique so all index operations are append only.
101103
</li>
102104
</ul>
103105
<p>The benchmarks are run either for an out of the box configuration of Elasticsearch but with a larger heap of 4GB. For more

external/pages/adhoc/pmc/index.html

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -102,6 +102,9 @@ <h3>Overview</h3>
102102
<li><strong>Append Fast</strong>: Indexes the whole document corpus using a setup that will lead to a larger indexing
103103
throughput than the default settings. Document ids are unique so all index operations are append only.
104104
</li>
105+
<li><strong>Append Fast Sorted</strong>: Indexes the whole document corpusin an index sorted by timestamp field in descending order (most recent first)
106+
and using a setup that will lead to a larger indexing throughput than the default settings. Document ids are unique so all index operations are append only.
107+
</li>
105108
<li><strong>Id Conflicts</strong>: Indexes the whole document corpus using a setup that will lead to a larger indexing
106109
throughput than the default settings. Rally will produce duplicate ids in 25% of all documents (not configurable) so we
107110
can simulate a scenario with appends most of the time and some updates in between.

night_rally.py

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@
2020
["append-no-conflicts", "defaults"],
2121
["append-no-conflicts-index-only", "4gheap"],
2222
["append-fast-no-conflicts", "4gheap"],
23+
["append-fast-sorted-no-conflicts", "4gheap"],
2324
["append-fast-with-conflicts", "4gheap"],
2425
["append-no-conflicts-index-only-1-replica", "two_nodes"],
2526
["append-no-conflicts-index-only", "verbose_iw"]
@@ -41,18 +42,21 @@
4142
["append-no-conflicts-index-only", "defaults"],
4243
["append-no-conflicts", "4gheap"],
4344
["append-fast-no-conflicts", "4gheap"],
45+
["append-fast-sorted-no-conflicts", "4gheap"],
4446
["append-fast-with-conflicts", "4gheap"],
4547
["append-no-conflicts-index-only-1-replica", "two_nodes"]
4648
]
4749

4850
tracks["nyc_taxis"] = [
49-
["append-no-conflicts", "4gheap"]
51+
["append-no-conflicts", "4gheap"],
52+
["append-sorted-no-conflicts-index-only", "4gheap"]
5053
]
5154

5255
tracks["logging"] = [
5356
["append-no-conflicts-index-only", "defaults"],
5457
["append-no-conflicts", "4gheap"],
55-
["append-fast-no-conflicts", "4gheap"]
58+
["append-fast-no-conflicts", "4gheap"],
59+
["append-fast-sorted-no-conflicts", "4gheap"]
5660
]
5761

5862
# default challenge / car per track
@@ -124,7 +128,7 @@ def ensure_dir(directory):
124128
def sanitize(text):
125129
"""
126130
Sanitizes the input text so it is safe to use as an environment name in Rally.
127-
131+
128132
:param text: A text to sanitize
129133
"""
130134
return text.lower().replace(" ", "-").replace(".", "_")

0 commit comments

Comments
 (0)