Reindex search resiliency #45497

henningandersen · 2019-08-13T11:01:42Z

Local reindex can now survive loosing data nodes that contain source
data. The original query will be restarted with a filter for
_seq_no >= last_seq_no when a failure is detected.

Part of #42612 and split out from #43187

A couple of follow-ups will be done (all indicated with todos) in addition to what is in the meta issue:

Fix the problem with allowPartialSearchResult in master
Build the ScrollableHitSource separately rather than in super constructor.
SeqNo mapping creation or avoid the issue.

Local reindex can now survive loosing data nodes that contain source data. The original query will be restarted with a filter for `_seq_no >= last_seq_no` when a failure is detected. Part of elastic#42612 and split out from elastic#43187

elasticmachine · 2019-08-13T11:01:44Z

Pinging @elastic/es-distributed

Tim-Brooks · 2019-08-15T03:02:02Z

I will take a look at this tomorrow.

Tim-Brooks

This is mostly looking good to me. I don't see any issues with the approach or your design. It seems likely that the test failures are related to this change. Do you want to let me know when I should look again to officially approve?

Tim-Brooks · 2019-08-16T03:12:51Z

...s/reindex/src/main/java/org/elasticsearch/index/reindex/AbstractAsyncBulkByScrollAction.java

@@ -135,7 +136,9 @@
        this.listener = listener;
        BackoffPolicy backoffPolicy = buildBackoffPolicy();
        bulkRetry = new Retry(BackoffPolicy.wrap(backoffPolicy, worker::countBulkRetry), threadPool);
-        scrollSource = buildScrollableResultSource(backoffPolicy);
+        // todo: this is trappy, since if a subclass override relies on subclass fields, they are not initialized. We should fix


I agree. This was causing me problems in a recent PR.

…_search_resiliency

No longer retry/restart the original search request, since this is not what we used to do and it leads to long wait time to get the info back that a search request is bad.

henningandersen · 2019-08-16T11:27:44Z

Thanks @tbrooks8 and sorry for not fixing the test failure before now. It should be ready for another round (provided tests succeed).

No longer fail on the empty index. So far we consider this a workaround/hack more than the solution.

Use unmapped_type to circumvent problem with newly created indices without a mapping, since this is local to reindex. Once type removal is complete, ensuring that the metadata fields are always available likely becomes easier, so we will defer a solution to the search/sort problem.

henningandersen · 2019-08-18T17:29:37Z

@elasticmachine run elasticsearch-ci/1

Tim-Brooks

LGTM

Reindex search resiliency

812e5eb

Local reindex can now survive loosing data nodes that contain source data. The original query will be restarted with a filter for `_seq_no >= last_seq_no` when a failure is detected. Part of elastic#42612 and split out from elastic#43187

henningandersen added >enhancement v8.0.0 :Distributed Indexing/Reindex Issues relating to reindex that are not caused by issues further down labels Aug 13, 2019

henningandersen requested a review from Tim-Brooks August 14, 2019 10:49

Tim-Brooks reviewed Aug 16, 2019

View reviewed changes

henningandersen added 2 commits August 16, 2019 10:03

Merge remote-tracking branch 'origin/reindex_v2' into enhance_reindex…

0234278

…_search_resiliency

Reindex search resiliency

a0176b2

No longer retry/restart the original search request, since this is not what we used to do and it leads to long wait time to get the info back that a search request is bad.

henningandersen requested a review from Tim-Brooks August 16, 2019 11:26

henningandersen added 2 commits August 16, 2019 18:36

Reindex search resiliency

536a8a3

No longer fail on the empty index. So far we consider this a workaround/hack more than the solution.

Tim-Brooks approved these changes Aug 19, 2019

View reviewed changes

henningandersen merged commit 3ee5c4f into elastic:reindex_v2 Aug 19, 2019

henningandersen mentioned this pull request Aug 23, 2019

Reindex search resiliency prototype #43187

Closed

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reindex search resiliency #45497

Reindex search resiliency #45497

henningandersen commented Aug 13, 2019

elasticmachine commented Aug 13, 2019

Tim-Brooks commented Aug 15, 2019

Tim-Brooks left a comment

Tim-Brooks Aug 16, 2019

henningandersen commented Aug 16, 2019

henningandersen commented Aug 18, 2019

Tim-Brooks left a comment

Reindex search resiliency #45497

Reindex search resiliency #45497

Conversation

henningandersen commented Aug 13, 2019

elasticmachine commented Aug 13, 2019

Tim-Brooks commented Aug 15, 2019

Tim-Brooks left a comment

Choose a reason for hiding this comment

Tim-Brooks Aug 16, 2019

Choose a reason for hiding this comment

henningandersen commented Aug 16, 2019

henningandersen commented Aug 18, 2019

Tim-Brooks left a comment

Choose a reason for hiding this comment