Simplify BulkProcessor handling and retry logic #24051


Merged: 6 commits into elastic:master on Apr 13, 2017

Conversation

Tim-Brooks (Contributor)

This commit collapses the SyncBulkRequestHandler and
AsyncBulkRequestHandler into a single BulkRequestHandler. The new
handler executes a bulk request and awaits its completion if the
BulkProcessor was configured with a concurrentRequests setting of 0.
Otherwise the execution happens asynchronously.

As part of this change the Retry class has been refactored.
withSyncBackoff and withAsyncBackoff have been replaced with two
versions of withBackoff. One method takes a listener that will be
called on completion. The other method returns a future that will be
completed when the request completes.
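For illustration, the two withBackoff shapes described above can be sketched with a simplified, self-contained retry helper. This is a hedged sketch, not the actual Elasticsearch Retry class: the real class retries only the failed items of a bulk request and uses ActionListener, whereas this stand-in (RetrySketch, a hypothetical name) retries a whole Callable and uses a plain BiConsumer as the "listener".

```java
import java.util.Iterator;
import java.util.concurrent.Callable;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.BiConsumer;

// Hypothetical stand-in for the two withBackoff variants:
// one returns a future, the other takes a completion callback.
public class RetrySketch {
    private final ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor();

    // Future-based variant: completed when the request finally succeeds
    // or the backoff policy (an iterator of delays) is exhausted.
    public <T> CompletableFuture<T> withBackoff(Callable<T> request,
                                                Iterator<Long> backoffMillis) {
        CompletableFuture<T> future = new CompletableFuture<>();
        attempt(request, backoffMillis, future);
        return future;
    }

    // Listener-based variant: adapts the future onto a completion callback.
    public <T> void withBackoff(Callable<T> request,
                                Iterator<Long> backoffMillis,
                                BiConsumer<T, Exception> listener) {
        withBackoff(request, backoffMillis).whenComplete((result, error) ->
                listener.accept(result, error == null ? null : (Exception) error));
    }

    private <T> void attempt(Callable<T> request, Iterator<Long> backoff,
                             CompletableFuture<T> future) {
        try {
            future.complete(request.call());
        } catch (Exception e) {
            if (backoff.hasNext()) {
                // Retry after the next backoff delay.
                scheduler.schedule(() -> attempt(request, backoff, future),
                        backoff.next(), TimeUnit.MILLISECONDS);
            } else {
                future.completeExceptionally(e);  // retries exhausted
            }
        }
    }

    public void shutdown() {
        scheduler.shutdown();
    }
}
```

The key point mirrored from the PR description is that both variants share one retry loop; only the completion surface (future vs. callback) differs.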

Tim-Brooks (Contributor, Author) commented Apr 11, 2017

The PR maintains the existing BulkProcessor features, including the concurrentRequests setting, which controls the number of concurrent requests allowed by the BulkRequestHandler. If it is set to 0, an "execute" call blocks until the bulk request completes.

This setting requires the handler to remain somewhat complicated. But I assume we want all the current features of the BulkProcessor to stay the same for backwards compatibility?
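The behavior described above, where concurrentRequests bounds in-flight requests and 0 means "block the caller", can be sketched with a semaphore plus an optional latch. This is a hypothetical simplification (HandlerSketch is an invented name; the real BulkRequestHandler drives an async client rather than a thread pool), but it shows why one code path can serve both modes.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;

// Hypothetical sketch of the unified handler: concurrentRequests == 0
// means "one in-flight request and the caller blocks until it completes".
class HandlerSketch {
    private final Semaphore semaphore;
    private final boolean blockOnExecute;
    private final ExecutorService pool = Executors.newCachedThreadPool();

    HandlerSketch(int concurrentRequests) {
        this.blockOnExecute = concurrentRequests == 0;
        // At least one permit, so the 0 case still admits one request.
        this.semaphore = new Semaphore(Math.max(1, concurrentRequests));
    }

    void execute(Runnable bulkRequest) throws InterruptedException {
        semaphore.acquire();                 // bound in-flight requests
        CountDownLatch latch = new CountDownLatch(1);
        pool.execute(() -> {
            try {
                bulkRequest.run();
            } finally {
                semaphore.release();
                latch.countDown();
            }
        });
        if (blockOnExecute) {
            latch.await();                   // concurrentRequests == 0: wait here
        }
    }

    void close() {
        pool.shutdown();
    }
}
```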

s1monw (Contributor) commented Apr 12, 2017

This setting requires the handler to remain somewhat complicated. But I assume we want all the current features of the BulkProcessor to stay the same for backwards compatibility?

My take on this: if we can fail hard when it is used, we can drop it. What would be the way for the user to block instead?

s1monw (Contributor) left a review comment


I like this a lot! I left some suggestions that can also be follow-ups (the listener).

listener.afterBulk(executionId, bulkRequest, e);
public void execute(BulkRequest bulkRequest, long executionId) {
boolean bulkRequestSetupSuccessful = false;
boolean acquired = false;

That's just a suggestion; I tend to do this the following way:

Runnable toRelease = () -> {};
// ...
semaphore.acquire();
toRelease = semaphore::release;

That way you don't need to check any boolean logic and can just call the runnable.
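Spelled out, the suggested pattern might look like the sketch below (not the PR's code; ReleaseSketch is a hypothetical wrapper). The no-op initial value means the finally block can always run toRelease, so no acquired flag is needed.

```java
import java.util.concurrent.Semaphore;

class ReleaseSketch {
    private final Semaphore semaphore = new Semaphore(1);

    void execute(Runnable work) {
        // Start with a no-op so the finally block can always run it.
        Runnable toRelease = () -> {};
        try {
            semaphore.acquire();
            toRelease = semaphore::release;  // only set once acquired
            work.run();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        } finally {
            toRelease.run();                 // no boolean bookkeeping needed
        }
    }
}
```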

semaphore.acquire();
acquired = true;
CountDownLatch latch = new CountDownLatch(1);
retry.withBackoff(consumer, bulkRequest, new ActionListener<BulkResponse>() {

We do have a LatchedActionListener, but I see that you also need to call the release method. I wonder if we can generalize LatchedActionListener into something that only takes a Runnable, so that users who want to use it with a latch can just pass a method handle like latch::countDown. We could overload ActionListener::wrap to take a runnable as a third parameter and be done with it?
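One way the suggested generalization could look, sketched against a minimal listener interface (the real ActionListener lives in Elasticsearch and this three-argument wrap overload is only the reviewer's proposal, not an existing API):

```java
import java.util.function.Consumer;

interface Listener<T> {
    void onResponse(T response);
    void onFailure(Exception e);

    // Hypothetical overload of wrap: run the given Runnable after either
    // outcome, e.g. latch::countDown or semaphore::release.
    static <T> Listener<T> wrap(Consumer<T> responseHandler,
                                Consumer<Exception> failureHandler,
                                Runnable onFinal) {
        return new Listener<T>() {
            @Override public void onResponse(T response) {
                try {
                    responseHandler.accept(response);
                } finally {
                    onFinal.run();   // runs even if the handler throws
                }
            }
            @Override public void onFailure(Exception e) {
                try {
                    failureHandler.accept(e);
                } finally {
                    onFinal.run();
                }
            }
        };
    }
}
```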

return true;
}
return false;
public boolean awaitClose(long timeout, TimeUnit unit) throws InterruptedException {

Does this need to be public? And does this class need to be subclassable?

Tim-Brooks (Contributor, Author)

What would be the way for the user to block instead?

I just feel a bit weird that the "block until completion" behavior is determined by the concurrentRequests setting being 0. Obviously the API could return a future for the user to block on, or there could be a blockingFlush() API. But since a flush can happen implicitly (an add() that crosses a threshold) or explicitly (flush()), this is not easy to do. So I'm not sure there is an obvious change.
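For illustration, the future-returning alternative mentioned above could look roughly like the sketch below. This is entirely hypothetical (ProcessorSketch, the String "documents", and the completed-synchronously futures are all invented for the example; the real BulkProcessor sends requests asynchronously), but it shows how both the implicit and explicit flush paths could hand the caller something to block on.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;

// Hypothetical: flush() hands back a future so callers choose whether to block.
class ProcessorSketch {
    private final List<String> buffer = new ArrayList<>();
    private final int flushThreshold;

    ProcessorSketch(int flushThreshold) {
        this.flushThreshold = flushThreshold;
    }

    // Implicit flush when the threshold is crossed, mirroring BulkProcessor.add().
    CompletableFuture<Integer> add(String doc) {
        buffer.add(doc);
        return buffer.size() >= flushThreshold
                ? flush()
                : CompletableFuture.completedFuture(0);
    }

    // Explicit flush; a real implementation would send the bulk request
    // asynchronously and complete the future from its response listener.
    CompletableFuture<Integer> flush() {
        int flushed = buffer.size();
        buffer.clear();
        return CompletableFuture.completedFuture(flushed);
    }
}
```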

@Tim-Brooks Tim-Brooks merged commit ffaac5a into elastic:master Apr 13, 2017
@Tim-Brooks Tim-Brooks deleted the refactor_bulk branch November 14, 2018 14:48
Labels
:Core/Infra/Core Core issues without another label >non-issue v6.0.0-alpha1