Cleanup TransportReplicationAction #12395
Conversation
martijnvg commented Jul 22, 2015
- Split the actual primary operation from the primary phase into a dedicated AsyncPrimaryAction class (similar to AsyncReplicaAction)
- Removed threaded_operation option from replication based transport actions.
- Let the threading be handled by the transport service and drop forking of new threads from the transport replication action (a sketch of the new dispatch model follows this list).
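To make the threading change concrete, here is a minimal, self-contained sketch (hypothetical names, not the actual TransportReplicationAction or TransportService API): the old path forks a thread inside the action when the per-request flag is set, while the new path binds an executor at handler registration time, so the transport layer is the only place that spawns threads.

```java
// Hypothetical, stripped-down sketch (names are illustrative, not the real
// Elasticsearch API). It contrasts the old pattern -- the action forking a thread
// itself when request.operationThreaded() is set -- with the new one, where the
// handler is registered together with an executor and the transport layer forks.
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;

class ThreadingSketch {

    /** Old style: the replication action decides whether to fork. */
    static void oldStyle(ExecutorService indexPool, boolean operationThreaded, Runnable replicaOp) {
        if (operationThreaded) {
            indexPool.execute(replicaOp);   // fork inside the action
        } else {
            replicaOp.run();                // run on the calling (network) thread
        }
    }

    /** New style: the executor is bound once, at handler registration time. */
    static <R> Consumer<R> registerRequestHandler(ExecutorService executor, Consumer<R> handler) {
        // The transport service always dispatches onto the configured executor,
        // so there is exactly one place where new threads are spawned.
        return request -> executor.execute(() -> handler.accept(request));
    }

    public static void main(String[] args) {
        ExecutorService indexPool = Executors.newFixedThreadPool(2);
        Consumer<String> replicaHandler =
                registerRequestHandler(indexPool, req -> System.out.println("replica op for " + req));
        replicaHandler.accept("shard [index][0]");
        indexPool.shutdown();
    }
}
```

Binding the executor once at registration removes the per-request threading flag and keeps thread dispatch in a single, predictable place.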
Force-pushed 1a0c33f to 680e9da
});
} else {
    if (replicaRequest.operationThreaded()) {
so happy that you removed this part!
@brwe I've fixed the test and made sure that the newly added
Force-pushed 7412f68 to 5a96d4d
I rebased this PR and moved the retry-on-primary handling entirely to the coordinating node. I also included @brwe's test from #12574, which passes with this PR. That test simulates a situation where two nodes endlessly redirect the same index request to each other, caused by the sequence of events described in #12573. The test is included because this PR changes the way primary write requests are redirected: the coordinating node remains in charge of the entire write operation even if the primary shard is on a different node, whereas before the node that holds the primary shard was always in charge of a write request. In the situation described in #12573 that could lead to nodes endlessly redirecting the same write request to each other until the cluster state caught up or something bad happened. With this PR, in the #12573 situation the write request is instead retried when a new cluster state arrives, or failed when the write timeout is reached (see the sketch below).
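As a rough illustration of the coordinating-node retry described above, here is a hedged, self-contained sketch (the ClusterStateWaiter and PrimaryResolver types are hypothetical, not the real ClusterStateObserver or routing code): instead of redirecting a request from node to node, the coordinating node re-resolves the primary only after a newer cluster state arrives, and fails the request once the timeout is reached.

```java
// Hedged sketch of the retry pattern: the coordinating node stays in charge and
// waits for the next cluster state (or the timeout) before resolving the primary again.
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class CoordinatingRetrySketch {

    interface ClusterStateWaiter {
        /** Blocks until a newer cluster state is observed or the timeout elapses. */
        boolean awaitNextState(long timeout, TimeUnit unit) throws InterruptedException;
    }

    interface PrimaryResolver {
        /** Returns the node id holding the primary, or null if unknown/relocating. */
        String resolvePrimaryNode();
    }

    static String routeWrite(ClusterStateWaiter waiter, PrimaryResolver resolver,
                             long timeoutMillis) throws Exception {
        long deadline = System.currentTimeMillis() + timeoutMillis;
        while (true) {
            String primaryNode = resolver.resolvePrimaryNode();
            if (primaryNode != null) {
                return primaryNode;               // forward the write to this node
            }
            long remaining = deadline - System.currentTimeMillis();
            if (remaining <= 0 || !waiter.awaitNextState(remaining, TimeUnit.MILLISECONDS)) {
                // No newer cluster state arrived in time: fail instead of redirecting forever.
                throw new TimeoutException("timed out waiting for a usable primary shard");
            }
            // A new cluster state arrived; loop and resolve the primary again.
        }
    }
}
```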
try {
    channel.sendResponse(e);
} catch (Throwable t) {
    logger.warn("failed to send response for get", t);
for write request?
yes :)
Overall I really like this split of primary operation and primary phase. I think this might make it much easier to understand what is happening. I left some comments and questions around testing and where we should retry, though. Hope they are not too confusing...
Force-pushed 5a96d4d to dc6c053
Force-pushed 9c0d06a to 12db140
…cated AsyncPrimaryAction class (similar to AsyncReplicaAction) Removed threaded_operation option from replication based transport actions. Let the threading be handled by the transport service and drop forking of new threads from the transport replication action.
…ccurs but retry on the node holding the primary shard
…ally then use the transport service instead of the threadpool for forking a new thread. This way there is one place where new threads are being spawned.
…ionRequest#internalShardRouting
…uest can get stuck in an endless redirect loop between nodes due to slow cluster state processing.
Force-pushed 3a9e901 to 3f64076
@bleskes I brought back the chasing of the primary shard to
final ShardRouting primary = request.internalShardRouting;
// Although this gets executed locally, this is more of an assertion, but if we change the primary action
// to be performed remotely, this check is important to perform before executing the action:
if (clusterService.localNode().id().equals(primary.currentNodeId()) == false) {
do we need to do this? if we fail to find the shard in the indicesService, we'll throw an exception anyhow?
true, it kind of made sense with how the PR used to work. I left it because IMO it would be a nice check, but let's just remove it.
The tricky part about the PR is that the cluster state is observed twice: once in the primary phase and once in the async primary action. In cases where we retry, we might miss the update to the cluster state. This would be more likely to occur if the primary action were executed remotely, which is what was planned as a follow-up issue, and that can be dangerous. It is fixable if we remember on what cluster state version we decided to execute the primary operation (see the sketch below), but that would make this change bigger than planned, and we should maybe take a break from this change and reconsider it post 2.0.
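For what the "remember the version" fix could look like, here is a minimal sketch under assumed names (the request field, classes, and exception here are hypothetical, not existing Elasticsearch code): the request carries the cluster state version used for routing, and the primary rejects it when its local state is older, turning a stale routing decision into a retryable failure rather than silently acting on outdated shard routing.

```java
// Minimal sketch of carrying the routing-decision version with the primary request.
class VersionedPrimaryRequestSketch {

    static final class PrimaryRequest {
        final long routedOnClusterStateVersion;   // version used when resolving the primary
        final String shardId;

        PrimaryRequest(long routedOnClusterStateVersion, String shardId) {
            this.routedOnClusterStateVersion = routedOnClusterStateVersion;
            this.shardId = shardId;
        }
    }

    static void executePrimary(PrimaryRequest request, long localClusterStateVersion) {
        if (localClusterStateVersion < request.routedOnClusterStateVersion) {
            // Our cluster state is behind the one the routing decision was based on:
            // throw a retryable failure so the coordinating node waits for a newer
            // state instead of operating on outdated shard routing.
            throw new IllegalStateException("local cluster state [" + localClusterStateVersion
                    + "] is older than routing state [" + request.routedOnClusterStateVersion + "]");
        }
        // ... perform the primary operation on request.shardId ...
    }
}
```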