[ML] Additional outlier detection parameters #47600

dimitris-athanasiou · 2019-10-04T18:52:04Z

Adds the following parameters to outlier_detection:

compute_feature_influence (boolean): whether to compute or not
feature influence scores
outlier_fraction (double): the proportion of the data set assumed
to be outlying prior to running outlier detection
standardization_enabled (boolean): whether to apply standardization
to the feature values

Adds the following parameters to `outlier_detection`: - `compute_feature_influence` (boolean): whether to compute or not feature influence scores - `outlier_fraction` (double): the proportion of the data set assumed to be outlying prior to running outlier detection - `standardization_enabled` (boolean): whether to apply standardization to the feature values

elasticmachine · 2019-10-04T18:52:06Z

Pinging @elastic/ml-core (:ml)

dimitris-athanasiou · 2019-10-04T18:52:38Z

@szabosteve Could you please take a look at the docs changes of this PR?

dimitris-athanasiou · 2019-10-04T18:53:12Z

@dolaru Pinging as this might warrant new test scenarios for our QA

benwtrent

LGTM!

benwtrent · 2019-10-04T19:59:59Z

.../core/src/main/java/org/elasticsearch/xpack/core/ml/dataframe/analyses/OutlierDetection.java

    }

    public OutlierDetection(StreamInput in) throws IOException {
        nNeighbors = in.readOptionalVInt();
        method = in.readBoolean() ? in.readEnum(Method.class) : null;
        featureInfluenceThreshold = in.readOptionalDouble();
+        if (in.getVersion().onOrAfter(Version.V_7_5_0)) {


now that you have BWC tests, this may have to be changed to V_8_0_0

Yes, that's why I'm holding on before merging the BWC tests :-)

dimitris-athanasiou · 2019-10-05T09:49:28Z

This depends on elastic/ml-cpp#716 to be merged first.

szabosteve

Thank you for taking care of this.
LGTM.

przemekwitek

LGTM

przemekwitek · 2019-10-07T07:03:29Z

.../src/test/java/org/elasticsearch/xpack/core/ml/dataframe/analyses/OutlierDetectionTests.java

+        OutlierDetection outlierDetection = new OutlierDetection.Builder().build();
+        Map<String, Object> params = outlierDetection.getParams();
+        assertThat(params.size(), equalTo(3));
+        assertThat(params.containsKey("compute_feature_influence"), is(true));


FYI: There are hasKey and hasEntry methods in org.hamcrest.Matchers which you could use here.

przemekwitek · 2019-10-07T07:13:13Z

docs/java-rest/high-level/ml/put-data-frame-analytics.asciidoc

@@ -96,6 +96,10 @@ include-tagged::{doc-tests-file}[{api}-outlier-detection-customized]
 <1> Constructing a new OutlierDetection object
 <2> The method used to perform the analysis
 <3> Number of neighbors taken into account during analysis
+<4> The min `outlier_score` required to compute feature influence


Just curious: Could the functionality of compute_feature_influence setting be achieved with setting min_outlier_score to a very high number?

Indeed, one can set feature_influence_threshold to 1 to achieve the same as setting compute_feature_influence to false.

dimitris-athanasiou · 2019-10-07T10:51:01Z

run elasticsearch-ci/1

dimitris-athanasiou · 2019-10-07T10:51:08Z

run elasticsearch-ci/2

dimitris-athanasiou · 2019-10-07T11:58:07Z

run elasticsearch-ci/1

Adds the following parameters to `outlier_detection`: - `compute_feature_influence` (boolean): whether to compute or not feature influence scores - `outlier_fraction` (double): the proportion of the data set assumed to be outlying prior to running outlier detection - `standardization_enabled` (boolean): whether to apply standardization to the feature values Backport of elastic#47600

Adds the following parameters to `outlier_detection`: - `compute_feature_influence` (boolean): whether to compute or not feature influence scores - `outlier_fraction` (double): the proportion of the data set assumed to be outlying prior to running outlier detection - `standardization_enabled` (boolean): whether to apply standardization to the feature values Backport of #47600

dimitris-athanasiou added >enhancement :ml Machine learning v8.0.0 v7.5.0 labels Oct 4, 2019

Fix integ tests and add test with custom params

20ba159

benwtrent approved these changes Oct 4, 2019

View reviewed changes

Fix test

ecc0ad6

szabosteve approved these changes Oct 7, 2019

View reviewed changes

przemekwitek approved these changes Oct 7, 2019

View reviewed changes

dimitris-athanasiou merged commit e99435a into elastic:master Oct 7, 2019

dimitris-athanasiou deleted the additional-outlier-detection-params branch October 7, 2019 12:28

dimitris-athanasiou mentioned this pull request Oct 7, 2019

[7.x][ML] Additional outlier detection parameters (#47600) #47669

Merged

Mpdreamz mentioned this pull request Nov 19, 2019

[meta] 7.5 release elastic/elasticsearch-net#4232

Closed

24 tasks

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Additional outlier detection parameters #47600

[ML] Additional outlier detection parameters #47600

dimitris-athanasiou commented Oct 4, 2019

elasticmachine commented Oct 4, 2019

dimitris-athanasiou commented Oct 4, 2019

dimitris-athanasiou commented Oct 4, 2019

benwtrent left a comment

benwtrent Oct 4, 2019

dimitris-athanasiou Oct 5, 2019

dimitris-athanasiou commented Oct 5, 2019

szabosteve left a comment

przemekwitek left a comment

przemekwitek Oct 7, 2019

przemekwitek Oct 7, 2019

dimitris-athanasiou Oct 7, 2019

dimitris-athanasiou commented Oct 7, 2019

dimitris-athanasiou commented Oct 7, 2019

dimitris-athanasiou commented Oct 7, 2019

[ML] Additional outlier detection parameters #47600

[ML] Additional outlier detection parameters #47600

Conversation

dimitris-athanasiou commented Oct 4, 2019

elasticmachine commented Oct 4, 2019

dimitris-athanasiou commented Oct 4, 2019

dimitris-athanasiou commented Oct 4, 2019

benwtrent left a comment

Choose a reason for hiding this comment

benwtrent Oct 4, 2019

Choose a reason for hiding this comment

dimitris-athanasiou Oct 5, 2019

Choose a reason for hiding this comment

dimitris-athanasiou commented Oct 5, 2019

szabosteve left a comment

Choose a reason for hiding this comment

przemekwitek left a comment

Choose a reason for hiding this comment

przemekwitek Oct 7, 2019

Choose a reason for hiding this comment

przemekwitek Oct 7, 2019

Choose a reason for hiding this comment

dimitris-athanasiou Oct 7, 2019

Choose a reason for hiding this comment

dimitris-athanasiou commented Oct 7, 2019

dimitris-athanasiou commented Oct 7, 2019

dimitris-athanasiou commented Oct 7, 2019