Add MlClientDocumentationIT tests for classification. #47569

przemekwitek · 2019-10-04T11:33:32Z

This PR enhances client documentation tests with the new classification analysis type:

testPutDataFrameAnalytics
testEvaluateDataFrame, testEvaluateDataFrame_Classification, testEvaluateDataFrame_Regression

Additionally, it adds basic Java High Level REST Client docs related to classification.

Relates #46735

elasticmachine · 2019-10-07T10:08:37Z

Pinging @elastic/ml-core (:ml)

elasticmachine · 2019-10-07T10:08:38Z

Pinging @elastic/es-docs (>docs)

szabosteve

LGTM

droberts195

Looks good. I left a few nits and questions. Maybe nothing needs to change if it's obvious which of the duplicate class names to use, if it's not possible to link to bookmarks on external sites, and if we don't care about full stops at the end of numbered items. But these are things to at least consider.

droberts195 · 2019-10-09T12:56:42Z

...high-level/src/test/java/org/elasticsearch/client/documentation/MlClientDocumentationIT.java

+        {
+            // tag::evaluate-data-frame-evaluation-regression
+            Evaluation evaluation =
+                new Regression( // <1>


This Regression class is not fully qualified, and I don't think the doc examples include the imports. So it isn't clear which package to choose when you type Regression into an IDE and it suggests two possible classes to import.

It might be best to rename one of the classes, or else fully qualify the name here as well as where the other one is used in the docs.

(Same for Classification on line 3341.)

Done.
I've fully qualified the names Regression and Classification in this file for now.
LMK if you like the idea of renaming Regression to RegressionEvaluation and Classification to ClassificationEvaluation (or maybe have a different idea for naming). Then I could move on with renaming.
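
To make the ambiguity concrete, here is a minimal, self-contained sketch. The holder classes `AnalysisPkg` and `EvaluationPkg` are hypothetical stand-ins for the two client packages (they are not real Elasticsearch classes), and the field names are made up for illustration. It only shows why qualifying the name removes the guesswork when two types share the simple name `Regression`.

```java
// Hypothetical stand-in for the package that holds analysis configurations.
final class AnalysisPkg {
    static final class Regression {
        final String dependentVariable;

        Regression(String dependentVariable) {
            this.dependentVariable = dependentVariable;
        }
    }
}

// Hypothetical stand-in for the package that holds evaluation configurations.
final class EvaluationPkg {
    static final class Regression {
        final String actualField;
        final String predictedField;

        Regression(String actualField, String predictedField) {
            this.actualField = actualField;
            this.predictedField = predictedField;
        }
    }
}

final class QualifiedNamesDemo {
    public static void main(String[] args) {
        // Writing the qualifier at the use site tells the reader which type is meant,
        // even when the snippet is shown without its imports.
        AnalysisPkg.Regression analysis = new AnalysisPkg.Regression("price");
        EvaluationPkg.Regression evaluation =
            new EvaluationPkg.Regression("price", "ml.price_prediction");

        System.out.println(analysis.getClass().getSimpleName() + " on " + analysis.dependentVariable);
        System.out.println(evaluation.getClass().getSimpleName() + " on "
            + evaluation.actualField + " vs " + evaluation.predictedField);
    }
}
```

Renaming one of the types (for example to RegressionEvaluation, as floated above) would avoid the need for the qualifier altogether.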

droberts195 · 2019-10-09T12:57:59Z

docs/java-rest/high-level/ml/evaluate-data-frame.asciidoc

+<2> Name of the field in the index. Its value denotes the actual (i.e. ground truth) label for an example. Must be either true or false.
+<3> Name of the field in the index. Its value denotes the probability (as per some ML algorithm) of the example being classified as positive.
+<4> The remaining parameters are the metrics to be calculated based on the two fields described above.
+<5> https://en.wikipedia.org/wiki/Precision_and_recall[Precision] calculated at thresholds: 0.4, 0.5 and 0.6


Is it possible to link to the #Precision bookmark on this page?

You mean instead of the Wikipedia link, or in addition to it?
Such a section does not exist yet on our page. Should I add it?

You can link to a specific bookmark on the Wikipedia page like this:

https://en.wikipedia.org/wiki/Precision_and_recall#Precision

I'm not sure it's possible in AsciiDoc though; maybe the # causes a problem. If it isn't possible, don't worry.

Ah, that's what you meant.

Sure, done.

droberts195 · 2019-10-09T12:58:08Z

docs/java-rest/high-level/ml/evaluate-data-frame.asciidoc

+<3> Name of the field in the index. Its value denotes the probability (as per some ML algorithm) of the example being classified as positive.
+<4> The remaining parameters are the metrics to be calculated based on the two fields described above.
+<5> https://en.wikipedia.org/wiki/Precision_and_recall[Precision] calculated at thresholds: 0.4, 0.5 and 0.6
+<6> https://en.wikipedia.org/wiki/Precision_and_recall[Recall] calculated at thresholds: 0.5 and 0.7


Is it possible to link to the #Recall bookmark on this page?

See my questions above.
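
For readers unfamiliar with callouts <5> and <6>, the following is a rough sketch of what "precision at a threshold" and "recall at a threshold" mean for a true/false ground-truth field and a predicted-probability field. `ThresholdMetrics` is a hypothetical helper written for this illustration, not part of the client, and treating `probability >= threshold` as a positive prediction is an assumption about the comparison used.

```java
// Hypothetical helper for illustration only; not an Elasticsearch client class.
final class ThresholdMetrics {

    // Precision = TP / (TP + FP). Assumes an example is predicted positive
    // when its probability is >= threshold. Returns NaN if nothing is predicted positive.
    static double precisionAt(boolean[] actual, double[] probability, double threshold) {
        int truePositives = 0;
        int falsePositives = 0;
        for (int i = 0; i < actual.length; i++) {
            if (probability[i] >= threshold) {
                if (actual[i]) {
                    truePositives++;
                } else {
                    falsePositives++;
                }
            }
        }
        int predictedPositives = truePositives + falsePositives;
        return predictedPositives == 0 ? Double.NaN : (double) truePositives / predictedPositives;
    }

    // Recall = TP / (TP + FN). Returns NaN if there are no actual positives.
    static double recallAt(boolean[] actual, double[] probability, double threshold) {
        int truePositives = 0;
        int falseNegatives = 0;
        for (int i = 0; i < actual.length; i++) {
            if (actual[i]) {
                if (probability[i] >= threshold) {
                    truePositives++;
                } else {
                    falseNegatives++;
                }
            }
        }
        int actualPositives = truePositives + falseNegatives;
        return actualPositives == 0 ? Double.NaN : (double) truePositives / actualPositives;
    }

    public static void main(String[] args) {
        // Made-up sample data: ground-truth labels and predicted probabilities.
        boolean[] actual = {true, true, false, true, false};
        double[] probability = {0.9, 0.55, 0.65, 0.45, 0.2};

        for (double threshold : new double[] {0.4, 0.5, 0.6}) {
            System.out.println("precision@" + threshold + " = " + precisionAt(actual, probability, threshold));
        }
        for (double threshold : new double[] {0.5, 0.7}) {
            System.out.println("recall@" + threshold + " = " + recallAt(actual, probability, threshold));
        }
    }
}
```

Raising the threshold shrinks the set of predicted positives, which is why each metric is reported per threshold rather than as a single number.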

droberts195

LGTM

I'm happy to merge if the docs team is happy with the numbered lists.

przemekwitek · 2019-10-10T15:23:01Z

run elasticsearch-ci/packaging-sample-matrix

przemekwitek · 2019-10-11T05:11:30Z

run elasticsearch-ci/packaging-sample

przemekwitek added the WIP label Oct 4, 2019

przemekwitek force-pushed the classification_docs branch from 41c48de to d39605a Compare October 7, 2019 07:34

przemekwitek mentioned this pull request Oct 7, 2019

[ML] Introduce classification analysis type #46735

Closed

9 tasks

przemekwitek force-pushed the classification_docs branch 5 times, most recently from 65152a7 to c8df7f8 Compare October 7, 2019 10:05

przemekwitek removed the WIP label Oct 7, 2019

przemekwitek marked this pull request as ready for review October 7, 2019 10:06

przemekwitek force-pushed the classification_docs branch from c8df7f8 to 230020a Compare October 7, 2019 10:08

przemekwitek added :ml Machine learning >docs General docs changes v7.5.0 v8.0.0 labels Oct 7, 2019

przemekwitek requested a review from szabosteve October 7, 2019 10:08

szabosteve approved these changes Oct 8, 2019

droberts195 reviewed Oct 9, 2019

droberts195 approved these changes Oct 10, 2019

przemekwitek force-pushed the classification_docs branch from e2e5d40 to 9a6a33c Compare October 10, 2019 12:15

przemekwitek added 3 commits October 10, 2019 15:08

Add MlClientDocumentationIT::testEvaluateDataFrame_Classification test

2574604

Apply code review comments

4062c43

Apply code review comments

36e6446

przemekwitek force-pushed the classification_docs branch from 9a6a33c to 36e6446 Compare October 10, 2019 13:09

przemekwitek merged commit 9b5770d into elastic:master Oct 11, 2019

przemekwitek deleted the classification_docs branch October 11, 2019 06:21

przemekwitek mentioned this pull request Oct 11, 2019

[7.x] Add MlClientDocumentationIT tests for classification. (#47569) #47896

Merged

przemekwitek added a commit to przemekwitek/elasticsearch that referenced this pull request Oct 11, 2019

Add MlClientDocumentationIT tests for classification. (elastic#47569)

c0acf51

przemekwitek added a commit that referenced this pull request Oct 11, 2019

[7.x] Add MlClientDocumentationIT tests for classification. (#47569) (#…

d210bfa

…47896)

howardhuanghua pushed a commit to TencentCloudES/elasticsearch that referenced this pull request Oct 14, 2019

Add MlClientDocumentationIT tests for classification. (elastic#47569)

f6ab7d4

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
