[DOCS] Adds data frame analytics API and evaluate API resource documentation #43972

Merged
merged 14 commits into from
Jul 11, 2019

Conversation

szabosteve
Contributor

@szabosteve szabosteve commented Jul 4, 2019

This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool.

NOTE: This PR must be merged after elastic/docs#968

@szabosteve szabosteve added :ml Machine learning :ml/Transform Transform >docs General docs changes v7.3.0 v7.4.0 v8.0.0 labels Jul 4, 2019
@elasticmachine
Collaborator

Pinging @elastic/ml-core

@elasticmachine
Collaborator

Pinging @elastic/es-docs

metrics:

`auc_roc` (Optional)::
(object) The AUC ROC score and optionally the curve.
Contributor

  • For each metric we should probably have another section where we explain each object.
  • Also, I wonder if we should add links to wikipedia pages here and for the other metrics.
  • Finally, let's also explain what auc_roc stands for: area under the curve of the receiver operating characteristic
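As a rough illustration of what the metric measures, here is a minimal Python sketch. It uses the rank interpretation of AUC ROC: the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one, with ties counted as half. The function name and the sample data are hypothetical, and this is not how the evaluate API computes the value internally.

```python
def auc_roc(labels, scores):
    """Area under the ROC curve via pairwise comparison of scores.

    labels: iterable of truthy/falsy values (truthy = positive class).
    scores: iterable of numeric scores, higher = more likely positive.
    """
    positives = [s for y, s in zip(labels, scores) if y]
    negatives = [s for y, s in zip(labels, scores) if not y]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(positives) * len(negatives))


# Hypothetical sample data: two negatives followed by two positives.
example = auc_roc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

A score of 1.0 means every positive outranks every negative, and 0.5 corresponds to random ordering.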

Contributor Author

  • I'll add these sections later as it takes some more time to produce them.
  • I haven't seen such external references, but I'll ask if it is possible to add.
  • The latest commit explains the abbreviation.

@szabosteve szabosteve requested a review from lcawl July 4, 2019 16:06
@tveasey
Contributor

tveasey commented Jul 5, 2019

One last comment. In the description of the default behaviour of the method parameter, I'd change the sentence:

If the method is not set, the system assembles different models that use different methods, then hierarchically combines the different models to find a solution that applies to the dataset.

to

If the method is not set, {oldetection} uses an ensemble of different methods, then normalises and combines their individual outlier scores to obtain the overall {olscore}.

The technical term for what we do is to create an ensemble. The way we combine the results is not really hierarchical so I would just say combine and not be specific. We can provide more technical details of exactly how we combine the results, but it would probably be too much to include in the API docs. However, knowing that we use a normalised combination of individual method scores is useful.
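The idea of a normalised combination can be sketched in Python as follows. This is only an illustration under stated assumptions: the thread does not say how scores are normalised or combined, so min-max scaling and a plain average are stand-ins chosen for simplicity, and the method names are hypothetical.

```python
def combine_outlier_scores(method_scores):
    """Combine per-method outlier scores into one score per data point.

    method_scores: dict mapping a method name to a list of raw scores,
    one per data point. All lists must have the same length.

    Assumption: min-max normalisation to [0, 1], then an unweighted mean.
    The real combination used by the product may differ.
    """
    normalised = []
    for scores in method_scores.values():
        lo, hi = min(scores), max(scores)
        span = hi - lo
        # A method that gives every point the same score carries no signal.
        normalised.append([(s - lo) / span if span else 0.0 for s in scores])

    n_methods = len(normalised)
    # zip(*...) walks the data points, collecting one score per method.
    return [sum(point) / n_methods for point in zip(*normalised)]


# Hypothetical raw scores from two methods on three data points.
overall = combine_outlier_scores({
    "method_a": [1, 2, 3],
    "method_b": [10, 10, 40],
})
```

Normalising first keeps a method with a large raw score range from dominating the combined result.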

@jpountz jpountz removed the v7.3.0 label Jul 5, 2019
Contributor

@tveasey tveasey left a comment

Couple of minor points on model_memory_limit, but otherwise looks good @szabosteve.

@lcawl
Contributor

lcawl commented Jul 8, 2019

@elasticmachine update branch

@elasticmachine
Collaborator

merge conflict between base and head

[testenv="platinum"]
[[ml-evaluate-dfanalytics-resources]]
=== {dfanalytics-cap} evaluation resources

Contributor

This page is different from most of the other pages in the "Definitions" section, since it seems to define the input (request body) properties for the evaluate DF analytics API rather than the output (response body) properties. In many other cases, the input and output are similar (for example, the input to create job matches the output from get jobs, so the "job resources" page applies to both). That doesn't seem to be the case here, though.

I think we should either (a) extend the evaluation resources page to also describe the response objects, or (b) move the configuration objects into the API reference page and only cover the response objects in the resources page.

If I've explained this poorly or misunderstood the goal of this page, just let me know!

Contributor Author

The resources page contains both the request body and the response body parameters.

Some parameters overlap: for example, auc_roc, precision, recall, and confusion_matrix can appear in both the request body and the response body, whereas tp, fp, tn, and fn appear only in the response body. As far as I can see, all the response objects are covered here.
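To show how the tp, fp, tn, and fn counts relate to precision and recall, here is a minimal Python sketch. It assumes that a data point is predicted positive when its probability is at or above the threshold; the thread does not state this rule, so treat it as an assumption. The function names and sample data are hypothetical and are not part of the evaluate API.

```python
def confusion_counts(actual, predicted_probability, threshold):
    """Count tp, fp, tn, fn at a given threshold.

    Assumption: a point is predicted positive when probability >= threshold.
    """
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for is_positive, probability in zip(actual, predicted_probability):
        predicted_positive = probability >= threshold
        if predicted_positive and is_positive:
            counts["tp"] += 1
        elif predicted_positive and not is_positive:
            counts["fp"] += 1
        elif not predicted_positive and not is_positive:
            counts["tn"] += 1
        else:
            counts["fn"] += 1
    return counts


def precision(counts):
    """Fraction of predicted positives that are actually positive."""
    predicted = counts["tp"] + counts["fp"]
    return counts["tp"] / predicted if predicted else 0.0


def recall(counts):
    """Fraction of actual positives that were predicted positive."""
    positives = counts["tp"] + counts["fn"]
    return counts["tp"] / positives if positives else 0.0


# Hypothetical sample data evaluated at a threshold of 0.5.
counts = confusion_counts(
    [True, True, False, False, True],
    [0.9, 0.4, 0.6, 0.1, 0.7],
    threshold=0.5,
)
```

Because the counts depend on the threshold, the same data yields a different confusion matrix, precision, and recall for each threshold evaluated.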

[[binary-sc-resources-properties]]
===== {api-definitions-title}

`actual_field` (Required)::
Contributor

Typically this information about which fields are required or optional appears in the API reference page. It's unusual to see it here.

Contributor Author

I removed the notes in fc3ad12.

@szabosteve szabosteve merged commit 74c16ef into elastic:master Jul 11, 2019
@szabosteve szabosteve deleted the dfaresources branch July 11, 2019 16:05
szabosteve added a commit that referenced this pull request Jul 11, 2019
…ntation (#43972)

This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool.
@szabosteve
Contributor Author

Backported to 7.x and 7.3.

Labels
>docs General docs changes :ml/Transform Transform :ml Machine learning v7.4.0 v8.0.0-alpha1
8 participants