Skip to content

[ML] Validate that AucRoc has the data necessary to be calculated #63302

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Oct 8, 2020

Conversation

przemekwitek
Copy link
Contributor

@przemekwitek przemekwitek commented Oct 6, 2020

When AucRoc is evaluated in the context of multiclass classification evaluation, it must require that the probability of the class in question (class_name) is known for every document. Otherwise, the results will not be correct.

This PR tightens the validation so that when the probability of the class in question is not known for at least one document, the evaluation request fails rather than returning erroneous results.

Consequently, AbstractAucRoc.Result.doc_count field is no longer needed so it is removed in this PR as well.

Marking this PR >non-issue as the AucRoc metric is not released yet.

Relates #63306

@przemekwitek przemekwitek force-pushed the auc_roc_remove_doc_count branch 2 times, most recently from 08045f5 to 08a3942 Compare October 6, 2020 08:06
@przemekwitek
Copy link
Contributor Author

run elasticsearch-ci/packaging-sample-windows

@przemekwitek przemekwitek changed the title [ML] Remove AbstractAucRoc.Result.doc_count field [ML] Validate that AucRoc has the data necessary to be calculated Oct 6, 2020
@przemekwitek przemekwitek force-pushed the auc_roc_remove_doc_count branch from 08a3942 to f41a7f5 Compare October 6, 2020 08:27
@przemekwitek
Copy link
Contributor Author

run elasticsearch-ci/packaging-sample-windows

@elasticmachine
Copy link
Collaborator

Pinging @elastic/ml-core (:ml)

@przemekwitek
Copy link
Contributor Author

run elasticsearch-ci/2

@przemekwitek przemekwitek force-pushed the auc_roc_remove_doc_count branch from 16d8a24 to 0fe60e4 Compare October 7, 2020 08:23
@przemekwitek
Copy link
Contributor Author

run elasticsearch-ci/2

@przemekwitek przemekwitek force-pushed the auc_roc_remove_doc_count branch from 0fe60e4 to 175f2a7 Compare October 7, 2020 12:16
@andreidan andreidan added v7.11.0 and removed v7.10.0 labels Oct 7, 2020
@przemekwitek przemekwitek merged commit b0019bd into elastic:master Oct 8, 2020
@przemekwitek przemekwitek deleted the auc_roc_remove_doc_count branch October 8, 2020 06:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants