Skip to content

[ML] Correct missing category handling #1042

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Mar 7, 2020

Conversation

tveasey
Copy link
Contributor

@tveasey tveasey commented Mar 6, 2020

There were some places in the code that assumed categorical features don't have missing values which hadn't been updated as part of #1034. I've also updated the encoding schemes so that missing categories map to missing [1]. Finally, I've also improved unit testing w.r.t. missing values.

Since this is fallout from #1034, which isn't yet released, I'm marking as a non-issue.

Closes #1040.

[1] This may not be the best approach to take, but represents the least effort fix for the SIGSEGV.

Copy link
Contributor

@droberts195 droberts195 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tveasey
Copy link
Contributor Author

tveasey commented Mar 7, 2020

retest

@tveasey tveasey merged commit 25820b6 into elastic:master Mar 7, 2020
@tveasey tveasey deleted the missing-category-sigsegv branch March 7, 2020 21:15
tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Mar 7, 2020
tveasey added a commit that referenced this pull request Mar 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[ML] Error during classification analytics
2 participants