[ML] Improvements to sparse count modelling #721

tveasey · 2019-10-08T11:33:07Z

This simplifies sparse count and sum modelling and migrates to always updating the time series model, but using a weight which decreases in proportion to the number of empty buckets. This means we simply smoothly transition to modelling non-empty buckets for sparse data.

I've also removed the correction to the probability which accounts for the fraction of non-empty buckets: we have the rare function anyway if this is the primary concern. Finally, I changed periodicity testing so that it approximates the old behaviour, i.e. it tends to ignoring empty buckets as their proportion increases.

Closes #696.

edsavage

LGTM - just a few nits noted

lib/maths/CPeriodicityHypothesisTests.cc

lib/maths/CTimeSeriesModel.cc

lib/model/CProbabilityAndInfluenceCalculator.cc

Backport #721.

tveasey added 2 commits October 7, 2019 22:26

Overhaul sparse count and sum

167f0ff

Test fallout

cef8f83

tveasey added >enhancement review :ml affects-results v8.0.0 v7.5.0 labels Oct 8, 2019

tveasey requested a review from edsavage October 8, 2019 11:33

edsavage approved these changes Oct 8, 2019

View reviewed changes

tveasey added 2 commits October 8, 2019 14:52

Docs

4375568

Explicit capture

5d3c3a3

tveasey merged commit 9417dbc into elastic:master Oct 8, 2019

tveasey deleted the sparse-count-modelling branch October 8, 2019 17:57

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Oct 8, 2019

[ML] Improvements to sparse count modelling (elastic#721)

ed98a51

tveasey mentioned this pull request Oct 8, 2019

[7.5][ML] Improvements to sparse count modelling #722

Merged

tveasey added a commit that referenced this pull request Oct 9, 2019

[7.5][ML] Improvements to sparse count modelling (#722)

1da7698

Backport #721.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Improvements to sparse count modelling #721

[ML] Improvements to sparse count modelling #721

Uh oh!

tveasey commented Oct 8, 2019

Uh oh!

edsavage left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[ML] Improvements to sparse count modelling #721

[ML] Improvements to sparse count modelling #721

Uh oh!

Conversation

tveasey commented Oct 8, 2019

Uh oh!

edsavage left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!