[ML] Switch data frame analytics memory estimate from KB to MB #1110

Closed
droberts195 opened this issue Mar 31, 2020 · 1 comment · Fixed by #1126

@droberts195
Contributor

For anomaly detection memory limits/estimates are done in terms of whole megabytes.

For data frame analytics the estimates are done in kilobytes and limits can be as granular as single bytes.

Originally estimates were going to be in tenths of megabytes, because the feeling was that many analyses would use less than a megabyte and rounding up would be too wasteful. Since tenths of megabytes do not round nicely to byte values, we decided to go with kilobytes instead.

However, as #1106 showed, even quite simple analyses are using ~10 MB, so rounding up to the next megabyte would not be a major problem.

Rounding estimates to the nearest megabyte would avoid the excessive precision problem noted in elastic/elasticsearch#54506. It would also improve consistency with anomaly detection.

We would still have to accept limits set in units other than megabytes, as jobs that have such limits will already exist. However, we could nudge future jobs towards using whole numbers of megabytes for their memory limits by always returning estimates as whole numbers of megabytes.
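To illustrate the proposed rounding, here is a minimal C++ sketch (not the actual ml-cpp code; the function name and constant are illustrative) that rounds a low level byte estimate up to the next whole megabyte:

```cpp
#include <cstdint>
#include <iostream>

// Round a raw byte estimate up to the next whole megabyte (1 MB = 1024 * 1024 bytes).
std::uint64_t roundUpToWholeMegabytes(std::uint64_t estimateBytes) {
    constexpr std::uint64_t BYTES_PER_MB = 1024ULL * 1024ULL;
    return (estimateBytes + BYTES_PER_MB - 1) / BYTES_PER_MB;
}

int main() {
    // A ~10 MB low level estimate, as seen in #1106, rounds up to 11 whole megabytes.
    std::cout << roundUpToWholeMegabytes(10'500'000) << "mb\n";
    return 0;
}
```

With reporting always in whole megabytes, estimates would line up with the units anomaly detection already uses for its memory limits.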

/cc @peteharverson

@droberts195
Contributor Author

We discussed this and agreed to switch to MB. I will open a PR to do this.

droberts195 added a commit to droberts195/ml-cpp that referenced this issue Apr 6, 2020
Previously data frame analytics memory estimates were
rounded to the nearest kilobyte, but this resulted in
excessive precision for large analyses. This changes
the estimates to always be reported in whole megabytes,
rounded up from the low level estimate.

Closes elastic#1110
Closes elastic/elasticsearch#54506
droberts195 added a commit that referenced this issue Apr 7, 2020
Previously data frame analytics memory estimates were
rounded to the nearest kilobyte, but this resulted in
excessive precision for large analyses. This changes
the estimates to always be reported in whole megabytes,
rounded up from the low level estimate.

Closes #1110
Closes elastic/elasticsearch#54506