Commit 6653ed0

lcawl and droberts195 committed
[DOCS] Adds ml-cpp PRs to release notes (#52158)
Co-Authored-By: David Roberts <[email protected]>
1 parent f75cad2 commit 6653ed0

1 file changed: +62 -50 lines changed
docs/reference/release-notes/7.6.asciidoc

@@ -76,9 +76,11 @@ Features/Ingest::
 
 Machine Learning::
 * Implement `precision` and `recall` metrics for classification evaluation {pull}49671[#49671] (issue: {issue}48759[#48759])
-* [ML] Explain data frame analytics API {pull}49455[#49455]
-* [ML] ML Model Inference Ingest Processor {pull}49052[#49052]
-* Implement accuracy metric for multiclass classification {pull}47772[#47772] (issue: {issue}48759[#48759])
+* Explain data frame analytics API {pull}49455[#49455]
+* Machine learning model inference ingest processor {pull}49052[#49052]
+* Implement accuracy metric for multi-class classification {pull}47772[#47772] (issue: {issue}48759[#48759])
+* Add feature importance values to classification and regression results (using tree
+SHapley Additive exPlanation, or SHAP) {ml-pull}857[#857]
 
 Mapping::
 * Add per-field metadata. {pull}49419[#49419] (issue: {issue}33267[#33267])
@@ -222,23 +224,43 @@ License::
 * Support "enterprise" license types {pull}49223[#49223] (issue: {issue}48510[#48510])
 
 Machine Learning::
-* [ML] Add audit warning for 1000 categories found early in job {pull}51146[#51146] (issue: {issue}50749[#50749])
-* [ML] Add num_top_feature_importance_values param to regression and classi… {pull}50914[#50914]
-* [ML] Implement force deleting a data frame analytics job {pull}50553[#50553] (issue: {issue}48124[#48124])
-* [ML] Delete unused data frame analytics state {pull}50243[#50243]
+* Add audit warning for 1000 categories found early in job {pull}51146[#51146] (issue: {issue}50749[#50749])
+* Add `num_top_feature_importance_values` param to regression and classification {pull}50914[#50914]
+* Implement force deleting a data frame analytics job {pull}50553[#50553] (issue: {issue}48124[#48124])
+* Delete unused data frame analytics state {pull}50243[#50243]
 * Make each analysis report desired field mappings to be copied {pull}50219[#50219] (issue: {issue}50119[#50119])
-* [ML] retry bulk indexing of state docs {pull}50149[#50149] (issue: {issue}50143[#50143])
-* [ML] Persist/restore state for DFA classification {pull}50040[#50040]
-* [ML] Introduce randomize_seed setting for regression and classification {pull}49990[#49990]
+* Retry bulk indexing of state docs {pull}50149[#50149] (issue: {issue}50143[#50143])
+* Persist/restore state for data frame analytics classification {pull}50040[#50040]
+* Introduce `randomize_seed` setting for regression and classification {pull}49990[#49990]
 * Pass `prediction_field_type` to C++ analytics process {pull}49861[#49861] (issue: {issue}49796[#49796])
-* [ML] Add optional source filtering during data frame reindexing {pull}49690[#49690] (issue: {issue}49531[#49531])
-* [ML] Add default categorization analyzer definition to ML info {pull}49545[#49545]
-* [ML] Add graceful retry for anomaly detector result indexing failures {pull}49508[#49508] (issue: {issue}45711[#45711])
-* Lower minimum model memory limit value from 1MB to 1kB. {pull}49227[#49227] (issue: {issue}49168[#49168])
-* Throw an exception when memory usage estimation endpoint encounters empty data frame. {pull}49143[#49143] (issue: {issue}49140[#49140])
-* Change format of MulticlassConfusionMatrix result to be more self-explanatory {pull}48174[#48174] (issue: {issue}46735[#46735])
-* Make num_top_classes parameter's default value equal to 2 {pull}48119[#48119] (issue: {issue}46735[#46735])
-* [ML] Improve model_memory_limit UX for data frame analytics jobs {pull}44699[#44699]
+* Add optional source filtering during data frame reindexing {pull}49690[#49690] (issue: {issue}49531[#49531])
+* Add default categorization analyzer definition to ML info {pull}49545[#49545]
+* Add graceful retry for anomaly detector result indexing failures {pull}49508[#49508] (issue: {issue}45711[#45711])
+* Lower minimum model memory limit value for data frame analytics jobs from 1MB to 1kB {pull}49227[#49227] (issue: {issue}49168[#49168])
+* Improve `model_memory_limit` user experience for data frame analytics jobs {pull}44699[#44699]
+* Improve performance of boosted tree training for both classification and regression {ml-pull}775[#775]
+* Reduce the peak memory used by boosted tree training and fix an overcounting bug
+estimating maximum memory usage {ml-pull}781[#781]
+* Stratified fractional cross validation for regression {ml-pull}784[#784]
+* Added `geo_point` supported output for `lat_long` function records {ml-pull}809[#809], {pull}47050[#47050]
+* Use a random bag of the data to compute the loss function derivatives for each
+new tree which is trained for both regression and classification {ml-pull}811[#811]
+* Emit `prediction_probability` field alongside prediction field in ml results {ml-pull}818[#818]
+* Reduce memory usage of {ml} native processes on Windows {ml-pull}844[#844]
+* Reduce runtime of classification and regression {ml-pull}863[#863]
+* Stop early training a classification and regression forest when the validation
+error is no longer decreasing {ml-pull}875[#875]
+* Emit `prediction_field_name` in data frame analytics results using the type
+provided as `prediction_field_type` parameter {ml-pull}877[#877]
+* Improve performance updating quantile estimates {ml-pull}881[#881]
+* Migrate to use Bayesian optimisation for initial hyperparameter value line
+searches and stop early if the expected improvement is too small {ml-pull}903[#903]
+* Stop cross-validation early if the predicted test loss has a small chance of
+being smaller than for the best parameter values found so far {ml-pull}915[#915]
+* Optimize decision threshold for classification to maximize minimum class recall {ml-pull}926[#926]
+* Include categorization memory usage in the `model_bytes` field in
+`model_size_stats`, so that it is taken into account in node assignment
+decisions {ml-pull}927[#927] (issue:{ml-issue}724[#724])
 
 Mapping::
 * Add telemetry for flattened fields. {pull}48972[#48972]
@@ -297,11 +319,11 @@ Store::
 * mmap dim files in HybridDirectory {pull}49272[#49272] (issue: {issue}48509[#48509])
 
 Transform::
-* [Transform] Improve force stop robustness in case of an error {pull}51072[#51072]
-* [Transform] add actual timeout in message {pull}50140[#50140]
-* [Transform] automatic deletion of old checkpoints {pull}49496[#49496]
-* [Transform] improve error handling of script errors {pull}48887[#48887] (issue: {issue}48467[#48467])
-* [ML][Transforms] add wait_for_checkpoint flag to stop {pull}47935[#47935] (issue: {issue}45293[#45293])
+* Improve force stop robustness in case of an error {pull}51072[#51072]
+* Add actual timeout in message {pull}50140[#50140]
+* Automatic deletion of old checkpoints {pull}49496[#49496]
+* Improve error handling of script errors {pull}48887[#48887] (issue: {issue}48467[#48467])
+* Add `wait_for_checkpoint` flag to stop {pull}47935[#47935] (issue: {issue}45293[#45293])
 
 
 
@@ -447,28 +469,21 @@ Infra/REST API::
 * Slash missed in indices.put_mapping url {pull}49468[#49468]
 
 Machine Learning::
-* [ML] Fix 2 digit year regex in find_file_structure {pull}51469[#51469]
-* [ML] Validate classification dependent_variable cardinality is at lea… {pull}51232[#51232]
+* Fix 2 digit year regex in find_file_structure {pull}51469[#51469]
+* Validate classification `dependent_variable` cardinality is at least two {pull}51232[#51232]
 * Do not copy mapping from dependent variable to prediction field in regression analysis {pull}51227[#51227]
-* Handle nested and aliased fields correctly when copying mapping. {pull}50918[#50918] (issue: {issue}50787[#50787])
-* [ML] Fix off-by-one error in ml_classic tokenizer end offset {pull}50655[#50655]
-* [ML] Improve uniqueness of result document IDs {pull}50644[#50644] (issue: {issue}50613[#50613])
-* [7.x] Synchronize processInStream.close() call {pull}50581[#50581] (issue: {issue}49680[#49680])
-* Fix accuracy metric {pull}50310[#50310] (issue: {issue}48759[#48759])
+* Handle nested and aliased fields correctly when copying mapping {pull}50918[#50918] (issue: {issue}50787[#50787])
+* Fix off-by-one error in `ml_classic` tokenizer end offset {pull}50655[#50655]
+* Improve uniqueness of result document IDs {pull}50644[#50644] (issue: {issue}50613[#50613])
+* Fix accuracy metric in multi-class confusion matrix {pull}50310[#50310] (issue: {issue}48759[#48759])
 * Fix race condition when stopping a data frame analytics jobs immediately after starting it {pull}50276[#50276] (issues: {issue}49680[#49680], {issue}50177[#50177])
-* Use query in cardinality check {pull}49939[#49939]
-* Make only a part of `stop()` method a critical section. {pull}49756[#49756] (issue: {issue}49680[#49680])
-* Fix expired job results deletion audit message {pull}49560[#49560] (issue: {issue}49549[#49549])
-* [ML] Apply source query on data frame analytics memory estimation {pull}49517[#49517] (issue: {issue}49454[#49454])
-* Stop timing stats failure propagation {pull}49495[#49495]
-* [ML] Fix r_squared eval when variance is 0 {pull}49439[#49439]
-* Blacklist a number of prediction field names. {pull}49371[#49371] (issue: {issue}48808[#48808])
-* Make AnalyticsProcessManager class more robust {pull}49282[#49282] (issue: {issue}49095[#49095])
-* [ML] Fixes for stop datafeed edge cases {pull}49191[#49191] (issues: {issue}43670[#43670], {issue}48931[#48931])
-* [ML] Avoid NPE when node load is calculated on job assignment {pull}49186[#49186] (issue: {issue}49150[#49150])
-* Do not throw exceptions resulting from persisting datafeed timing stats. {pull}49044[#49044] (issue: {issue}49032[#49032])
-* [ML] Deduplicate multi-fields for data frame analytics {pull}48799[#48799] (issues: {issue}48756[#48756], {issue}48770[#48770])
-* [ML] Prevent fetching multi-field from source {pull}48770[#48770] (issue: {issue}48756[#48756])
+* Apply source query on data frame analytics memory estimation {pull}49517[#49517] (issue: {issue}49454[#49454])
+* Fix r_squared eval when variance is 0 {pull}49439[#49439]
+* Blacklist a number of prediction field names {pull}49371[#49371] (issue: {issue}48808[#48808])
+* Make data frame analytics more robust for very short-lived analyses {pull}49282[#49282] (issue: {issue}49095[#49095])
+* Fixes potential memory corruption when determining seasonality {ml-pull}852[#852]
+* Prevent `prediction_field_name` clashing with other fields in {ml} results {ml-pull}861[#861]
+* Include out-of-order as well as in-order terms in categorization reverse searches {ml-pull}950[#950] (issue:{ml-issue}949[#949])
 
 Mapping::
 * Ensure that field collapsing works with field aliases. {pull}50722[#50722] (issues: {issue}32648[#32648], {issue}50121[#50121])
@@ -532,11 +547,11 @@ Snapshot/Restore::
 * Cleanup Concurrent RepositoryData Loading {pull}48329[#48329] (issue: {issue}48122[#48122])
 
 Transform::
-* [Transform] Fix mapping deduction for scaled_float {pull}51990[#51990] (issue: {issue}51780[#51780])
-* [Transform] Fix stats can return old state information if security is enabled {pull}51732[#51732] (issue: {issue}51728[#51728])
-* [Transform] fail to start/put on missing pipeline {pull}50701[#50701] (issue: {issue}50135[#50135])
+* Fix mapping deduction for scaled_float {pull}51990[#51990] (issue: {issue}51780[#51780])
+* Fix stats can return old state information if security is enabled {pull}51732[#51732] (issue: {issue}51728[#51728])
+* Fail to start/put on missing pipeline {pull}50701[#50701] (issue: {issue}50135[#50135])
 * Fix possible audit logging disappearance after rolling upgrade {pull}49731[#49731] (issue: {issue}49730[#49730])
-* [Transform] do not fail checkpoint creation due to global checkpoint mismatch {pull}48423[#48423] (issue: {issue}48379[#48379])
+* Do not fail checkpoint creation due to global checkpoint mismatch {pull}48423[#48423] (issue: {issue}48379[#48379])
 
 
 
@@ -549,6 +564,3 @@ Engine::
 
 Infra/Packaging::
 * Upgrade the bundled JDK to JDK 13.0.2 {pull}51511[#51511]
-
-
-