You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[ML] Line search feature bag fraction for classification and regression model training (#1761)
Following on from #1733, we can get further speedups by line searching for the best feature bag fraction for data sets
where we only need a fraction of the features per tree. For example, training time on Higgs 1M drops from 2585s to
1742s and we actually get a small improvement in accuracy because our hyperparameter search region is better
initialised.
This makes three changes:
1. Adds a line search for the best initial feature bag fraction to use.
2. Adds a small linear penalty at most 1% minimum loss to encourage larger down sample factors and smaller feature
bag fractions.
3. Handles better the case we have many features and relatively few training examples.
0 commit comments