
[ML] Logistic regression loss function for boosted tree training #713


Merged
23 commits merged into elastic:master on Oct 9, 2019

Conversation

@tveasey (Contributor) commented Oct 2, 2019

This implements binomial logistic regression for the boosted tree. In particular, this targets cross entropy and builds a forest to predict the class log-odds.
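To make the loss concrete, here is a minimal, self-contained sketch (illustrative names only, not the actual ml-cpp implementation) of the cross entropy loss evaluated on the log-odds, together with the gradient and curvature a gradient boosted tree implementation typically aggregates per leaf:

```cpp
#include <cmath>

// Illustrative sketch: cross entropy for a single row with actual class
// label y in {0, 1} and predicted log-odds f, where p = sigmoid(f).
double sigmoid(double f) {
    return 1.0 / (1.0 + std::exp(-f));
}

// Cross entropy: -[y * log(p) + (1 - y) * log(1 - p)].
double crossEntropy(double y, double f) {
    double p = sigmoid(f);
    return -(y * std::log(p) + (1.0 - y) * std::log(1.0 - p));
}

// First derivative w.r.t. the log-odds f: p - y.
double gradient(double y, double f) {
    return sigmoid(f) - y;
}

// Second derivative w.r.t. the log-odds f: p * (1 - p).
double curvature(double /*y*/, double f) {
    double p = sigmoid(f);
    return p * (1.0 - p);
}
```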

We should also have been including the sum-square leaf weight penalty in the calculation of the optimum tree leaf values, since the splits are chosen targeting the regularised objective. (Note that for logistic regression the regularisation applies to the log-odds, i.e. we'll shrink the log-odds towards zero and so the predicted probabilities towards 0.5.)

I haven't wired this in yet, since that work depends on #701.
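As an aside, a sketch (not taken verbatim from this PR) of why the penalty shrinks the log-odds: writing the per-row loss as l(y_i, f_i + x) for leaf value x and expanding to second order about the current log-odds f_i,

```latex
\[
  x^* = \operatorname*{arg\,min}_x \Big\{ \sum_i l(y_i, f_i + x) + \lambda x^2 \Big\}
      \approx -\frac{\sum_i g_i}{\sum_i h_i + 2\lambda},
  \qquad g_i = \frac{\partial l}{\partial f_i}, \quad
         h_i = \frac{\partial^2 l}{\partial f_i^2},
\]
```

so increasing lambda pulls the leaf value, and hence the log-odds adjustment, towards zero and the predicted probabilities towards 0.5. (With the squared loss this reduces to the closed form quoted in the review comments below.)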

@valeriy42 (Contributor) left a comment

Good work! I have just minor comments on improving readability.

@tveasey (Contributor, Author) commented Oct 9, 2019

I've now addressed all your review comments. Could you take another look, @valeriy42?

@valeriy42 (Contributor) left a comment

LGTM. Good work on the explanatory comments. I left a couple of minor comments; no need for me to look over it again.

Comment on lines 65 to 73

// We are searching for the value x which minimises
//
// x^* = argmin_x{ sum_i{(a_i - (p_i + x))^2} + lambda * x^2 }
//
// This is convex so there is one minimum where derivative w.r.t. x is zero
// and x^* = 1 / (n + lambda) sum_i{ a_i - p_i }. Denoting the mean prediction
// error m = 1/n sum_i{ a_i - p_i } we have x^* = n / (n + lambda) m.


Good job on explaining what the function does! 👍
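For reference, a hedged sketch (hypothetical names, not the PR's actual code) of the closed form described in this comment:

```cpp
#include <cstddef>
#include <vector>

// Illustrative only: the minimiser of
//   sum_i{(a_i - (p_i + x))^2} + lambda * x^2,
// i.e. x* = n / (n + lambda) * m with m the mean prediction error.
double minimumLossLeafValue(const std::vector<double>& actuals,
                            const std::vector<double>& predictions,
                            double lambda) {
    double n = static_cast<double>(actuals.size());
    double meanError = 0.0;
    for (std::size_t i = 0; i < actuals.size(); ++i) {
        meanError += (actuals[i] - predictions[i]) / n;
    }
    return n / (n + lambda) * meanError;
}
```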

Comment on lines +136 to +137
// This is true if and only if all the predictions were identical. In this
// case we only need one pass over the data and can compute the optimal

👍

Comment on lines +175 to +178
// zero to close to one. In particular, the idea is to minimize the leaf
// weight on an interval [a, b] where if we add "a" the log-odds for all
// rows <= -5, i.e. max prediction + a = -5, and if we add "b" the log-odds
// for all rows >= 5, i.e. min prediction + b = 5.

Nice explanation! 👍
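For illustration, a small sketch (hypothetical names, not the PR's code) of how such a bracketing interval could be computed from the current log-odds predictions; any minimiser of the leaf objective then lies inside [a, b], so a one-dimensional search over that interval suffices:

```cpp
#include <algorithm>
#include <utility>
#include <vector>

// Illustrative only: choose [a, b] so that adding "a" drives every row's
// log-odds to <= -5 and adding "b" drives every row's log-odds to >= 5.
std::pair<double, double>
leafValueSearchInterval(const std::vector<double>& logOddsPredictions) {
    auto [minIt, maxIt] = std::minmax_element(logOddsPredictions.begin(),
                                              logOddsPredictions.end());
    double a = -5.0 - *maxIt; // max prediction + a = -5
    double b = 5.0 - *minIt;  // min prediction + b =  5
    return {a, b};
}
```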

Co-Authored-By: Valeriy Khakhutskyy <[email protected]>
@tveasey merged commit 60c9e02 into elastic:master on Oct 9, 2019
@tveasey deleted the logistic-regression branch on October 9, 2019 at 13:26
tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Oct 11, 2019
2 participants