Add cancellation checkpoint in logistic regression. #3032
Conversation
Codecov Report
```diff
@@            Coverage Diff            @@
##           master     #3032    +/-   ##
==========================================
+ Coverage    72.41%    72.5%   +0.09%
==========================================
  Files          803      804       +1
  Lines       143851   144080     +229
  Branches     16173    16179       +6
==========================================
+ Hits        104171   104467     +296
+ Misses       35258    35197      -61
+ Partials      4422     4416       -6
```
What are the performance implications here?
```diff
@@ -475,6 +475,7 @@ private protected virtual void TrainCore(IChannel ch, RoleMappedData data)
     e => e.SetProgress(0, exCount, totalCount));
 while (cursor.MoveNext())
 {
+    Host.CheckAlive();
     WeightSum += cursor.Weight;
```
I feel this is not the only place where we need a checkpoint.
Yep, added one more in the line search `Minimize` function.
This looks very suspicious. Could you add some checkpoints to this function? I also feel we need to profile the algorithms before adding checkpoints.

Refers to: src/Microsoft.ML.StandardTrainers/Standard/LogisticRegression/LbfgsPredictorBase.cs:567 in 5540101.
```diff
@@ -475,6 +475,7 @@ private protected virtual void TrainCore(IChannel ch, RoleMappedData data)
     e => e.SetProgress(0, exCount, totalCount));
 while (cursor.MoveNext())
 {
+    Host.CheckAlive();
```
> `Host.CheckAlive();`
Is it too much to do it on every row fetch? Would it be enough to do it every 10 cursor moves, or some other number > 1? (I don't know if there are any best practices on how to determine the frequency of checks, maybe from CancellationToken implementations.)
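For illustration, a throttled variant of the suggestion could look something like the sketch below. The counter variable, the interval of 10, and the surrounding loop shape are illustrative only, not code from this PR:

```csharp
// Hypothetical throttled liveness check (counter name and interval are made up).
int checkCounter = 0;
while (cursor.MoveNext())
{
    // Pay for the liveness check only once every 10 rows.
    if (++checkCounter % 10 == 0)
        Host.CheckAlive();

    WeightSum += cursor.Weight;
    // ... rest of the per-row training work ...
}
```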
And how is that any more efficient than what we have now? You would still end up executing an if condition on every row fetch. Based on my analysis of the current solution, this doesn't add any significant overhead.

CancellationToken works differently: you register a callback with it, and when a signal is sent it invokes that callback so you can do the work to gracefully shut down a process. Our plan is to implement cancellation tokens post 1.0.
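For context, here is a minimal sketch of the callback-based CancellationToken pattern described above. The trainer class and loop are invented for illustration; only the `CancellationToken` API calls are real:

```csharp
using System;
using System.Threading;

class TrainerSketch
{
    public void Train(CancellationToken token)
    {
        // Register a callback that runs when cancellation is signaled,
        // e.g. to flush state or release resources for a graceful shutdown.
        using (token.Register(() => Console.WriteLine("Cancellation requested; shutting down.")))
        {
            for (int i = 0; i < 1_000_000; i++)
            {
                // Cooperative check: throws OperationCanceledException if canceled.
                token.ThrowIfCancellationRequested();
                // ... per-iteration training work ...
            }
        }
    }
}

// Usage: the caller owns the CancellationTokenSource and signals cancellation.
// var cts = new CancellationTokenSource();
// new TrainerSketch().Train(cts.Token);
// cts.Cancel();
```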
We spoke offline. I think this is the best we can do until we get cancellation tokens into the mix. `CheckAlive` only checks a `bool` property, so it's probably faster than checking whether it's the 10th iteration or not.
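For reference, the check being discussed presumably boils down to something like the following simplified sketch. The field name and exception message are made up; the real ML.NET host implementation may differ:

```csharp
// Simplified "check a bool, throw if stopped" liveness check (illustrative only).
private volatile bool _isCanceled;

public void CheckAlive()
{
    // A single, almost always false, bool read per call; throws only once
    // the environment has been marked as canceled.
    if (_isCanceled)
        throw new OperationCanceledException("Host was canceled; stopping training.");
}
```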
To add some late flavor to this: the branch predictor should be slightly better at the (almost perfectly) constant `bool` property than at the return value of `iteration % 10` (which also hides a division; checking every 8 instead would let the compiler optimize it to a bitwise AND). That said, there's the overhead of the `CheckAlive()` function call, which may be greater if it is not inlined.
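To make that micro-optimization concrete, here is a purely illustrative fragment; none of this is in the PR, and the inlining attribute is only an assumption about how the call overhead could be avoided:

```csharp
// Checking "every 10th iteration" needs a modulo, i.e. a hidden division:
if (i % 10 == 0)
    Host.CheckAlive();

// Checking "every 8th iteration" can be compiled to a bitwise AND,
// because 8 is a power of two:
if ((i & 7) == 0)
    Host.CheckAlive();

// If the plain bool check is kept, the call overhead could be reduced by
// hinting the JIT to inline it (requires System.Runtime.CompilerServices):
// [MethodImpl(MethodImplOptions.AggressiveInlining)]
// public void CheckAlive() { ... }
```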
Fixes #3031.
Please read the issue before reviewing this PR.