Add new benchmarks to test\Microsoft.ML.Benchmarks #722

briancylui · 2018-08-23T18:57:08Z

Submitting @yaeldekel's new benchmarks via this PR.

Added a new benchmark for KMeans and Logistic Regression (LR) under test\Microsoft.ML.Benchmarks
Added a new sentiment test inside the existing SDCA benchmark

eerhardt · 2018-08-23T19:20:55Z

test/Microsoft.ML.Tests/ScenariosWithDirectInstantiation/SentimentPredictionTests.cs

@@ -8,6 +8,7 @@
 using Microsoft.ML.Runtime.Data;
 using Microsoft.ML.Runtime.FastTree;
 using Microsoft.ML.Runtime.Internal.Calibration;
+using Microsoft.ML.Runtime.Learners;


The changes in this file can be reverted. No need to change this file.

The changes in this file are mostly refactoring that improves the style. There are actually multiple redundant usings here that I am going to remove in my next commit.

In general, this kind of style/clean-up change should not happen as part of another PR. It distracts from the real change in this PR, and it also clutters history: when someone looks at all the changes to this file, they see this change, and when someone looks at this commit in the history, they find this needless change included as well.

I will revert the changes - thanks!

eerhardt · 2018-08-23T19:22:03Z

test/Microsoft.ML.Benchmarks/KMeansAndLogisticRegressionBench.cs

+        public void Setup()
+        {
+            s_dataPath = Program.GetDataPath("adult.train");
+            StochasticDualCoordinateAscentClassifierBench.s_metrics = Models.ClassificationMetrics.Empty;


This seems like unfortunate coupling that I don't think we want. Can this be removed?

+1


eerhardt · 2018-08-23T19:23:00Z

test/Microsoft.ML.Benchmarks/KMeansAndLogisticRegressionBench.cs

+            }
+        }
+
+        public class IrisData


I don't believe IrisData and IrisPrediction are used. Can they be removed?

eerhardt · 2018-08-23T19:25:06Z

test/Microsoft.ML.Benchmarks/StochasticDualCoordinateAscentClassifierBench.cs

+            {
+                // Pipeline
+                var loader = new TextLoader(env,
+                new TextLoader.Arguments()


(nit) this should be indented since it is a continuation of the line above.

Thanks! I also indented the subsequent lines since they are actually all inside one pair of parentheses, which may not be apparent from the current code style.

eerhardt · 2018-08-23T19:27:58Z

src/Microsoft.ML/Models/ClassificationMetrics.cs

@@ -15,6 +15,7 @@ namespace Microsoft.ML.Models
    /// </summary>
    public sealed class ClassificationMetrics
    {
+        public static ClassificationMetrics Empty = new ClassificationMetrics();


I don't think we should be adding this public API in this PR.

Zruty0

eerhardt

Thanks @briancylui

eerhardt · 2018-08-23T22:41:51Z

test OSX10.13 Debug
test OSX10.13 Release

briancylui · 2018-08-24T00:37:08Z

@eerhardt: dotnet build -c Release is fine, but dotnet run -c Release throws an exception. Deleting the two lines about s_metrics leaves the GetValue(…) method in ClassificationMetricsColumn not well-defined.

briancylui · 2018-08-24T08:33:26Z

Copying a relevant comment from #724:

Regarding the perf results posted on #724: KMeansAndLogisticRegression (KMeans+LR) shows the same AccuracyMacro as SDCA (0.98). However, the GetValue method of Program.cs:ClassificationMetricsColumn only references StochasticDualCoordinateAscentClassifierBench.s_metrics, which is irrelevant to KMeans+LR, so KMeans+LR may have displayed SDCA's AccuracyMacro as its own metric in the perf results. When I ran dotnet run -c Release and chose KMeans+LR only, the AccuracyMacro was displayed as 0. It may have something to do with the added public variable ClassificationMetrics Empty.
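The suspected mix-up can be modelled in isolation. The sketch below is a simplified stand-in, not the actual code in Program.cs: `Metrics`, `SdcaBench`, `KMeansLrBench`, `MetricColumn` and `SharedMetricsDemo` are hypothetical names, and the column is assumed to read one static field no matter which benchmark the row belongs to.

```csharp
using System;
using System.Globalization;

// Hypothetical stand-in for Models.ClassificationMetrics.
public sealed class Metrics
{
    public double AccuracyMacro { get; set; }
}

// Hypothetical stand-in for StochasticDualCoordinateAscentClassifierBench.
public static class SdcaBench
{
    // Shared static state that the column below reads.
    public static Metrics s_metrics = new Metrics();

    public static void Run() => s_metrics = new Metrics { AccuracyMacro = 0.98 };
}

// Hypothetical stand-in for the KMeans+LR benchmark.
public static class KMeansLrBench
{
    // Does its own work but never writes to SdcaBench.s_metrics.
    public static void Run() { }
}

public static class MetricColumn
{
    // Assumed behaviour: the benchmark name is ignored and the
    // SDCA field is always the source of the displayed value.
    public static string GetValue(string benchmarkName) =>
        SdcaBench.s_metrics.AccuracyMacro.ToString("0.00", CultureInfo.InvariantCulture);
}

public static class SharedMetricsDemo
{
    public static (string alone, string afterSdca) Run()
    {
        // Case 1: only KMeans+LR runs, so the shared field is still empty.
        SdcaBench.s_metrics = new Metrics();
        KMeansLrBench.Run();
        string alone = MetricColumn.GetValue("KMeansLr");

        // Case 2: SDCA runs first, then KMeans+LR; the row for
        // KMeans+LR now shows the value SDCA left behind.
        SdcaBench.Run();
        KMeansLrBench.Run();
        string afterSdca = MetricColumn.GetValue("KMeansLr");

        return (alone, afterSdca);
    }
}
```

Under these assumptions, `SharedMetricsDemo.Run()` yields `"0.00"` when KMeans+LR runs alone and `"0.98"` when SDCA ran earlier in the same process, which matches both observations above.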

eerhardt · 2018-08-24T14:58:46Z

@briancylui - do you think there are changes from #724 that should be brought over here?

@adamsitnik - I see the existing benchmark test has a bit of coupling between the Program.cs:ClassificationMetricsColumn and the StochasticDualCoordinateAscentClassifierBench test. Is it possible to have metrics columns that are specific to a set of tests? For example, the current test is a classification test, and those metrics only apply to that test. The new tests being added should have different metrics.
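One possible shape for test-specific metrics, as a rough sketch only: each benchmark reports values under its own type, and a column looks them up by type, showing a placeholder where a metric does not apply. `BenchmarkMetrics`, `SdcaBenchSketch` and `KMeansLrBenchSketch` are hypothetical names; this is not BenchmarkDotNet API and not necessarily the approach taken in #735. In a real column, the benchmark type would have to come from whatever the benchmarking framework passes to the column.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;

// Hypothetical registry: metric values are keyed by (benchmark type, metric name),
// so one benchmark can never display another benchmark's numbers.
public static class BenchmarkMetrics
{
    private static readonly Dictionary<(Type, string), double> s_values =
        new Dictionary<(Type, string), double>();

    public static void Report(Type benchmark, string metric, double value) =>
        s_values[(benchmark, metric)] = value;

    // Returns "-" when the benchmark never reported this metric.
    public static string GetValue(Type benchmark, string metric) =>
        s_values.TryGetValue((benchmark, metric), out double v)
            ? v.ToString("0.00", CultureInfo.InvariantCulture)
            : "-";
}

// Hypothetical classification benchmark that reports AccuracyMacro.
public sealed class SdcaBenchSketch
{
    public void Run() =>
        BenchmarkMetrics.Report(typeof(SdcaBenchSketch), "AccuracyMacro", 0.98);
}

// Hypothetical benchmark for which AccuracyMacro is not reported.
public sealed class KMeansLrBenchSketch
{
    public void Run() { }
}
```

With this layout, after both benchmarks run, looking up `AccuracyMacro` for `SdcaBenchSketch` gives `"0.98"` while the same lookup for `KMeansLrBenchSketch` gives `"-"`, so no static field on one benchmark class needs to be reset from another.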

adamsitnik · 2018-08-27T04:45:30Z

@briancylui @eerhardt I have solved this issue in #735

yaeldMS added 4 commits August 23, 2018 11:26

Add sentiment test with SDCA.

b0f3d2c

Add a test with KMeans and LR.

3f48dd4

Add LR training to test

7e9651a

Convert unit tests to benchmark tests, with merge conflicts fixed.

edaa160

eerhardt requested review from Ivanidzo4ka, codemzs, TomFinley, justinormont and Zruty0 August 23, 2018 19:19

eerhardt reviewed Aug 23, 2018


Zruty0 approved these changes Aug 23, 2018


Respond to PR feedback

6fa909e

eerhardt approved these changes Aug 23, 2018


Respond to PR feedback: Revert changes to SentimentPredictionTests.cs

d749c8c

eerhardt mentioned this pull request Aug 24, 2018

Benchmarks created by @yaeldekel #724

Closed

Zruty0 merged commit 4fd8a9c into dotnet:master Aug 24, 2018

ghost locked as resolved and limited conversation to collaborators Mar 29, 2022
