API reference - Finalize docs template for trainer API & estimators #3218

shmoradims · 2019-04-05T18:01:18Z

Currently API reference documentation for each trainer is split into two pages: 1) the creation method, and 2) the trainer estimator class. We also have a 3rd page for trainer options. In this issue, I want to reach a consensus about the content that goes in each page. The current proposal is as follows:

Page-1 - Creation extension methods

These methods act as the constructor for the trainer estimator class. There are two overloads per trainer and they're listed as extension methods in a MLContext trainer catalog. E.g. BinaryClassificationCatalog.BinaryClassificationTrainers.

Both overloads also show up in the same page for the extension class; we call this page-1 (e.g. LightGbm; please note that this page includes all LightGbm overloads including multiclass, ranking, etc and not just binary classification versions).

Summary

1-liner summary of what the trainer does, then "cref=the estimator class, i.e. page-2"
Training algorithm details are not here, and are included in page-2, so that other overloads of this APIs share the same content.

Remarks

[Gleb: add optional description for current overload]

Parameters

Parameters are defined

Example

One example is provided for this API (one per overload)

Page-2 - Trainer Estimator Class

This is the page for trainer estimator class. E.g. LightGbmBinaryTrainer

Summary

1-liner summary of what the trainer does with "cref=IEstimator(TTransformer)". [Gleb: add info on when it is good to use it. - answer the WHY question.] [Gleb: Add link to options in summary]
Training algorithm details are not in the summary.

Remarks

Note about creation: "For creating this trainer please see "cref to both overload methods from page-1"

Easy properties:

Machine learning task: (redundant?)
Expected label type: bool, etc
Output columns: "Score", "PredictedLabel", etc with description of what each does
Is normalization required? Yes/No
Is caching required? Yes/No
Is convertible to Onnx format? Yes/No
Additional NuGet: "Link to NuGet" OR None of all that are included already in Microsoft.ML

Complex properties:

Trainer Category (requires some taxonomy to be created)
When to use this trainer? [what goes here?]
Supported number of features?
Supported number of examples?

Training algorithm details with all the reference links.

Example

Repeat example from overload-1 of page-1
Repeat example from overload-2 of page-1

Page-3 - Trainer Options Class

This is the page for trainer options class that's used in one of the overloads in page-1. E.g. AveragedPerceptronTrainer.Options

Page-1 already links to this page from the type of the option parameter.

Summary

Options for "cref=page-2" as used in method "cref=page-1/overload-with-option".

Remarks

None

Example

None (page-1 already includes an example for using the options)

Parameters

Parameters are defined

Page-1 - Creation extension methods

Page-1-overload1
Page-1-overload2

Above links are section-links on the same method page

Summary

[For overload with params] Create an <cref=AveragedPerceptronTrainer>, which predicts a target using a linear binary classification model trained over boolean label data.
[For overload with options] Create an <cref=AveragedPerceptronTrainer> with advanced options, which predicts a target using a linear binary classification model trained over boolean label data.

Remarks

None

Parameters

Parameters are defined as usual. Extra info is added to the following params:

labelColumnName: Add: The column data must be of type <cref=System.Boolean>
featureColumnName: Add: The column data must be a known-sized vector of <cref="System.Single">.

Example

One example is provided for this API (one per overload)

Page-2 - Trainer Estimator Class

AveragedPerceptronTrainer

Summary

The <cref=IEstimator(TTransformer)> to predict a target using a linear binary classification model trained with the averaged perceptron.

Remarks

To create this trainer, use <xref=overload1> or <xref=overload2>. [we're using xref instead of cref to have short anchor text for the links. cref links for overloads are super long and unreadable]

[Input / Output is common for ML tasks. we reuse it by importing external files]

Input / Output Columns

The input label column data must be of type <xref=System.Single>.
This trainer outputs the following columns:

Output Column Name	Column Type	Description
`Score`	<cref=System.Single>	The unbounded score that was calculated by the trainer to determine the prediction.
`PredictedLabel`	<cref=System.Boolean>	The label predicted by the trainer. `false` maps to negative score and `true` maps to positive score.
`Probability`	<cref=System.Single>	The probability of the score in range [0,1].

Trainer Characteristics


Machine learning task	Binary classification
Is normalization required?	Yes/No
Is caching required?	Yes/No
Required NuGet in additional to Microsoft.ML	"Link to NuGet" OR None if already part of Microsoft.ML

Training Algorithm Details

Training algorithm is described here with references.

Example

None (we already have samples in Page-1; we're not repeating them here)

Page-3 - Trainer Options Class

AveragedPerceptronTrainer.Options

Summary

Options for <cref=AveragedPerceptronTrainer> as used in method <xref=overload2-with-option>

Remarks

None

Example

None (page-1 already includes an example for using the options)

Parameters

Parameters are defined

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API reference - Finalize docs template for trainer API & estimators #3218

API reference - Finalize docs template for trainer API & estimators #3218

shmoradims commented Apr 5, 2019 •

edited

Loading

This comment has been minimized.

shmoradims commented Apr 10, 2019 •

edited

Loading

This comment has been minimized.

shmoradims commented Apr 25, 2019

API reference - Finalize docs template for trainer API & estimators #3218

API reference - Finalize docs template for trainer API & estimators #3218

Comments

shmoradims commented Apr 5, 2019 • edited Loading

Page-1 - Creation extension methods

Summary

Remarks

Parameters

Example

Page-2 - Trainer Estimator Class

Summary

Remarks

Example

See also

Page-3 - Trainer Options Class

Summary

Remarks

Example

Parameters

See also

This comment has been minimized.

shmoradims commented Apr 10, 2019 • edited Loading

Page-1 - Creation extension methods

Summary

Remarks

Parameters

Example

Page-2 - Trainer Estimator Class

Summary

Remarks

Input / Output Columns

Trainer Characteristics

Training Algorithm Details

Example

See also

Page-3 - Trainer Options Class

Summary

Remarks

Example

Parameters

See also

This comment has been minimized.

shmoradims commented Apr 25, 2019

shmoradims commented Apr 5, 2019 •

edited

Loading

shmoradims commented Apr 10, 2019 •

edited

Loading