src/Microsoft.ML.Transforms/Text/NgramHashingTransformer.cs (+2 -2)
@@ -866,11 +866,11 @@ public VBuffer<ReadOnlyMemory<char>>[] SlotNamesMetadata(out VectorDataViewType[
 /// | | |
 /// | -- | -- |
 /// | Does this estimator need to look at the data to train its parameters? | Yes |
-/// | Input column data type | Vector of [Key](<xref:Microsoft.ML.Data.KeyDataViewType>) |
+/// | Input column data type | Vector of [Key](xref:Microsoft.ML.Data.KeyDataViewType) |
 /// | Output column data type | Vector of known size of <xref:System.Single> |
 ///
 /// The resulting <xref:Microsoft.ML.Transforms.Text.NgramHashingTransformer/> creates a new column, named as specified in the output column name parameters, and
-/// produces a vector of counts of n-grams (sequences of consecutive words of length 1-n) from a given data.
+/// produces a vector of n-gram counts (sequences of consecutive words of length 1-n) from a given data.
 /// It does so by hashing each n-gram and using the hash value as the index in the bag.
 ///
 /// <xref:Microsoft.ML.Transforms.Text.NgramHashingEstimator> is different from <xref:Microsoft.ML.Transforms.Text.WordHashBagEstimator>
 /// <param name="outputColumnName">Name of the column resulting from the transformation of <paramref name="inputColumnName"/>.</param>
-/// <param name="inputColumnName">Name of the column to transform. If set to <see langword="null"/>, the value of the <paramref name="outputColumnName"/> will be used as source.</param>
+/// <remarks>
+/// <see cref="WordBagEstimator"/> is different from <see cref="NgramExtractingEstimator"/> in that the former
+/// tokenizes text internally and the latter takes tokenized text as input.
 /// <param name="skipLength">Maximum number of tokens to skip when constructing an ngram.</param>
 /// <param name="useAllLengths">Whether to include all ngram lengths up to <paramref name="ngramLength"/> or only <paramref name="ngramLength"/>.</param>
@@ -346,12 +352,18 @@ public static WordBagEstimator ProduceWordBags(this TransformsCatalog.TextTransf
 /// <param name="skipLength">Maximum number of tokens to skip when constructing an ngram.</param>
 /// <param name="useAllLengths">Whether to include all ngram lengths up to <paramref name="ngramLength"/> or only <paramref name="ngramLength"/>.</param>
@@ -369,12 +381,18 @@ public static WordBagEstimator ProduceWordBags(this TransformsCatalog.TextTransf
 /// <param name="outputColumnName">Name of the column resulting from the transformation of <paramref name="inputColumnName"/>.</param>
-/// <param name="inputColumnName">Name of the column to transform. If set to <see langword="null"/>, the value of the <paramref name="outputColumnName"/> will be used as source.</param>
+/// <remarks>
+/// <see cref="WordHashBagEstimator"/> is different from <see cref="NgramHashingEstimator"/> in that the former
+/// tokenizes text internally and the latter takes tokenized text as input.
 /// <param name="outputColumnName">Name of the column resulting from the transformation of <paramref name="inputColumnNames"/>.</param>
-/// <param name="inputColumnNames">Name of the columns to transform. If set to <see langword="null"/>, the value of the <paramref name="outputColumnName"/> will be used as source.</param>
+/// <remarks>
+/// <see cref="WordHashBagEstimator"/> is different from <see cref="NgramHashingEstimator"/> in that the former
+/// tokenizes text internally and the latter takes tokenized text as input.
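
Note on the NgramHashingTransformer doc comment: the sentence "hashing each n-gram and using the hash value as the index in the bag" can be illustrated with a small standalone sketch. This is not ML.NET code. The class `HashedNgramSketch`, its `Count` method, and the FNV-1a hash are all hypothetical stand-ins chosen for brevity, and the sketch takes string tokens where the real transformer takes a vector of keys. It assumes only that the bag has 2^numberOfBits slots and that all n-gram lengths from 1 to the maximum are counted.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class HashedNgramSketch
{
    // FNV-1a over UTF-16 code units. A stand-in hash for illustration only;
    // it is an assumption here, not the hash the transformer uses.
    private static uint Fnv1a(string text)
    {
        unchecked
        {
            uint hash = 2166136261;
            foreach (char c in text)
            {
                hash ^= c;
                hash *= 16777619;
            }
            return hash;
        }
    }

    // Counts every n-gram of length 1..maxLength into a fixed-size vector.
    // The slot for an n-gram is its hash masked to numberOfBits bits, so
    // distinct n-grams may collide and share a slot.
    public static float[] Count(IReadOnlyList<string> tokens, int maxLength, int numberOfBits)
    {
        var bag = new float[1 << numberOfBits];
        uint mask = (uint)bag.Length - 1;
        for (int start = 0; start < tokens.Count; start++)
        {
            for (int length = 1; length <= maxLength && start + length <= tokens.Count; length++)
            {
                string ngram = string.Join(" ", tokens.Skip(start).Take(length));
                bag[Fnv1a(ngram) & mask] += 1f;
            }
        }
        return bag;
    }

    public static void Main()
    {
        // Already-tokenized input, as the n-gram hashing estimator expects.
        var bag = Count(new[] { "the", "quick", "brown", "fox" }, maxLength: 2, numberOfBits: 8);
        Console.WriteLine($"slots: {bag.Length}, total n-grams counted: {bag.Sum()}");
    }
}
```

With four tokens and a maximum length of 2, the sketch counts four unigrams and three bigrams, so the entries of the 256-slot vector sum to 7 regardless of collisions. The output size depends only on the number of bits, which is consistent with the "Vector of known size" output type in the table above.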