-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Any reason for inconsistency in params for TextLoader? #2472
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@rogancarr and @sfilipi, should be made consistent. |
@TomFinley . Could you kindly clarify what you mean by should be made consistent ? To me it seems reasonable that the "simple" APIs (1 and 2 above) take a If a user wants to specify multiple separator characters they can use the "advanced" API which takes in the machinelearning/src/Microsoft.ML.Data/DataLoadSave/Text/TextLoaderSaverCatalog.cs Lines 39 to 41 in eecf272
|
Yes @abgoswam I agree, the convenience methods should continue to take Of particular interest to me was the lack of these: bool allowQuotedStrings = TextLoader.DefaultArguments.AllowQuoting,
bool supportSparse = TextLoader.DefaultArguments.AllowSparse,
bool trimWhitespace = TextLoader.DefaultArguments.TrimWhitespace) It is not clear to me that the overloads should differ in this respect. Also why does one have the possibility of handling a file handle (which enables it to read feature names into the schema) and the other does not? So these two overloads are inconsistent leading to loss of capability in one or the other, and I think this should be made consistent. |
Also, see #2512: We only support a limited set of characters for saving a text file, so perhaps we should move away from an explicit |
This Method :
machinelearning/src/Microsoft.ML.Data/DataLoadSave/Text/TextLoaderSaverCatalog.cs
Line 55 in 834e471
and this one :
machinelearning/src/Microsoft.ML.Data/DataLoadSave/Text/TextLoaderSaverCatalog.cs
Line 22 in 834e471
Looks like the first one has params like :
Where as the second one doesn't have those. In order to use those params there is another method overload :
This takes Arguments class and particularly one param is interesting in this Arguments:
Any reason this is char[] and not char like in other overloads? and why inconsistency in params for the methods 1) and 2).
The text was updated successfully, but these errors were encountered: