-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Error due to ShuffleTransform in pipeline. #1106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
It should not give this error during training, only during scoring, right? |
Hi @zeahmed , different objection... Shuffle transform simply cannot be a row to row mapper. The two notions are incompatible. One is based on the idea that it is applicable to situations where an operation can be applied as an operation of one row to another row. One has to do with the permutation of the rows themselves. So, cannot be done, sorry. Perhaps if you were to describe your scenario, we could say what the actual problem is? |
@Zruty0, I will copy exact exception. It is applied during training. I have a parameter to set shuffle on/off during training. Its a good practice to shuffle the training data before each iteration to reduce variance and skip local minimum. @TomFinley, yes the title got wrong. It should be "Convert ShuffleTransform to Transformer/Estimator". The idea is to have option for user to shuffle the data on each iteration i.e. whenever I open a new cursor on dataview I should get the data in different ordering. I have done it previously with direct instantiation. Since the design has changed I think it needs conversion or there should be some other way to used it. Let me know if you guys know the other way to use it? |
Ah. Thanks @zeahmed ! Then I have another objection, these things should just not be estimators/transformers. I feel pretty strongly about this: See #933 . This has been the cause of a large amount of suffering in the past. It should still exist, but purely an operation over |
So the question is; does this or other similar transforms need conversion? if not are they supposed to work as-is? or this is the work in progress? |
Need to convert
ShuffleTransform
into Transformer/Estimator design. Currently, when it is used inTensorFlowTransform
to enable shuffling of data during training, it gives error regardingShuffleTransform is not RowToRowMapper
.Update
It seems like the transform works fine during training. The error is somewhere when creating prediction engine.
The following is the exact location where assertion fails during creation of prediction engine.
machinelearning/src/Microsoft.ML.Data/DataLoadSave/TransformWrapper.cs
Line 133 in 8ca1c93
Here is the test failure log
The text was updated successfully, but these errors were encountered: