As noted in #933, the BootstrapSamplingTransform isn't something that we want recorded in pipelines (e.g. having it executed on a test set). However, it is super useful to have in our library, so it would be nice to have a way to use it in user-facing code.
Right, thanks Rogan. As mentioned in the issue, it strikes me as being kind of "filter-like" (a very special sort of probabilistic filter), so it probably belongs alongside the others in DataOperationsCatalog.
Two arguments? int? seed=null (if null, the seed is taken from, I guess, the catalog's environment), and a bool to control whether we want the complement of the sample or not, so you could get a training set and a test set. I think that's pretty much all we'd need for v1, and it would be pretty easy to add.
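To make the two-argument idea concrete, here is a rough, language-agnostic sketch in Python. It is not the library's implementation or API: the function name `bootstrap_sample` and the parameter names `seed` and `complement` are hypothetical, and falling back to OS entropy when `seed` is None merely stands in for "taken from the catalog's environment".

```python
import random
from typing import List, Optional, Sequence, TypeVar

T = TypeVar("T")


def bootstrap_sample(
    rows: Sequence[T],
    seed: Optional[int] = None,
    complement: bool = False,
) -> List[T]:
    """Hypothetical sketch of a bootstrap filter with a complement switch.

    seed=None lets Python seed from OS entropy; this is an assumption
    standing in for a seed supplied by the surrounding environment.
    """
    rng = random.Random(seed)
    n = len(rows)
    # Draw n row indices uniformly at random, with replacement.
    drawn = [rng.randrange(n) for _ in range(n)]
    if complement:
        # Out-of-bag rows: those never drawn, kept in original order.
        chosen = set(drawn)
        return [row for i, row in enumerate(rows) if i not in chosen]
    # In-bag rows: duplicates are expected in a bootstrap sample.
    return [rows[i] for i in drawn]


rows = list(range(100))
train = bootstrap_sample(rows, seed=42)
test = bootstrap_sample(rows, seed=42, complement=True)
```

Calling it twice with the same seed, once with the complement flag set, gives a training set and a disjoint test set, which is the use case described above.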