Skip to content

How to dump intermediate data in pipeline? #617

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
WladdGorshenin opened this issue Jul 31, 2018 · 6 comments
Closed

How to dump intermediate data in pipeline? #617

WladdGorshenin opened this issue Jul 31, 2018 · 6 comments
Labels
API Issues pertaining the friendly API question Further information is requested

Comments

@WladdGorshenin
Copy link

Hi,

What would be the best way to get intermediate data in a pipeline? I'd like to debug data transformation steps.

Looking forward for reply

@rauhs
Copy link
Contributor

rauhs commented Aug 2, 2018

I just want to note that I'd also very much appreciate a way to dump the intermediate data view in the pipeline. The more transformations I do the less I'm certain about how this all plays together. Even a simple way to just print the data as JSON would be great. Then one can feed the pipeline 2-3 artificial instances and see the pipeline result.

@Zruty0
Copy link
Contributor

Zruty0 commented Aug 5, 2018

It is great to see interest in this scenario, as this is something we specifically want to enable in #583 via #581

@shauheen shauheen added question Further information is requested API Issues pertaining the friendly API labels Aug 6, 2018
@WladdGorshenin
Copy link
Author

Hi,

any updates on the topic?

@Zruty0
Copy link
Contributor

Zruty0 commented Aug 27, 2018

We are working on the new API: https://github.com/dotnet/machinelearning/projects/9

After this is done, you will be able to inspect the intermediate results of pipeline steps much similar to this test:
https://github.com/dotnet/machinelearning/blob/master/test/Microsoft.ML.Tests/Scenarios/Api/Estimators/FileBasedSavingOfData.cs

We will add SaveAsText extension to save data as text too.
We also have some plans to build custom Visual Studio visualizers for IDataView, which would enable you to see the values in the debugger.

@abgoswam
Copy link
Member

We now have a couple of unit tests showcasing this functionality . Please check out New_Visibility and Visibility

DRI RESPONSE : Planning to close this issue (resolved)

@ghost ghost locked as resolved and limited conversation to collaborators Mar 29, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
API Issues pertaining the friendly API question Further information is requested
Projects
None yet
Development

No branches or pull requests

6 participants