-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Model Size is big in auto sklearn #1359
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
This is likely due to all the imports, saved predictions and everything else we use for optimization bundled into the object. I would not use the full auto-sklearn model in production at the moment and instead try to export or retrain the found models. |
@eddiebergman You mean I will get the best configurations and create an sklearn ensemble model out of it. |
There are ways to do that by extracting out the models, this is made easier in a recent PR by @userfindingself in #1321. This is available in the development branch, otherwise you can view the code there to suit your needs. We would like an export option down the road to eventually export more production ready models. |
@eddiebergman In the next version of auto-sklearn can we get the final model with small size for prediction. |
We are reworking some internals, this will likely not be a feature in the next release, apologies. |
@eddiebergman Thank you sir for your quick reponse |
One additional note: Auto-sklearn uses 512 trees, while scikit-learn by default only uses 100 trees. This explains why the models are in the ballbark of > 500MB. |
Model Size is big in auto sklearn
Auto Sklearn model size is big with respect to sklearn, Below are the examples:
I run autoskearn without feature or data preprocessing but still model size is very huge.
If it is due to ensemble size then I tried with different values of ensemble size 30, 10, 1 but the model size is almost the same, Why?
The text was updated successfully, but these errors were encountered: