[Question] How to know the data and feature preprocessing used in the ensemble? #1633

TuanDTr · 2022-12-19T16:19:58Z

Hi, the method AutoSklearnClassifier().show_models() displays the models found in the ensemble. I wonder if it is possible to know exactly which data or feature preprocessing steps have been done before training the model. The method show_models only gives only the object:

{'model_id': 2, 
'rank': 1, 
'cost': 0.04255319148936165, 
'ensemble_weight': 0.04, 
'data_preprocessor': <autosklearn.pipeline.components.data_preprocessing.DataPreprocessorChoice object at 0x7f704fb2dee0>, 
'balancing': Balancing(random_state=1), 
'feature_preprocessor': <autosklearn.pipeline.components.feature_preprocessing.FeaturePreprocessorChoice object at 0x7f70a7e4e7f0>,
'classifier': <autosklearn.pipeline.components.classification.ClassifierChoice object at 0x7f70a7e4e1c0>, 
'sklearn_classifier': RandomForestClassifier(max_features=5, n_estimators=512, n_jobs=1,
                       random_state=1, warm_start=True)}

and it is not clear to know which steps they are. Is it possible to get the preprocessing steps in such a case?

Many thanks!

The text was updated successfully, but these errors were encountered:

eddiebergman · 2023-01-11T15:45:17Z

Sorry for the delay,

You could also use estimator.leaderboard(detailed=True) to get a pandas version which gives the str of the choices made. However data_preprocessor encompassing many sub things that are chosen is kind of a pain point.

You can use this most likely:

models = estimator.show_models()
model = ...  # Select one

model_id = model["model_id"]

runhistory = estimator.automl_.runhistory_
full_config = runhistory.ids_config[model_id]

bonfire666666 · 2023-06-15T21:57:06Z

Sorry for the delay,

You could also use estimator.leaderboard(detailed=True) to get a pandas version which gives the str of the choices made. However data_preprocessor encompassing many sub things that are chosen is kind of a pain point.

You can use this most likely:
models = estimator.show_models()
model = ...  # Select one

model_id = model["model_id"]

runhistory = estimator.automl_.runhistory_
full_config = runhistory.ids_config[model_id]

I dont think the model_id matches the id in ids_config

eddiebergman mentioned this issue Jul 21, 2023

What's in store for Auto-Sklearn? -- From the Developers #1677

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Question] How to know the data and feature preprocessing used in the ensemble? #1633

[Question] How to know the data and feature preprocessing used in the ensemble? #1633

TuanDTr commented Dec 19, 2022 •

edited

Loading

eddiebergman commented Jan 11, 2023

Uh oh!

bonfire666666 commented Jun 15, 2023

Uh oh!

[Question] How to know the data and feature preprocessing used in the ensemble? #1633

[Question] How to know the data and feature preprocessing used in the ensemble? #1633

Comments

TuanDTr commented Dec 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

eddiebergman commented Jan 11, 2023

Uh oh!

bonfire666666 commented Jun 15, 2023

Uh oh!

TuanDTr commented Dec 19, 2022 •

edited

Loading