DataFrameMapper.inverse_transform() for simple transformations #133

erikjandevries · 2017-11-14T08:25:21Z

I've added an inverse_transform() method to the DataFrameMapper that works for simple transformations.
I've included tests using the LabelEncoder and LabelBinarizer, which are passed.

This still fails for more complicated transformations such as Pipelines. I hope it's a useful start at least.

erikjandevries · 2017-11-14T08:34:32Z

Not sure what's going wrong - when I tested the solution, all tests passed. Should I have tested differently?

$ python -m pytest -s -q tests/test_dataframe_mapper.py
/usr/lib/python3.6/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20.
  "This module will be removed in 0.20.", DeprecationWarning)
...................................................
============================================================================================================= warnings summary ==============================================================================================================
tests/test_dataframe_mapper.py::test_list_transformers
  /usr/lib/python3.6/site-packages/sklearn/utils/validation.py:444: DataConversionWarning: Data with input dtype int64 was converted to float64 by StandardScaler.
    warnings.warn(msg, DataConversionWarning)

-- Docs: http://doc.pytest.org/en/latest/warnings.html
51 passed, 1 warnings in 4.82 seconds

devforfu · 2017-11-14T09:06:22Z

@erikjandevries Click Details link near CircleCI message to see what is going wrong. Mostly - PEP8 violations, as I can see.

Merge branch 'master' of github.com:erikjandevries/sklearn-pandas # Conflicts: # sklearn_pandas/dataframe_mapper.py # tests/test_dataframe_mapper.py

erikjandevries · 2017-11-14T14:01:24Z

@devforfu Thanks for the hints, indeed they were PEP8 violations, which I've now fixed.
I guess in my opinion, some PEP8 rules make my code less readable, but I understand the need for standardisation when working in (larger) teams :)

dukebody

Can you make the suggested change to avoid creating more internal attributes?

dukebody · 2018-03-25T15:57:32Z

sklearn_pandas/dataframe_mapper.py

@@ -283,6 +285,10 @@ def transform(self, X):
            self.transformed_names_ += self.get_names(
                columns, transformers, Xt, alias)

+            self.transformed_cols_ += [


I don't think we really need to store this. We already have the columns and transformers at self.built_features, and can get the names from self.transformed_names_.

dukebody · 2018-03-25T16:03:58Z

sklearn_pandas/dataframe_mapper.py

+
+        # Let's keep track of the column we've processed
+        prev_col = 0
+        for columns, transformers, transformed_cols in self.transformed_cols_:


Can be replaced by:

for built_feature, transformed_cols in zip(self.built_features, self.transformed_names_): transformed_cols = self.get_names(columns, transformers, X, alias) columns, transformers, _ = built_feature

devforfu · 2018-09-05T11:14:22Z

@erikjandevries Do you think that it is possible to address the issues pointed by @dukebody? Then we can do a final review and merge into master.

adithyabsk · 2018-11-09T05:38:54Z

After a failed PR and some fiddling around, I figured out why that new sub-field was necessary. In the case of one-to-many transformers, it is necessary to maintain a label list that preserves the grouping of the columns. (i.e. ['A'_1, 'A_2', 'A_3'] in the case of the label encoder) The field that @dukebody suggested to use only has these columns preserved in a flat structure. I would vote to merge this PR in (@devforfu) as it looks good otherwise.

anatol-grabowski · 2018-11-09T09:38:56Z

mapper = sklearn_pandas.DataFrameMapper([
    ('index', None),
   ...

With None transforms I get the error:

'NoneType' object has no attribute 'inverse_transform'

Though it isn't critical at all, the feature is nice and useful and can be merged as is. Just pointing a direction for further improvement.

edit: Actually, I was unable to make it work for me... TypeError: unhashable type: 'slice'

hu-minghao · 2023-12-14T09:03:33Z

你好，已收到，谢谢。

DataFrameMapper.inverse_transform() for simple transformations

1b4edd9

erikjandevries added 2 commits November 14, 2017 14:36

DataFrameMapper.inverse_transform() for simple transformations

58812ac

DataFrameMapper.inverse_transform() for simple transformations

1e54516

Merge branch 'master' of github.com:erikjandevries/sklearn-pandas # Conflicts: # sklearn_pandas/dataframe_mapper.py # tests/test_dataframe_mapper.py

dukebody requested changes Mar 25, 2018

View reviewed changes

erikjandevries mentioned this pull request Jul 11, 2018

Pandas In, Pandas Out? .inverse_transform() method #41

Open

adithyabsk mentioned this pull request Nov 9, 2018

Update to .inverse_transform from PR #133 #182

Closed

erikjandevries closed this by deleting the head repository Dec 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DataFrameMapper.inverse_transform() for simple transformations #133

DataFrameMapper.inverse_transform() for simple transformations #133

Uh oh!

erikjandevries commented Nov 14, 2017

Uh oh!

erikjandevries commented Nov 14, 2017

Uh oh!

devforfu commented Nov 14, 2017

Uh oh!

erikjandevries commented Nov 14, 2017

Uh oh!

dukebody left a comment

Uh oh!

dukebody Mar 25, 2018

Uh oh!

dukebody Mar 25, 2018

Uh oh!

devforfu commented Sep 5, 2018

Uh oh!

adithyabsk commented Nov 9, 2018 •

edited

Loading

Uh oh!

anatol-grabowski commented Nov 9, 2018 •

edited

Loading

Uh oh!

hu-minghao commented Dec 14, 2023 via email

Uh oh!

Uh oh!

DataFrameMapper.inverse_transform() for simple transformations #133

DataFrameMapper.inverse_transform() for simple transformations #133

Uh oh!

Conversation

erikjandevries commented Nov 14, 2017

Uh oh!

erikjandevries commented Nov 14, 2017

Uh oh!

devforfu commented Nov 14, 2017

Uh oh!

erikjandevries commented Nov 14, 2017

Uh oh!

dukebody left a comment

Choose a reason for hiding this comment

Uh oh!

dukebody Mar 25, 2018

Choose a reason for hiding this comment

Uh oh!

dukebody Mar 25, 2018

Choose a reason for hiding this comment

Uh oh!

devforfu commented Sep 5, 2018

Uh oh!

adithyabsk commented Nov 9, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anatol-grabowski commented Nov 9, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hu-minghao commented Dec 14, 2023 via email

Uh oh!

Uh oh!

adithyabsk commented Nov 9, 2018 •

edited

Loading

anatol-grabowski commented Nov 9, 2018 •

edited

Loading