ENH: column label filtering via regexes to work for numeric names #10384

cyrusmaher · 2015-06-18T06:59:19Z

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]")

closes #10506


jreback · 2015-06-18T12:52:52Z

can you add some tests?

cyrusmaher · 2015-06-18T21:06:09Z

For search(x) -> search(str(x))?
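A minimal sketch of the change being asked about, using only the standard library's `re` module rather than pandas itself. `filter_labels` is a hypothetical helper, not the function in generic.py; it only illustrates converting each label with `str()` before calling `search`.

```python
import re


def filter_labels(labels, regex):
    """Return the labels whose string form matches ``regex`` (hypothetical helper)."""
    matcher = re.compile(regex)
    # str(x) lets an integer label such as 13 be searched like the text "13".
    return [x for x in labels if matcher.search(str(x)) is not None]


labels = [13, 24, 35, "A1"]
selected = filter_labels(labels, "[12][34]")
print(selected)  # [13, 24]
```

The labels are returned with their original types, so integer column names stay integers after filtering.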

cyrusmaher · 2015-07-03T18:39:04Z

Any advice on what to add or where? I don't see any existing tests for this function...

jreback · 2015-07-03T18:45:45Z

look in pandas/tests/test_frame for test_filter

cyrusmaher · 2015-07-03T19:40:43Z

Thanks Jeff! Added the test. Let me know what you think...

jreback · 2015-07-03T19:45:10Z

pandas/tests/test_frame.py

-
+
+        # regex with ints in column names
+        df = DataFrame(0., index=[0, 1, 2], columns=[0, 1, 'A1', 'B'])


add the issue number as a comment (this PR number since no associated issue)

jreback · 2015-07-03T19:48:19Z

add a note in whatsnew/0.17.0. Put it in the Other Enhancements section

What would this do in 0.16.2 (if you passed the regex)? Not filter anything, or raise?

cyrusmaher · 2015-07-03T20:06:50Z

Done! In 0.16.2 re.search will raise if a column name is numeric...
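To illustrate that failure with the standard library alone (no pandas involved, and the variable names are only illustrative): a compiled pattern's `search` accepts strings, so an integer label fails before any matching happens. In current CPython this surfaces as a `TypeError`.

```python
import re

matcher = re.compile("[12][34]")

try:
    # An integer label passed straight to search, without str().
    matcher.search(13)
    raised = None
except TypeError as exc:
    raised = exc

print(type(raised).__name__)                # TypeError
print(matcher.search(str(13)) is not None)  # True
```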

jreback · 2015-07-03T21:42:18Z

doc/source/whatsnew/v0.17.0.txt

@@ -26,7 +26,8 @@ New features

 Other enhancements
 ^^^^^^^^^^^^^^^^^^
-
+- `regex` argument to DataFrame.filter now handles numeric column names instead of raising an exception.


use double backticks here (and around DataFrame.filter)

add the issue number (this PR number) onto the end (see how the other issues are done)

say instead of raising ValueError

jreback · 2015-07-03T21:45:42Z

when you are all done, pls rebase/squash (see the contributing docs)


cyrusmaher · 2015-07-03T22:44:49Z

I'm having trouble with squashing the commits. I don't have a ton of experience with git, so I'm not sure what to do next. Below is the message. Seems to have to do with a merge conflict in test_frame? Any advice?

error: could not apply ac90352... Add test for regex filter on numeric column names

When you have resolved this problem, run "git rebase --continue".
If you prefer to skip this patch, run "git rebase --skip" instead.
To check out the original branch and stop rebasing, run "git rebase --abort".

jreback · 2015-07-03T22:46:36Z

contributing docs are here: http://pandas.pydata.org/pandas-docs/stable/contributing.html

you have a conflict and need to fix it

# The first commit's message is:
Fix regex filter for numeric columns

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]")
Add test for regex filter on numeric column names
Add release note
Add second regex test

# This is the 2nd commit message:
Update generic.py

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]")


cyrusmaher · 2015-07-03T23:37:52Z

Hmm, when I rebase it detects conflicts, then I resolve them using git mergetool, and commit. Doesn't seem to change anything. When I run git merge master I get that everything is up-to-date. I'm probably missing something simple?

jreback · 2015-07-05T16:23:51Z

FYI, you don't normally need to add an issue if you just create a PR (like you did), but no biggie.

jreback · 2015-07-05T16:40:42Z

I rebased this for you: https://travis-ci.org/jreback/pandas/builds/69631109

FYI don't use merge master. This is not pandas standard practice. This makes rebasing much more difficult.

jreback · 2015-07-06T12:01:38Z

merged via bfe5a7f

thanks!

Update generic.py

ac58777

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]")

jreback added the Indexing (Related to indexing on series/frames, not to indexes themselves) label Jun 18, 2015

jreback changed the title ~~Update generic.py~~ ENH: column label filtering via regexes to work for numeric names Jun 18, 2015

jreback added this to the 0.17.0 milestone Jul 3, 2015

jreback added the API Design label Jul 3, 2015

Add test for regex filter on numeric column names

ac90352

jreback reviewed Jul 3, 2015

cyrusmaher added 2 commits July 3, 2015 12:56

Add release note

b70b9c1

Add second regex test

3a9934d

jreback reviewed Jul 3, 2015

cyrusmaher and others added 4 commits July 3, 2015 15:10

Fix regex filter for numeric columns

12d79e7

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]") Add test for regex filter on numeric column names Add release note Add second regex test

Merge branch 'patch-1' of https://github.com/cyrusmaher/pandas into p…

ccc7490

…atch-1

Update docs, test

009422c

Fix merge conflict

b46133f

cyrusmaher and others added 4 commits July 3, 2015 16:05

Fix merge conflict

88a8e3e

Fix merge conflict?

49b607f

Update generic.py

2a9ddd1

Simple fix to allow regex filtering to work for numeric column labels, e.g. df.filter(regex="[12][34]")

cyrusmaher and others added 6 commits July 3, 2015 16:28

Add test for regex filter on numeric column names

0d3af4c

Add release note

94626cc

Add second regex test

3bb6d05

Update docs, test

86d523a

Fix merge conflict

f562f7f

Maybe this merge fix worked

d9c4523

cyrusmaher mentioned this pull request Jul 4, 2015

regex option for DataFrame.filter raises error on numeric column names #10506

Closed

jreback closed this Jul 6, 2015

cyrusmaher deleted the patch-1 branch July 7, 2015 04:17

jreback mentioned this pull request May 6, 2016

BUG: .filter with unicode labels when can't encode #13101

Closed
