Add multiple file support #188

dgzlopes · 2019-05-29T10:48:17Z

Signed-off-by: Daniel González Lopes [email protected]

Small fix for #180. Multiple file support might be interesting for the new benchmark script.

domanchi

Please write test cases for this new feature.

detect_secrets/core/baseline.py

dgzlopes · 2019-05-29T18:19:05Z

@domanchi Added multiple valid files and multiple invalid files tests on the last update! Should I add test for some more edge case?

And thank you for your patience and taking some time to review the PR!

tests/main_test.py

detect_secrets/core/baseline.py

tests/core/baseline_test.py

detect_secrets/core/baseline.py

tests/main_test.py

detect_secrets/core/baseline.py

dgzlopes · 2019-06-09T15:55:29Z

Learned a lot from the review! Both about Python and the problem. So I tried to fix the missing logic and all the small bits. Now for each element on the path, I check if It's a directory or a file:

If It's a directory:
- If we have the scan_all_files flag active It runs _get_files_recursively(element).
- If not It runs _get_git_tracked_files(element).
- In both cases, It extends the returned list to the files_to_scan list.
If it's a file It's appended to the files_to_scan list.
If the element doesn't exist the error is logged!

Seems to work with only files, only dirs, a mix of files and dirs, mix of valid and invalid files/dirs and with scan_all_files flag too!

A little bit offtopic... but I would love to learn more about Pythonic code style and the Python way of doing things. I have read this resource [0] (Well, in reality, all the guide!) but felt a little short... So, do you know any other great resources (Talks, Books...Whatever) for learning more about this topic? (As the Types talk from my last PR! Was really helpful)

[0] https://docs.python-guide.org/writing/style/

domanchi

Looking a lot better!

@dgzlopes, my favorite resource (actually recommended by @KevinHock) for making your code more Pythonic is: https://www.youtube.com/watch?v=OSGv2VnC0go

tests/core/baseline_test.py

KevinHock · 2019-06-10T19:54:06Z

detect_secrets/core/usage.py

@@ -120,7 +120,7 @@ def add_arguments(self):
    def _add_initialize_baseline_argument(self):
        self.parser.add_argument(
            'path',
-            nargs='?',
+            nargs='*',


I'm not sure, but would it be possible to use '+'? So you wouldn't have to do

:type path: str|list

and the isinstance check

Well, as '+' generates an error message if there isn't at least one command-line argument present... I thought that as we are using default's, '*' would be a better fit. Running the test suite with '+' breaks with detect-secrets scan: error: too few arguments (Well, as intended!).

Maybe I'm missing something... But If you think it's an improvement I can take a closer look at it :)

You’re completely right :)

From reading https://stackoverflow.com/questions/23490152/list-of-arguments-with-argparse#23490179 it seems it should be always be a list with *, so maybe that’s true?

Yes! Looks like all is passed as a list, even the default argument '.'

So the isInstance() might be redundant as when we pass (on initialize()) path='.' it's never used... Running scan without path(s) will trigger argparse's default.

This was closely related to #188 (comment) so maybe using the list as a default isn't that great... But I think '+' passes all as a list too! What's the best way to handle this?

If I understand, the

if not isinstance(path, list): path = [path]

is always dead-code, so we can remove that.

From reading the code, it seems a path is always passed in to initialize, so there doesn't seem to be a need for a default argument I think, right?

If so we can change

def initialize( plugins, exclude_files_regex=None, exclude_lines_regex=None, path='.', scan_all_files=False, ):

to

def initialize( path, plugins, exclude_files_regex=None, exclude_lines_regex=None, scan_all_files=False, ):

As I thought :) Thanks for the writeup. Updated the PR and fixed the failing tests. I hope It looks better now!

Looks awesome 👏

MVrachev · 2019-06-11T14:00:19Z

I believe it will be useful to add information about this feature in the README.
Maybe just a small example calling detect-secret on multiple files.

dgzlopes · 2019-06-12T20:25:12Z

@MVrachev I thought that multiple file support was an obvious feature for this type of tool! But I added a little Unix-like doc to the readme explaining scan mode inputs as this might not be that obvious :)

And thank you both @domanchi and @KevinHock for the link to the talk!

Signed-off-by: Daniel González Lopes <[email protected]> Improve multiple file support Signed-off-by: Daniel González Lopes <[email protected]> Fixing multiple files logic Signed-off-by: dgzlopes <[email protected]> Fixing baseline tests Signed-off-by: dgzlopes <[email protected]> Fix initialize and tests Signed-off-by: dgzlopes <[email protected]>

KevinHock

LGTM 🚢✅

Add multiple file support

domanchi suggested changes May 29, 2019

View reviewed changes

detect_secrets/core/baseline.py Show resolved Hide resolved

detect_secrets/core/baseline.py Outdated Show resolved Hide resolved

detect_secrets/core/baseline.py Outdated Show resolved Hide resolved

dgzlopes commented May 29, 2019

View reviewed changes

tests/main_test.py Show resolved Hide resolved

domanchi reviewed Jun 3, 2019

View reviewed changes

domanchi approved these changes Jun 10, 2019

View reviewed changes

tests/core/baseline_test.py Outdated Show resolved Hide resolved

KevinHock reviewed Jun 10, 2019

View reviewed changes

dgzlopes force-pushed the fix-180-multiple-file-support branch from aeed00e to 8fd3289 Compare June 12, 2019 20:28

dgzlopes force-pushed the fix-180-multiple-file-support branch from 13c72c6 to 75cb635 Compare June 14, 2019 08:11

KevinHock approved these changes Jun 15, 2019

View reviewed changes

KevinHock merged commit 2e06c6d into Yelp:master Jun 15, 2019

KevinHock added a commit that referenced this pull request Jun 15, 2019

↪️ Merge pull request #188 from dgzlopes/fix-180-multiple-file-support

0825875

Add multiple file support

domanchi mentioned this pull request Jun 15, 2019

Support scanning multiple git repositories in one invocation #193

Merged

dgzlopes deleted the fix-180-multiple-file-support branch June 15, 2019 06:39

domanchi mentioned this pull request Jun 15, 2019

Support scanning multiple specified files on the command line #180

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multiple file support #188

Add multiple file support #188

dgzlopes commented May 29, 2019

domanchi left a comment

dgzlopes commented May 29, 2019 •

edited

Loading

dgzlopes commented Jun 9, 2019 •

edited

Loading

domanchi left a comment

KevinHock Jun 10, 2019

dgzlopes Jun 12, 2019

KevinHock Jun 13, 2019

dgzlopes Jun 13, 2019 •

edited

Loading

KevinHock Jun 13, 2019 •

edited

Loading

dgzlopes Jun 14, 2019 •

edited

Loading

KevinHock Jun 15, 2019

MVrachev commented Jun 11, 2019

dgzlopes commented Jun 12, 2019 •

edited

Loading

KevinHock left a comment

Add multiple file support #188

Add multiple file support #188

Conversation

dgzlopes commented May 29, 2019

domanchi left a comment

Choose a reason for hiding this comment

dgzlopes commented May 29, 2019 • edited Loading

dgzlopes commented Jun 9, 2019 • edited Loading

domanchi left a comment

Choose a reason for hiding this comment

KevinHock Jun 10, 2019

Choose a reason for hiding this comment

dgzlopes Jun 12, 2019

Choose a reason for hiding this comment

KevinHock Jun 13, 2019

Choose a reason for hiding this comment

dgzlopes Jun 13, 2019 • edited Loading

Choose a reason for hiding this comment

KevinHock Jun 13, 2019 • edited Loading

Choose a reason for hiding this comment

dgzlopes Jun 14, 2019 • edited Loading

Choose a reason for hiding this comment

KevinHock Jun 15, 2019

Choose a reason for hiding this comment

MVrachev commented Jun 11, 2019

dgzlopes commented Jun 12, 2019 • edited Loading

KevinHock left a comment

Choose a reason for hiding this comment

dgzlopes commented May 29, 2019 •

edited

Loading

dgzlopes commented Jun 9, 2019 •

edited

Loading

dgzlopes Jun 13, 2019 •

edited

Loading

KevinHock Jun 13, 2019 •

edited

Loading

dgzlopes Jun 14, 2019 •

edited

Loading

dgzlopes commented Jun 12, 2019 •

edited

Loading