-
-
Notifications
You must be signed in to change notification settings - Fork 6
Conversation
nowcasting_dataset/data_sources/metadata/metadata_data_source.py
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! Just have a few suggestions possibly
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
wow, what a beast!
I think this PR is waaaaay too big. We should see if we can split up future work better to end up with smaller PRs. I find it really hard to review PRs this size and it takes quite a long time.
Google's eng-practices make a good case for smaller PRs (they call them CL: Change list): https://google.github.io/eng-practices/review/developer/small-cls.html
Also, my github browser window got really slow during reviewing haha.
I agree with @jacobbieker that we should try to remove commented-out code, as it's contained in the git history anyways.
There are also a bunch of unresolved todo items that are being introduced - I think we should generally try to either:
a) resolve those before merging the PR, or
b) raise an issue or link them to one, and then reference the issue number in the todo comment.
Otherwise I feel like we'll just forget about them or it's unclear who will work on resolving them in the future.
thanks for the suggestions Flo Co-authored-by: Flo <[email protected]>
Yea I agree, this was a big one. I really dont like doing big PRs, so I totally agree about breaking them down. Thanks very much @flowirtz for all the comments |
2. option to only delete files in folder, not folders
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Thanks for doing all this work!
Pull Request
Description
There are two different flows for the data representting, 1. saving to a batch file, 2. loading ready for ml
*** note that 2. could move to dataloader repo, but wanted to get it all sorted here first
Fixes #209
How Has This Been Tested?
Adjust unittest accordingly
written new unittest for new code
ran script/prepare_ml_data.py - it works!!!
No
Yes
Checklist: