This repository was archived by the owner on Sep 11, 2023. It is now read-only.
-
-
Notifications
You must be signed in to change notification settings - Fork 6
Remove PyTorch from the code #86
Comments
This was referenced Oct 6, 2021
#213 (comment) Idea is to use optional requirements for pytorch |
If it's OK, I'll keep this issue open until we've removed the pytorch dataloader and pytorch lightning from the "batch pre-processing" code :) |
sure thing, where is that? |
Tthe specific places where pytorch / pytorch lightning are still used are:
I might strip out these |
This was referenced Oct 19, 2021
30 tasks
linked with - #315 |
Repository owner
moved this from Todo
to Done
in Nowcasting
Nov 2, 2021
Repository owner
moved this from Done
to In Progress
in Nowcasting
Nov 2, 2021
Maybe its not quite closed, would be good to remove it from requirements too, ill have a go at this |
Oops, you're exactly right, sorry - this issue should still be open! |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
When I started
nowcasting_dataset
, the intention was to usenowcasting_dataset
to generate batches on-the-fly during ML training from separate Zarr stores for the satellite data, NWPs, and PV. But that turned out to be too slow and fragile :) So, we swapped to usingnowcasting_dataset
to pre-prepare batches ahead-of-time, and save them to disk. During ML training, we just need to load the batches from disk, and we're good-to-go. (Pre-preparing batches has a number of other advantages, too).But, this development history means that
nowcasting_dataset
still uses PyTorch (e.g. using the PyTorch DataLoader to run multiple processes). The code may become cleaner and faster and more flexible if we strip out PyTorch, and instead (maybe) useconcurrent.futures.ProcessPoolExecutor
to use multiple processes.TODO:
The text was updated successfully, but these errors were encountered: