Remove PyTorch from the code #86

JackKelly · 2021-09-03T11:15:17Z

When I started nowcasting_dataset, the intention was to use nowcasting_dataset to generate batches on-the-fly during ML training from separate Zarr stores for the satellite data, NWPs, and PV. But that turned out to be too slow and fragile :) So, we swapped to using nowcasting_dataset to pre-prepare batches ahead-of-time, and save them to disk. During ML training, we just need to load the batches from disk, and we're good-to-go. (Pre-preparing batches has a number of other advantages, too).

But, this development history means that nowcasting_dataset still uses PyTorch (e.g. using the PyTorch DataLoader to run multiple processes). The code may become cleaner and faster and more flexible if we strip out PyTorch, and instead (maybe) use concurrent.futures.ProcessPoolExecutor to use multiple processes.
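A minimal sketch of the ProcessPoolExecutor idea, not the actual nowcasting_dataset code. `prepare_batch`, `prepare_batches`, and the `executor_cls` parameter are hypothetical names, and the batch contents are placeholders:

```python
import concurrent.futures as cf


def prepare_batch(batch_idx: int) -> dict:
    """Hypothetical stand-in for the work of building one batch."""
    # Real code would load satellite, NWP and PV data here and save the batch to disk.
    return {"batch_idx": batch_idx, "n_examples": 32}


def prepare_batches(batch_indices, max_workers=4, executor_cls=cf.ProcessPoolExecutor):
    """Prepare batches in parallel workers, with no PyTorch DataLoader involved."""
    with executor_cls(max_workers=max_workers) as executor:
        # Executor.map returns results in the same order as the inputs.
        return list(executor.map(prepare_batch, batch_indices))
```

With the default process pool, call `prepare_batches(range(8))` from under an `if __name__ == "__main__":` guard, because worker processes may re-import the main module. Passing `executor_cls=cf.ThreadPoolExecutor` can be handy for quick tests.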

TODO:

- Remove Datasets and Datamodule. Done in PR "Big new design Part 2 :)" #307.
- Remove pytorch lightning from requirements.txt and environment.yaml. Done in PR "Big new design Part 2 :)" #307.
- Remove torch from other python files
- Remove torch from requirements.txt and environment.yaml


peterdudfield · 2021-10-11T09:56:37Z

#213 (comment)
from big issue

The idea is to make pytorch an optional requirement.
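One way this could look in code, as a sketch only: the helper name `optional_import` and the extra name are assumptions, and the extra itself would still need to be declared in the packaging metadata (for example via setuptools' `extras_require`).

```python
import importlib


def optional_import(module_name: str, extra: str):
    """Import an optional dependency, or raise a helpful error if it is missing."""
    try:
        return importlib.import_module(module_name)
    except ImportError as err:
        raise ImportError(
            f"'{module_name}' is an optional dependency. "
            f"Install it with the '{extra}' extra to use this feature."
        ) from err
```

Code that still needs PyTorch could then call `torch = optional_import("torch", extra="torch")` inside the function that uses it, so importing the rest of the package does not require torch.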

JackKelly · 2021-10-11T10:53:35Z

If it's OK, I'll keep this issue open until we've removed the pytorch dataloader and pytorch lightning from the "batch pre-processing" code :)

peterdudfield · 2021-10-11T11:18:23Z

sure thing, where is that?

JackKelly · 2021-10-11T12:57:31Z

The specific places where pytorch / pytorch lightning are still used are:

- NowcastingDataModule inherits from pl.LightningDataModule. I think we can remove the dependency on pl.LightningDataModule and have NowcastingDataModule inherit from nothing.
- NowcastingDataModule.train_dataloader(), val_dataloader(), and test_dataloader() all return torch.utils.data.DataLoader objects.
- NowcastingDataset inherits from torch.utils.data.IterableDataset.
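A rough sketch of what the torch-free shape might look like. The class and method names below (`PlainBatchDataset`, `PlainDataModule`, `train_batches`, `val_batches`) are hypothetical and the batches are placeholder lists of indices; this is not the real NowcastingDataModule or NowcastingDataset code:

```python
from typing import Iterator, List


class PlainBatchDataset:
    """Hypothetical torch-free dataset: defining __iter__ makes it iterable."""

    def __init__(self, n_batches: int, batch_size: int):
        self.n_batches = n_batches
        self.batch_size = batch_size

    def __iter__(self) -> Iterator[List[int]]:
        for batch_idx in range(self.n_batches):
            start = batch_idx * self.batch_size
            # Placeholder batch: a list of example indices.
            yield list(range(start, start + self.batch_size))


class PlainDataModule:
    """Hypothetical data module that inherits from nothing."""

    def __init__(self, n_train_batches: int, n_val_batches: int, batch_size: int):
        self.train = PlainBatchDataset(n_train_batches, batch_size)
        self.val = PlainBatchDataset(n_val_batches, batch_size)

    def train_batches(self) -> PlainBatchDataset:
        # Returns a plain iterable instead of a torch.utils.data.DataLoader.
        return self.train

    def val_batches(self) -> PlainBatchDataset:
        return self.val
```

Any code that only loops over batches (`for batch in dm.train_batches(): ...`) keeps working, since a plain Python iterable is all it needs.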

I might strip out these PyTorch things in one of the sub-steps of #202 (but I haven't fully thought this through!)

peterdudfield · 2021-11-01T15:40:52Z

linked with - #315

peterdudfield · 2021-11-02T17:11:19Z

Maybe it's not quite closed. It would be good to remove it from requirements too; I'll have a go at this.

JackKelly · 2021-11-02T17:52:39Z

Oops, you're exactly right, sorry - this issue should still be open!

JackKelly · 2021-11-02T17:58:31Z

FWIW, these are the lines where "torch" is still mentioned in our code:

JackKelly self-assigned this Sep 3, 2021

JackKelly changed the title ~~Remove PyTorch~~ Remove PyTorch from the code Sep 3, 2021

JackKelly added this to the WP1 stretch goals milestone Sep 7, 2021

JackKelly mentioned this issue Sep 25, 2021

RuntimeError: unable to open shared memory object </torch_2276740_2849291446> in read-write mode #158

Closed

peterdudfield mentioned this issue Oct 11, 2021

Issue/86 remove torch #215

Merged

7 tasks

peterdudfield closed this as completed in #215 Oct 11, 2021

JackKelly reopened this Oct 11, 2021

This was referenced Oct 19, 2021

Multi process when saving to netcdf #244

Closed

Remove pytorch from scripts/validate_ml_data.py #282

Open

JackKelly mentioned this issue Oct 29, 2021

Big new design Part 2 :) #307

Merged

30 tasks

JackKelly linked a pull request Oct 29, 2021 that will close this issue

Big new design Part 2 :) #307

Merged

30 tasks

JackKelly added this to Nowcasting Nov 2, 2021

JackKelly moved this to Todo in Nowcasting Nov 2, 2021

JackKelly added the enhancement and refactoring labels Nov 2, 2021

JackKelly closed this as completed in #307 Nov 2, 2021

Repository owner moved this from Todo to Done in Nowcasting Nov 2, 2021

peterdudfield reopened this Nov 2, 2021

Repository owner moved this from Done to In Progress in Nowcasting Nov 2, 2021

peterdudfield mentioned this issue Nov 3, 2021

Issue/86 remove torch #329

Merged

7 tasks

peterdudfield closed this as completed in #329 Nov 3, 2021

Repository owner moved this from In Progress to Done in Nowcasting Nov 3, 2021
