
[ENH] EXPERIMENTAL: Example notebook based on the new data pipeline #1813


Open · wants to merge 66 commits into main

Conversation

@phoeenniixx (Contributor) commented Apr 6, 2025

Description

This PR adds an example notebook for the new v2 data pipeline vignette, containing a basic implementation of the TFT model using this pipeline. For more info, see #1812 and #1811.

Colab link: https://colab.research.google.com/drive/148MyhcNfYEh4CZ6vBXLqQNsUBF0n6_0v?usp=sharing

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter notebooks. (Powered by ReviewNB)

@phoeenniixx (Contributor, Author) commented:

Hi @fkiraly, I am getting this error:
[screenshot of the error]

I just downloaded the notebook from Colab and pasted it into the repo. Is there anything else I should do to avoid this? I really have no idea 😅


codecov bot commented Apr 11, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Please upload report for BASE (main@cfc7fc6). Learn more about missing BASE report.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #1813   +/-   ##
=======================================
  Coverage        ?   83.14%           
=======================================
  Files           ?       61           
  Lines           ?     6153           
  Branches        ?        0           
=======================================
  Hits            ?     5116           
  Misses          ?     1037           
  Partials        ?        0           
| Flag | Coverage Δ |
|---|---|
| cpu | 83.14% <ø> (?) |
| pytest | 83.14% <ø> (?) |

Flags with carried forward coverage won't be shown.


@fkiraly moved this from "PR in progress" to "PR under review" in May - Sep 2025 mentee projects (May 19, 2025)
@xandie985 commented:

Hi @phoeenniixx, the implementation looks good and insightful. Here are some questions I have about it. It's possible that there is a difference between the objectives we had with DSIPTS and sktime; still, I would like to discuss your point of view with respect to sktime.

  1. EncoderDecoderTimeSeriesDataModule assumes that the data fits in memory. How would your approach scale to datasets that do not fit into RAM? Are there plans to incorporate memory-efficient loading strategies like chunking or on-demand loading from disk?
  2. Does your module have any mechanisms to detect or correct irregular time series within its scope?
  3. Your _create_windows method checks for sufficient sequence length. How does your module handle missing values within a window, and what are the potential consequences for model training if windows contain significant NaNs?
  4. Consider adding a random seed for reproducibility, especially in setup(), where random shuffling takes place (see the sketch after this list).
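
For point 4, a minimal sketch of what seeding the shuffle could look like, assuming a Lightning-style DataModule; the constructor signature, the `seed` parameter, and the shuffling code here are illustrative assumptions, not the module's actual API:

```python
import torch
import lightning.pytorch as pl


class EncoderDecoderTimeSeriesDataModule(pl.LightningDataModule):
    # hypothetical excerpt; the real module's constructor differs
    def __init__(self, data, seed: int = 42):
        super().__init__()
        self.data = data
        self.seed = seed

    def setup(self, stage=None):
        # a dedicated generator makes the shuffle reproducible
        # without touching the global RNG state
        generator = torch.Generator().manual_seed(self.seed)
        shuffled_indices = torch.randperm(len(self.data), generator=generator)
        ...  # split shuffled_indices into train/val/test as before
```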

@phoeenniixx (Contributor, Author) commented:

Thank you for the review @xandie985!

EncoderDecoderTimeSeriesDataModule assumes that the data fits in memory. How would your approach scale to datasets that do not fit into RAM? Are there plans to incorporate memory-efficient loading strategies like chunking or on-demand loading from disk?

Right now we assume the data fits in memory, but yes, in the future we plan to add features like chunking, on-demand loading, etc.
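
Purely as a hedged illustration of what on-demand loading could look like (this is not the planned design; `LazyWindowDataset` and the `.npy` input are hypothetical), a map-style dataset over a memory-mapped array reads only the slices a batch actually touches:

```python
import numpy as np
from torch.utils.data import Dataset


class LazyWindowDataset(Dataset):
    """Hypothetical sketch: windows are cut from a memory-mapped array,
    so only the slices a batch needs are paged in from disk."""

    def __init__(self, path: str, window_size: int):
        # mmap_mode="r" keeps the array on disk; slicing reads on demand
        self.series = np.load(path, mmap_mode="r")
        self.window_size = window_size

    def __len__(self):
        return len(self.series) - self.window_size + 1

    def __getitem__(self, idx):
        window = self.series[idx : idx + self.window_size]
        # copy out of the memory map into a regular float32 array
        return np.array(window, dtype=np.float32)
```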

  1. Does your module have any mechanisms to detect or correct irregular time series within its scope?
  2. Your _create_windows method checks for sufficient sequence length. How does your module handle missing values within a window, and what are the potential consequences for model training if windows contain significant NaNs?
  3. Consider adding a random seed for reproducibility, especially in setup(), where random shuffling takes place.

These are open questions we still need to work on. We will tackle them in future iterations, once an end-to-end prototype is ready and we get feedback on it from users of the package.
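
As a sketch of one possible direction for the NaN question (nothing here is implemented yet; `is_usable_window` and its threshold are hypothetical), window filtering could be as simple as:

```python
import numpy as np


def is_usable_window(window: np.ndarray, max_nan_fraction: float = 0.1) -> bool:
    """Hypothetical helper: reject windows whose NaN share exceeds a threshold.
    Surviving NaNs could then be imputed (e.g. forward-filled) before batching."""
    return np.isnan(window).mean() <= max_nan_fraction
```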

@fkiraly (Collaborator) left a comment


More detailed review.

  • please remove the install from the start of the notebook
  • we should test that this is running while we are working on v2. One way is to move the content to docs/examples/tutorials, the contents of which are automatically run and tested.
  • the data generation cell is useful, but not too illustrative. Can you move the code to a function load_toydata or similar, in pytorch_forecasting.data, new module, e.g., toydata? Then we can also use this in testing later! (a possible shape is sketched after this list)
  • can you add basic markdown cells that explain what the notebook is showing and what each step does? E.g., a summary of the multiple steps at the top, and then small headers for the steps with minimal explanations.
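
A minimal sketch of what the suggested loader could look like, assuming the new pytorch_forecasting.data.toydata module; the function name follows the suggestion above, while the signature and generated columns are illustrative assumptions:

```python
# pytorch_forecasting/data/toydata.py (hypothetical module, per the review)
import numpy as np
import pandas as pd


def load_toydata(n_series: int = 3, n_timesteps: int = 100, seed: int = 0) -> pd.DataFrame:
    """Generate a small synthetic panel for the v2 pipeline notebook and tests."""
    rng = np.random.default_rng(seed)
    frames = []
    for series_id in range(n_series):
        trend = np.linspace(0.0, 1.0, n_timesteps)
        noise = rng.normal(scale=0.1, size=n_timesteps)
        frames.append(
            pd.DataFrame(
                {
                    "series_id": series_id,
                    "time_idx": np.arange(n_timesteps),
                    "value": trend + noise,
                }
            )
        )
    return pd.concat(frames, ignore_index=True)
```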

@fkiraly added the "documentation" label (Improvements or additions to documentation) on May 29, 2025
@phoeenniixx (Contributor, Author) commented:

Thanks! I will make the changes accordingly. Just one doubt:

the data generation cell is useful, but not too illustrative. Can you move the code to a function load_toydata or similar, in pytorch_forecasting.data, new module, e.g., toydata? Then we can also use this in testing later!

I think we can add it to pytorch_forecasting.data.examples? Right now people import get_stallion_data from there, so they can import toydata from there as well. I just think this would follow the already established mapping of "test data" to examples...
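
For context, the established pattern today versus the proposed addition (the second import is hypothetical, pending the decision above):

```python
# established pattern today:
from pytorch_forecasting.data.examples import get_stallion_data

# the proposed toy-data loader would sit alongside it (hypothetical):
# from pytorch_forecasting.data.examples import load_toydata
```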

@fkiraly (Collaborator) commented May 30, 2025

Right now people import get_stallion_data from there, so they can import toydata from there as well.

Makes sense to add it to the established location for data loaders.

Would it make sense to split the file up and have one loader per file? Need not be done in this PR.

@phoeenniixx (Contributor, Author) commented:

Would it make sense to split the file up and have one loader per file?

Then I think we should create a new folder called loaders or datasets and keep these files there; we can add more loaders to that folder in the future.
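
One possible layout under that proposal (folder and file names are hypothetical):

```
pytorch_forecasting/data/loaders/
    __init__.py      # re-exports loaders for backwards-compatible imports
    stallion.py      # get_stallion_data
    toydata.py       # load_toydata
```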

@fkiraly moved this from "PR under review" to "PR in progress" in May - Sep 2025 mentee projects (May 30, 2025)