[ENH] [WIP] Standardize model output to 4d tensor by PranavBhatP · Pull Request #1895 · sktime/pytorch-forecasting

PranavBhatP · 2025-06-18T04:50:20Z

Reference Issues/PRs

Fixes #1894 . Stacks on #1874.

Files changed:
- _base_model_v2.py

What does this implement/fix? Explain your changes.

The PR implements a standardize_model_output function in the v2 BaseModel class. This will be used as a helper function, to enforce the proposed standard for model tensor outputs, in the format - (batch_size, timesteps, n_features, quantiles). The PR is still a draft and needs discussion before actually enforcing the output change at the model level, since it will lead to widespread test failure due to a major breaking change in the output format.

Once the design is decided, we can proceed with trying this out with two models - dlinear and timexer.

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.
Added/modified tests
Used pre-commit hooks when committing to ensure that code is compliant with hooks. Install hooks with pre-commit install.
To run hooks independent of commit, execute pre-commit run --all-files

pytorch_forecasting/models/base/_base_model_v2.py

fkiraly · 2025-06-20T17:10:50Z

pytorch_forecasting/models/base/_base_model_v2.py

+            but any value can be explicitly provided. A fallback of 1 is used in case where
+            no information is available on ``last_dim``.
+
+        [2] This can currently handle situations where a single target is used


can you explain that and give an example? Can you give the simplest example that shows a 4D tensor is not possible in this case? Is this, for instance, that we use squared loss for variable 1 but parametric log-loss on mean/variance (of normal) on variable 2?

maybe using nestedtensor-s?
https://docs.pytorch.org/tutorials/prototype/nestedtensor.html
(is this a stable feature? Looks like it?)

I can try looking at this..

fkiraly · 2025-06-20T17:17:29Z

nice, I now get it. I left some minor remraks.

We should add tests for all individual conversion pathways here.

…atP/pytorch-forecasting into tslib-dlinear-model

…anavBhatP/pytorch-forecasting into standardise-output-format

agobbifbk · 2025-06-23T11:25:18Z

My 2 cents after the discussion of today (both with @PranavBhatP and @phoeenniixx ) maybe is better to setup the output format as a list of tensors (one for each output channels) and each tensor with the shape B x L x M where B is the batch size, L the length and M is a number that depends on the loss function chosen (1 for MSE, RMSE, N_quantiles for quantile loss and 2 for NorlmalDistribuion). This solves the problem related to the loss function (we are in most the same format of V1) and the output model shape. If we plan to have different loss function for the same output channel this needs to be refined (@fkiraly is this possible in the V1 of PTF?).
For computing the number of outputs the model should provided, we can compute it in the base model class summing up all the Ms and computing a list of shapes containing the Ms so we are able to cut the final 4D tensor according to the losses defined (e.g. [1,5,2] in case of MSE, quantile and ND).
Reintroducing this feature (different loss per different channels) will imply to rethink about the final output given to the user since the number of columns can be different for each output channel, maybe a list of pandas.DataFrames?

PranavBhatP · 2025-06-23T11:50:17Z

maybe is better to setup the output format as a list of tensors

Sorry, I didn't get you fully, are you suggesting that we make all outputs as a list of tensor?

fkiraly · 2025-06-23T20:04:43Z

agree with @agobbifbk - the only way I see how we can be consistent across the multioutput loss case is having the "list-of-tensor" approach or something equivalent, like nestedtensor. I do not think "list-of-tensor" is so much worse, so we can just stick with that?
regarding the tensors in the list, we can again try to be polymorphic.

Overall though, I think we should move towards writing an enhancement proposal, i.e., a proper API design document. This episode has shown me that this is a difficult problem where we should map out all use case vignettes and features before committing to a specific design - we have already changed it twice, partly also due to me not catching the difficulty earlier and recommending to go to API design first.

PranavBhatP · 2025-06-26T18:42:13Z

@fkiraly @agobbifbk @phoeenniixx

The proposal by this PR for a standard 4d tensor, can be scrapped. A better approach is outlined in the design doc. (high-level). Why should this be scrapped?

Design misconception: This is something which stemmed from the lack of a design document for this PR's changes. This PR assumes two things wrong
1. n_features being greater than one. At a fundamental level, all multi-target forecasts are list of tensor where each tensor is a single target, so an n_features dimension is redundant and does not match with the current API design.
2. Standardizing across 2 dimensions in some cases. in case of 2d tensor inputs, we unsqueeze the tensor to a 4d tensor and handling this with the ground truth would be expensive since we again have to maintain backwards compatibility while also ensuring 4d tensor compatibility in every loss function. very tedious work.

Imo, working on the design document was a really useful step and it did cover lots of gaps. We can ignore this proposal PR for now. Maybe close this pr also?

fkiraly · 2025-06-27T20:08:47Z

I would agree - for me the main argument would be the mixed loss case.

Though I would not delete the branch yet, the code might still be useful as lookup later on, and we have not fully decided on a target design yet.

fkiraly and others added 30 commits February 22, 2025 23:18

test suite

b3644a6

Merge branch 'main' into test-suite

a1d64c6

skeleton

4b2486e

skeleton

02b0ce6

Update test_all_estimators.py

41cbf66

Update _base_object.py

cef62d3

Update _lookup.py

bc2e93b

Update _lookup.py

eee1c86

base metadatda

164fe0d

registry

20e88d0

fix private name

318c1fb

Update _base_object.py

012ab3d

test failure

86365a0

Update test_all_estimators.py

f6dee46

Update test_all_estimators.py

9b0e4ec

Update test_all_estimators.py

7de5285

test folders

57dfe3a

Update test.yml

c9f12db

test integration

fa8144e

fixes

232a510

Update _conftest.py

1c8d4b5

try scenarios

f632e32

D1, D2 layer commit

252598d

remove one comment

d0d1c3e

model layer commit

80e64d2

update docstring

6364780

Merge branch 'refactor-d1-d2' into refactor-model

82b3dc7

update data_module.py

257183c

update data_module.py

9cdcb19

Merge branch 'refactor-d1-d2' into refactor-model

a83bf32

fkiraly reviewed Jun 20, 2025

View reviewed changes

pytorch_forecasting/models/base/_base_model_v2.py Outdated Show resolved Hide resolved

fkiraly reviewed Jun 20, 2025

View reviewed changes

fkiraly mentioned this pull request Jun 21, 2025

[ENH] Request for Quantiles Implementation for NHiTS #1896

Open

PranavBhatP added 10 commits June 22, 2025 11:55

make name changes to DLinear

8a0e673

Merge branch 'main' into tslib-dlinear-model

9a2979c

Merge branch 'tslib-dlinear-model' of https://www.github.com/PranavBh…

fa8cc3b

…atP/pytorch-forecasting into tslib-dlinear-model

update DLinearModel -> DLinear for model name

7e7688e

rename layers folder to private module

ea827a8

Merge branch 'main' into standardise-output-format

06b898b

Merge branch 'tslib-dlinear-model' into standardise-output-format

5994af3

Merge branch 'standardise-output-format' of https://www.github.com/Pr…

e82ab68

…anavBhatP/pytorch-forecasting into standardise-output-format

changes to clarify multi-target forecasting in docstring

1f60883

address code feedback and clarify docstring explanations

ce4eeae

PranavBhatP marked this pull request as ready for review June 23, 2025 05:31

PranavBhatP requested review from benHeid, fnhirwa, jdb78 and yarnabrina as code owners June 23, 2025 05:31

PranavBhatP moved this from In Progress to PR in progress in May - Sep 2025 mentee projects Jun 23, 2025

PranavBhatP moved this from PR in progress to In Progress in May - Sep 2025 mentee projects Jun 23, 2025

PranavBhatP moved this from In Progress to PR in progress in May - Sep 2025 mentee projects Jun 23, 2025

PranavBhatP mentioned this pull request Jun 24, 2025

[ENH] Create design document for model output and metrics standardization v2 #1900

Open

Merge branch 'main' into pr/1895

a33b9ba

jgyasu moved this from PR in progress to Todo in May - Sep 2025 mentee projects Jul 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] [WIP] Standardize model output to 4d tensor#1895

[ENH] [WIP] Standardize model output to 4d tensor#1895
PranavBhatP wants to merge 225 commits intosktime:mainfrom
PranavBhatP:standardise-output-format

PranavBhatP commented Jun 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

fkiraly Jun 20, 2025

Uh oh!

fkiraly Jun 20, 2025

Uh oh!

PranavBhatP Jun 22, 2025

Uh oh!

fkiraly commented Jun 20, 2025

Uh oh!

agobbifbk commented Jun 23, 2025

Uh oh!

PranavBhatP commented Jun 23, 2025 •

edited

Loading

Uh oh!

fkiraly commented Jun 23, 2025

Uh oh!

PranavBhatP commented Jun 26, 2025 •

edited

Loading

Uh oh!

fkiraly commented Jun 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

PranavBhatP commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

Uh oh!

Uh oh!

fkiraly Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

fkiraly Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

PranavBhatP Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

fkiraly commented Jun 20, 2025

Uh oh!

agobbifbk commented Jun 23, 2025

Uh oh!

PranavBhatP commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fkiraly commented Jun 23, 2025

Uh oh!

PranavBhatP commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fkiraly commented Jun 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

PranavBhatP commented Jun 18, 2025 •

edited

Loading

PranavBhatP commented Jun 23, 2025 •

edited

Loading

PranavBhatP commented Jun 26, 2025 •

edited

Loading