Fix incorrect input routing for models #3186

Merged (3 commits) on May 31, 2024
Conversation

@shchur (Contributor) commented May 31, 2024

Fixes #3185

Description of changes:

There is currently a bug where the model inputs may be routed incorrectly by the forecast generator. This effectively results in `past_feat_dynamic_real` and `past_feat_dynamic_cat` being ignored by the TFT model.

MWE:

```python
from unittest import mock

import numpy as np
import pandas as pd

from gluonts.torch.model.tft import TemporalFusionTransformerEstimator

freq = "D"
N = 50
data = [
    {
        "target": np.arange(N),
        "past_feat_dynamic_real": np.random.rand(1, N).astype("float32"),
        "start": pd.Period("2020-01-01", freq=freq),
    }
]

predictor = TemporalFusionTransformerEstimator(
    prediction_length=1,
    freq=freq,
    past_dynamic_dims=[1],
    trainer_kwargs={"max_epochs": 1},
).train(data)

with mock.patch(
    "gluonts.torch.model.tft.module.TemporalFusionTransformerModel._preprocess"
) as mock_fwd:
    try:
        fcst = list(predictor.predict(data))
    except Exception:
        pass
    call_kwargs = mock_fwd.call_args[1]

call_kwargs["feat_dynamic_cat"]
# tensor([[[0.8073]]])
call_kwargs["past_feat_dynamic_real"]
# None
```

The bug occurs because model inputs are passed as positional arguments instead of keyword arguments.
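The failure mode can be reproduced without GluonTS at all. Below is a minimal sketch (a hypothetical toy network, not the GluonTS API) of how positional passing silently misroutes inputs when the input dict does not cover every parameter in the callee's signature:

```python
def toy_network(feat_dynamic_cat=None, past_feat_dynamic_real=None):
    # Echo the received arguments so we can inspect the routing.
    return {
        "feat_dynamic_cat": feat_dynamic_cat,
        "past_feat_dynamic_real": past_feat_dynamic_real,
    }

inputs = {"past_feat_dynamic_real": [0.8073]}

# Positional call: the value lands in the *first* parameter slot,
# regardless of which key it was stored under.
misrouted = toy_network(*inputs.values())
print(misrouted)
# {'feat_dynamic_cat': [0.8073], 'past_feat_dynamic_real': None}

# Keyword call: each value reaches the parameter it was meant for.
routed = toy_network(**inputs)
print(routed)
# {'feat_dynamic_cat': None, 'past_feat_dynamic_real': [0.8073]}
```

This mirrors the mocked output above: the `past_feat_dynamic_real` tensor shows up under `feat_dynamic_cat`, while `past_feat_dynamic_real` itself is `None`.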

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Please tag this PR with at least one of these labels to make our release process faster: BREAKING, new feature, bug fix, other change, dev setup

@shchur shchur requested a review from lostella May 31, 2024 09:51
@shchur shchur added the bug fix (one of pr required labels) label and removed the bug (Something isn't working) label May 31, 2024
```diff
@@ -82,6 +82,14 @@ def make_distribution_forecast(distr, *args, **kwargs) -> Forecast:
     raise NotImplementedError


+def make_predictions(prediction_net, inputs: dict):
+    # MXNet predictors only support positional arguments
+    if prediction_net.__class__.__module__.startswith("gluonts.mx"):
```
@shchur (Contributor, author) commented:
I couldn't find a more elegant way to use different logic for MXNet and PyTorch models :/

I tried `@singledispatch`, but that doesn't work for subclasses (i.e., we'd need to define it for all subclasses of `pl.LightningModule` in GluonTS, and the same for MXNet).
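The diff hunk above is truncated after the module-name check. A hedged sketch of how such a dispatch might look (an assumed shape, not the actual GluonTS implementation; the `Fake*` classes are stand-ins used only to exercise both branches):

```python
def make_predictions(prediction_net, inputs: dict):
    # Assumed sketch of the dispatch: MXNet networks only accept
    # positional arguments, so unpack the dict by value order;
    # PyTorch networks can take keyword arguments, which keeps the
    # input routing unambiguous.
    if prediction_net.__class__.__module__.startswith("gluonts.mx"):
        return prediction_net(*inputs.values())
    return prediction_net(**inputs)


# Stand-in callables pretending to live in the respective packages.
class FakeMXNet:
    __module__ = "gluonts.mx.model.simple_feedforward"

    def __call__(self, a, b):
        return ("positional", a, b)


class FakeTorchNet:
    __module__ = "gluonts.torch.model.tft"

    def __call__(self, a=None, b=None):
        return ("keyword", a, b)


print(make_predictions(FakeMXNet(), {"a": 1, "b": 2}))  # ('positional', 1, 2)
print(make_predictions(FakeTorchNet(), {"b": 2}))       # ('keyword', None, 2)
```

Checking `__class__.__module__` is a pragmatic alternative to type-based dispatch here: it needs no registry of subclasses and degrades to the safer keyword path for anything outside `gluonts.mx`.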

@lostella lostella added pending v0.15.x backport This contains a fix to be backported to the v0.15.x branch pending v0.14.x backport This contains a fix to be backported to the v0.14.x branch labels May 31, 2024
@lostella lostella merged commit 5e30960 into awslabs:dev May 31, 2024
20 checks passed
lostella pushed a commit to lostella/gluonts that referenced this pull request May 31, 2024
@lostella lostella mentioned this pull request May 31, 2024
lostella added a commit that referenced this pull request May 31, 2024
*Description of changes:* backporting fixes
- #3186

Co-authored-by: Oleksandr Shchur <shchuro@amazon.com>
@lostella lostella removed the pending v0.15.x backport This contains a fix to be backported to the v0.15.x branch label May 31, 2024
kashif pushed a commit to kashif/gluon-ts that referenced this pull request Jun 15, 2024
@lostella lostella removed the pending v0.14.x backport This contains a fix to be backported to the v0.14.x branch label Nov 7, 2024
Successfully merging this pull request may close these issues.

Error calling make_evaluation_predictions with TFT using past_feat_dynamic_real after update to 0.15.0