Hello I have a question about the iTransformer in neuralfore Nixtla Community #neural-forecast

Hello. I have a question about the iTransformer in...

田口天晴

02/24/2025, 9:53 AM

Hello. I have a question about the iTransformer in neuralforecast. When I use the transformer with the same data, I always encounter the following error at the same point. Upon checking, it turns out that the values in a 32×90×32 tensor are all NaN. What could be the possible reasons for this occurring? Especially, I want to know what kind of things in _base_multivariate.py.

Olivier

02/24/2025, 10:12 AM

It probably means your loss is NaN. From the posted code, I'd guess that most likely culprit is using

identity

as scaler_type. I'd suggest to use

robust

standard

and try again.

Olivier

02/24/2025, 10:12 AM

But I could help more if you post a piece of code that I can run that reproduces the error

Olivier

02/24/2025, 10:15 AM

Secondly, it seems you're using a custom loss function? This loss function in conjunction with the scaler_type is highly likely to be the culprit here. E..g. try different scaler and loss function (e.g. 'robust' and 'MAE') -> probably no errors.

田口天晴

02/24/2025, 10:16 AM

Yes, my loss is nan! Ok, I try it on your suggestion. Thanks !!

田口天晴

02/24/2025, 2:17 PM

We created a custom loss function for business reasons. Instead of forecasting daily values, we aim to predict the total score over 30-day, 45-day, and 60-day periods. From the perspective of the neuralforecast developers, do you think it's feasible to configure the loss function in this way?

無題

Olivier

02/24/2025, 2:31 PM

First, the loss function should inherit from losses.pytorch.BasePointLoss. See also how MAE or RMSE has been defined in losses.pytorch. Second, this loss function doesn't really make sense imho. As you sum all values, there's no real 'incentive' for the network to perform well on individual timestamps. If you want to weight timestamps, I'd use horizon_weight parameter in RMSE, which makes more sense, and then set the weights comparable to your alpha/betta/gamma parameters.

田口天晴

02/24/2025, 3:45 PM

Thank you! I'll make the adjustments as per your advice and give it a try. I have one more question. The tensor sizes are as follows: •

insample_y

→

torch.Size([32, 16, 32])

•

outsample_y

→

torch.Size([32, 90, 32])

•

output

→

torch.Size([32, 90, 32])

In the

output

tensor, the dimensions are

32 × 90 × 32

. I understand that: •

= batch size •

= predict length (h) •

= What is this?

Olivier

02/24/2025, 6:20 PM

You gave n_series=32 as input

田口天晴

02/25/2025, 5:04 AM

Got it! Thanks for everything, Olivier.

👍 1

4 Views

Open in Slack

Previous Next