# neural-forecast
m
I've been testing neuralforecast for a few months now, and I think it's a great library. The thing that bothers me the most is that I found the algorithms to be not very "robust" (at least with my data): I spent literally days looking for a good set of hyperparameters for the various algorithms, but after retraining the model with one more week of data using the same hyperparameters, the predictions became very poor. Moreover, just changing the random seed (even keeping the same data) is enough to go from very good predictions to very poor ones. In practice, I'm forced to search for a new set of hyperparameters (including the random seed) every time I add new data. With ML algorithms for tabular datasets (not time-series forecasting), I've always set a seed to make the results reproducible, but I've never found the choice of seed to impact the final predictions so much (generally, even though the networks start from different initializations, training still leads to models that produce similar predictions). In contrast, with the time-series forecasting algorithms implemented in neuralforecast, there seems to be a kind of local-minima nightmare where a different random seed or a small change in the training data can lead to very different predictions for the same hyperparameters. Even a few more or fewer training steps can make a huge difference... it all seems very unstable. Do you have any suggestions on this (I'm currently using TFT), or do you think it is a hard problem to solve? Thanks
a
Hey Manuel, if you set the random_seed AND use the pl.Trainer option `deterministic` (set to True on CPU, and "warn" on GPU), you will be able to get deterministic predictions! 😉
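For reference, a minimal sketch of that setup (the `h`/`input_size` values are placeholders; extra keyword arguments on neuralforecast models are forwarded to the PyTorch Lightning Trainer, which is how `deterministic` gets through):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import TFT

# Fix the model's random seed; extra keyword arguments on neuralforecast
# models are forwarded to the PyTorch Lightning Trainer, so `deterministic`
# can be passed here (use "warn" instead of True when running on GPU).
model = TFT(
    h=24,                # forecast horizon (placeholder)
    input_size=48,       # lookback window (placeholder)
    random_seed=1,       # fixes weight initialization and shuffling
    deterministic=True,  # pl.Trainer flag for reproducible ops
)

nf = NeuralForecast(models=[model], freq="H")
# nf.fit(df)            # df: long-format frame with unique_id, ds, y columns
# preds = nf.predict()
```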
m
@Antoine SCHWARTZ -CROIX- Yes, the problem is not that the predictions aren't deterministic; the problem is that changing the training data even a little, changing the random seed, or slightly increasing/decreasing the number of steps can lead to very different predictions.
k
Hey @Manuel, some ideas that can help robustify the training procedure:
• Early stopping is one of the strongest regularization techniques for regression problems. Are you using it?
• Pretraining on a large dataset and fine-tuning on your data.
• Exploring the complexity of the model; if you have limited data you might want to make the model smaller.
• Using HuberLoss/HuberMQLoss, which helps the model's convergence and clips the gradients.
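A rough sketch of the early-stopping and Huber-loss suggestions, assuming a TFT model and placeholder horizon/window values (argument names may differ slightly across neuralforecast versions):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import TFT
from neuralforecast.losses.pytorch import HuberLoss

model = TFT(
    h=24,
    input_size=48,
    loss=HuberLoss(),              # robust loss that limits the impact of outliers
    early_stop_patience_steps=5,   # stop after 5 validation checks without improvement
    val_check_steps=50,            # run validation every 50 training steps
    max_steps=1000,
)

nf = NeuralForecast(models=[model], freq="H")
# val_size reserves the last points of each series as the validation signal
# nf.fit(df, val_size=24)
```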
m
@Kin Gtz. Olivares Thanks! Currently, I'm already using HuberLoss, and I'm using a custom early-stopping criterion where the validation set is made of some cherry-picked AutoARIMA forecasts for the most important time series. The reason is that many time series are quite short and I can't sacrifice a full horizon of timesteps as a validation set, otherwise the remaining data is not enough to produce meaningful forecasts (I'm using `use_init_models=False` to incrementally fit the model while checking the custom early-stopping criterion).
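A sketch of that kind of incremental-fit loop; `good_enough` and `arima_reference` are hypothetical stand-ins for the custom AutoARIMA-based criterion, and `nf`/`df` are assumed to exist already:

```python
# Hypothetical incremental-fit loop: train in short bursts and stop once a
# custom criterion (here the made-up `good_enough`) is satisfied.
for _ in range(10):
    # use_init_models=False continues from the current weights instead of
    # resetting to freshly initialized models
    nf.fit(df, use_init_models=False)
    preds = nf.predict()
    if good_enough(preds, arima_reference):  # hypothetical check vs. AutoARIMA forecasts
        break
```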
k
The two remaining ideas for helping with stability are:
• As long as the series are similar to your problem, pre-training on a larger dataset may help.
• Using very simple predictions (SeasonalNaive/Naive) through `futr_exog_list` to anchor the NeuralForecast model to learn residuals. Learning residual predictions is a much simpler problem.
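A sketch of the anchoring idea: compute a seasonal-naive baseline by hand and pass it through `futr_exog_list` as a future exogenous feature. The toy data, the column name `snaive`, and `season_length` are all assumptions for illustration:

```python
import numpy as np
import pandas as pd
from neuralforecast import NeuralForecast
from neuralforecast.models import TFT

# Toy long-format frame (unique_id, ds, y) standing in for the real data.
ds = pd.date_range("2023-01-01", periods=200, freq="H")
df = pd.DataFrame({
    "unique_id": "series_1",
    "ds": ds,
    "y": np.sin(np.arange(200) * 2 * np.pi / 24) + np.random.rand(200) * 0.1,
})

season_length = 24  # assumed daily cycle on hourly data

# Seasonal-naive baseline: the observation one season ago, used as an anchor feature.
df["snaive"] = df.groupby("unique_id")["y"].shift(season_length)
df = df.dropna(subset=["snaive"]).reset_index(drop=True)

model = TFT(h=24, input_size=48, futr_exog_list=["snaive"], max_steps=10)
nf = NeuralForecast(models=[model], freq="H")
nf.fit(df=df)

# For prediction, futr_df must provide `snaive` over the forecast horizon,
# e.g. by repeating the last observed season.
futr_df = pd.DataFrame({
    "unique_id": "series_1",
    "ds": pd.date_range(ds[-1] + pd.Timedelta(hours=1), periods=24, freq="H"),
    "snaive": df["y"].iloc[-24:].to_numpy(),
})
preds = nf.predict(futr_df=futr_df)
```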
c
Hi @Manuel. Kin already suggested a lot of great tips. Bad local minima and "chaotic" behavior when changing hyperparameters or the data are one of the main drawbacks of deep-learning methods. In our experience, some methods are very robust to these changes. For example, in the main table of the NHITS paper, the standard deviation of the performance across different runs (with the Auto models that change hyperparameters) is shown in parentheses; as you can see there, the performance of many models, such as NHITS, is very stable on average. Another common practice to get more stable predictions is ensembling multiple models, for example initializing each with a different random seed.
But in these cases we have a rather long validation set, so the early-stopping regularization and validation signal are very stable, leading to more stable results and forecasts.
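A sketch of the seed-ensembling idea: train the same architecture with several seeds and average the point forecasts. The `alias` argument is used here to name each model's output column; adjust if your neuralforecast version handles duplicate model names differently:

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import TFT

seeds = [1, 7, 42]
models = [
    # `alias` gives each copy its own output column name
    TFT(h=24, input_size=48, random_seed=seed, alias=f"TFT_seed{seed}")
    for seed in seeds
]

nf = NeuralForecast(models=models, freq="H")
# nf.fit(df)
# preds = nf.predict()
# Average the per-seed point forecasts into one ensemble forecast:
# preds["TFT_ensemble"] = preds[[f"TFT_seed{s}" for s in seeds]].mean(axis=1)
```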
m
@Cristian (Nixtla) Thanks! Unfortunately NHITS does not perform well on my specific dataset. The one that seems to perform best is TFT, but it is not very "stable" (I also ran into limited "stability" with NBEATSx). For TFT, hyperparameter tuning is also problematic because it is very slow (even using 1 GPU), and optimizing the hyperparameters takes me days each time.
m
> Early stopping is one of the strongest regularization techniques for regression problems. Are you using it? Pretraining on a large dataset and fine-tuning on your data.

Could you please tell me how I can use early stopping? I tried to change the param and give it 2, for example, but I don't know if that is correct. Also, about pretraining, do you mean I should train my model on another dataset and transfer the learning to my data? @Kin Gtz. Olivares
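In case it helps: `early_stop_patience_steps` is the argument to set (a value of 2 means training stops after 2 validation checks without improvement, and it needs a validation set via `val_size` in `fit`), and pretraining does mean training on another, larger dataset and then continuing on your own data. A rough transfer-learning sketch, where `big_df` and `my_df` are assumed long-format frames:

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import TFT

# 1) Pretrain on a large, related dataset.
model = TFT(h=24, input_size=48, early_stop_patience_steps=2, val_check_steps=50)
nf = NeuralForecast(models=[model], freq="H")
nf.fit(big_df, val_size=24)          # big_df: large pretraining dataset (assumed)
nf.save("pretrained_tft", overwrite=True)

# 2) Fine-tune on your own data, starting from the pretrained weights.
nf = NeuralForecast.load("pretrained_tft")
nf.fit(my_df, val_size=24, use_init_models=False)  # keep the pretrained weights
preds = nf.predict()
```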