# neural-forecast
y
Hi, I am wondering what is the proper way to perform in-sample evaluation. For example, a transformer-based model is trained to learn within-window behavior: it is given `input_size` points of data and asked to make a prediction of length `h`. The goal is to evaluate over randomly chosen windows of the validation dataset. Currently I am thinking of doing the resampling myself, but I am wondering if there is a built-in feature for in-sample evaluation. I think `predict_insample` might be doing this, but I am confused about its output. Does it only contain the predicted values? Is it a fair metric to simply compute the accuracy of `predict_insample` against the original df as a measure of the method?
k
Hey @Yang Guo, in-sample validation evaluation is a feature we have not implemented yet. It would require modifying the `BaseWindows` validation step to sample windows within the validation set.
y
Could you please expand on that? I thought I could simply use `nf.predict` and apply it directly to a subset of the dataset.
c
Hi @Yang Guo, we have other options as well. For example, the `cross_validation` method uses a validation set (of size `val_size`) for model selection, and then automatically produces the forecasts for the entire test set (of size `test_size`, or `n_windows` windows of size `h`).
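Roughly, the flow looks like this (a minimal sketch, not tested; the NHITS model, the AirPassengers example data, and the sizes are placeholders for illustration):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS
from neuralforecast.utils import AirPassengersDF

# One model, trained once; the historic data is split chronologically.
nf = NeuralForecast(models=[NHITS(input_size=24, h=12, max_steps=100)], freq='M')

# Hold out the last 12 points as test and the 12 before them as validation.
# When test_size is set explicitly, n_windows must be None.
cv_df = nf.cross_validation(df=AirPassengersDF, val_size=12, test_size=12, n_windows=None)

# cv_df contains unique_id, ds, cutoff, the true y, and one column per model,
# covering only the test windows.
print(cv_df.head())
```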
Alternatively, you can use `predict_insample` (run it after the `fit` or `cross_validation` method) to recover the forecasts for the entire train AND validation sets. You can then filter the forecasts however you want.
`predict_insample` already returns the true values in the `y` column as well; here is the tutorial: https://nixtla.github.io/neuralforecast/examples/predictinsample.html
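For example, something like this (a sketch, not tested, assuming an NHITS model fitted with `val_size=12`; the `NHITS` column name and the sizes are illustrative assumptions):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS
from neuralforecast.losses.numpy import mae
from neuralforecast.utils import AirPassengersDF

nf = NeuralForecast(models=[NHITS(input_size=24, h=12, max_steps=100)], freq='M')
nf.fit(df=AirPassengersDF, val_size=12)

# Forecasts over the whole train + validation history; the true values
# come back in the `y` column alongside one column per model.
insample_df = nf.predict_insample(step_size=12)

# Keep only the last 12 timestamps (the validation set) and score them.
val_start = AirPassengersDF['ds'].iloc[-12]
val_df = insample_df[insample_df['ds'] >= val_start]
print(mae(val_df['y'].values, val_df['NHITS'].values))
```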
Be careful to focus only on the validation set; it is not ideal to do model selection on the train set.
And depending on your use case, `cross_validation` already does the model selection for you. It essentially covers the entire pipeline (train the model on the train set, select on the validation set, predict on the test set).
y
I think the issue is that `predict_insample` can only be applied to the default dataset. So your point is that after performing cross-validation, I should then apply `predict_insample`?
Could you please explain cross-validation a little bit? It seems to require re-training (calling `model.fit` multiple times).
c
Can you provide more details on the pipeline you are trying to build? And no, `cross_validation` only trains the model once.
y
Actually, let me rephrase my question. `cross_validation` performs actual training on the dataset, though with some additional features. To evaluate the model, I cannot use the same dataset that was used in `cross_validation`. However, `predict_insample` does not take a dataset as input.
c
If your objective is to compare models on historic data, `cross_validation` is the way to go. This is the function we have used in our published research, and it is the standard way of comparing the performance of models. The historic data is separated chronologically into train/val/test sets. Models are trained on the train set, and the validation set is used for model selection and hyperparameter tuning (for example, if you use an `auto` model such as `AutoPatchTST`). Finally, it returns the forecasts on the test set, which was never seen by the model during training. We compare performance on the test set.
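As a concrete sketch (untested; the number of tuning samples and the example data are just assumptions):

```python
from neuralforecast import NeuralForecast
from neuralforecast.auto import AutoPatchTST
from neuralforecast.losses.numpy import mae
from neuralforecast.utils import AirPassengersDF

# The auto model runs a hyperparameter search internally, scoring each
# candidate configuration on the validation set.
nf = NeuralForecast(models=[AutoPatchTST(h=12, num_samples=5)], freq='M')
cv_df = nf.cross_validation(df=AirPassengersDF, val_size=12, test_size=12, n_windows=None)

# cross_validation returns forecasts for the test windows only, so this
# score is test-set performance on data the model never saw in training.
print(mae(cv_df['y'].values, cv_df['AutoPatchTST'].values))
```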
The more common use case of `predict_insample` is to recover the forecasts for the train set and validation set. Was this useful? Let me know if you have additional doubts; we can chat using direct messages as well.
y
I agree with this pipeline, but is there any way that I can simply evaluate a trained model? The evaluation would be done in a similar way to `predict_insample`, but on a different dataset.
c
Ok, I understand your point now. So essentially you want to train a model on one dataset and predict on a new one (not necessarily with the same time series)?
y
Yep
c
You should use the `predict` method for that, but as you said, it can only forecast one window. You just need to call `predict(df=new_df)`. We actually have a tutorial on transfer learning with this use case here: https://nixtla.github.io/neuralforecast/examples/transfer_learning.html
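A minimal sketch of that (using AirPassengersDF as a stand-in; `new_df` is a placeholder for your own data with `unique_id` / `ds` / `y` columns):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS
from neuralforecast.utils import AirPassengersDF

# Pretrain on one dataset.
nf = NeuralForecast(models=[NHITS(input_size=24, h=12, max_steps=100)], freq='M')
nf.fit(df=AirPassengersDF)

# Forecast a different dataset without retraining: the model reads the last
# input_size points of each series in new_df and predicts one window of h steps.
new_df = AirPassengersDF.copy()  # placeholder: replace with your new dataset
fcst_df = nf.predict(df=new_df)
```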
If you want more windows, you can hack it. For example, after training, set `nf.models[0].max_steps=0`. Then pass the new dataset to the `fit` method (set `use_init_models=False`), then call `predict_insample`.
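Putting that hack together (a sketch of the steps above, not an official API; it continues from the pretrained `nf` and the placeholder `new_df`, and the exact behavior may depend on the library version):

```python
# Allow zero additional training steps so the trained weights stay untouched.
nf.models[0].max_steps = 0

# "Fit" on the new data only to register it as the internal dataset;
# use_init_models=False keeps the already-trained models instead of
# restoring freshly initialized ones.
nf.fit(df=new_df, use_init_models=False)

# Now predict_insample produces rolled forecasts over the whole new dataset.
insample_new_df = nf.predict_insample(step_size=12)
```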