# general
J.
Regarding my initial question (https://nixtlacommunity.slack.com/archives/C031EJJMH46/p1693989589210909), I would like to ask whether my approach is the recommended way to do an extensive cross_validation in the special case that the forecasting horizon is only 1. I am asking because all examples in the documentation use roughly 3 < n_windows < 20, which essentially means only "20 cross-validated days" in the case of h=1. Also, I would like to ask whether the Auto* models perform hyperparameter tuning and model selection for each window inside the cross-validation, or whether they only perform one initial model selection on the whole time series, find the best model, and use that best model for each window inside the cross-validation. Thank you in advance!
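(Editor's note: a minimal sketch of the h=1 cross-validation setup described in the question. The file name, column layout, frequency, and season_length are assumptions, not details from the thread.)

```python
# Rough sketch of extensive cross-validation with horizon 1 in statsforecast.
import pandas as pd
from statsforecast import StatsForecast
from statsforecast.models import AutoARIMA

df = pd.read_csv("series.csv")  # expected columns: unique_id, ds, y (assumption)

sf = StatsForecast(models=[AutoARIMA(season_length=7)], freq="D")

# With h=1, each window contributes a single cross-validated day, so a large
# n_windows (with step_size=1) rolls the cutoff forward one day at a time.
cv_df = sf.cross_validation(df=df, h=1, n_windows=365, step_size=1)
```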
José Morales
Hey. With respect to the Auto models, they find the best parameters for each window
J.
@José Morales Do you know if the same is true for the neuralforecast models? Judging by the time needed to finish the AutoMLP cross-validation (roughly 30 seconds on my 4090 and my dataset, versus about 1.9 hours for AutoARIMA on the same dataset), it seems like neuralforecast does one hyperparameter search per series and not per window. Is that correct?
José Morales
Hey. neuralforecast uses a single model for all series, so it's still per window but it only searches n_windows instead of n_windows * n_series
J.
@José Morales Hey, so it randomly chooses one series X from my "big dataframe" and then, for each window in X, it chooses the best parameters? So it has one parameter combination for one particular window and uses that combination for the corresponding windows of the other N-1 time series? Wouldn't that mean the hyperparameter tuning depends heavily on the chosen series X and not on all time series?
Or do you mean that it uses a single model which is trained on all time series (a global model)?
José Morales
It uses a global model
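(Editor's note: a minimal sketch of what a global-model setup looks like in neuralforecast, assuming the same unique_id/ds/y layout and daily frequency as above; the file name, num_samples, and n_windows values are placeholders.)

```python
# Illustrative sketch, not the poster's code: a single AutoMLP is tuned and
# trained globally on all series in the dataframe.
import pandas as pd
from neuralforecast import NeuralForecast
from neuralforecast.auto import AutoMLP

df = pd.read_csv("series.csv")  # expected columns: unique_id, ds, y

# num_samples controls how many hyperparameter configurations are tried.
nf = NeuralForecast(models=[AutoMLP(h=1, num_samples=10)], freq="D")

# One global model is shared by all series; the search is not repeated per series.
cv_df = nf.cross_validation(df=df, n_windows=700, step_size=1)
```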
J.
@José Morales When using neuralforecast, I changed the n_windows parameter from 1 to 700 and it did not have any negative impact on the cross-validation speed. On the contrary, it was even faster with the larger n_windows (700 instead of 1): it took roughly 350 seconds with n_windows=1 and roughly 220 seconds with n_windows=700. Are you sure there is a hyperparameter search per window? To me, it seems like it searches for the best hyperparameters on the remaining data (data that does not overlap with the first day to be forecasted in the cross-validation) and then uses that model for the cross-validation. Or why is the process faster with more n_windows?
José Morales
Can you share the code you're running?
c
Hi @J. ! The hyperparameter search is done once for the entire dataset, since the model is global. The `n_windows` parameter only controls the length of the validation set. Each model with a particular configuration is trained once on the train data, and the validation loss is computed once for the entire validation set (consisting of `n_windows` windows). A larger `n_windows` only increases the length of the validation set; it does not increase the total time, because inference on the validation set is done efficiently in batches.