# neural-forecast
r
Hey guys, I joined the community recently and look forward to learning from the discussions. I am starting a time series project with NeuralForecast where I want to train a model and then use it for one-shot forecasting on other time series (transfer learning). I read the transfer learning article (https://nixtla.github.io/neuralforecast/examples/transfer_learning.html) and now I have two questions:

1. The getting-started guide (https://nixtla.github.io/neuralforecast/examples/getting_started_complete.html) says I need to provide data in long format, with a "unique_id" associated with each time series. The long format is fine and there will be unique IDs to distinguish the series, but when predicting on new data there will be new unique IDs. Would that affect the model? In other words, does the model learn the unique_ids as well? (A sketch of the format I mean is below.)
2. If there is a big dataset with thousands of time series, there will be far too many samples for the model. Is there a way to make the model use only some fixed number 'x' of samples from each series? I know the Darts library has a "max_samples_per_ts" hyperparameter that does exactly this (https://unit8co.github.io/darts/examples/14-transfer-learning.html).
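To make question 1 concrete, this is the kind of long-format layout I understand the docs to describe (column names `unique_id`, `ds`, `y` follow the getting-started guide; the series IDs and values here are just placeholders):

```python
import pandas as pd

# Hypothetical long-format frame: every series is stacked vertically and
# distinguished only by its unique_id; ds is the timestamp, y is the target.
Y_df = pd.DataFrame({
    "unique_id": ["store_1"] * 3 + ["store_2"] * 3,
    "ds": pd.to_datetime(["2023-01-01", "2023-02-01", "2023-03-01"] * 2),
    "y": [112.0, 118.0, 132.0, 98.0, 101.0, 109.0],
})
print(Y_df)
```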
c
Hi @Raghuvansh! Thanks for using our library! Regarding your questions:

1. The models do not learn unique_ids, nor do they use them to forecast. Models can learn time-series-specific dynamics only if you add static variables (such as a one-hot encoding of the unique_id).
2. During training, each batch is created by randomly sampling a subset of `batch_size` time series and `windows_batch_size` windows from that subset. We don't have a parameter to limit the total number of windows from each time series. However, we don't usually train models with all possible windows from each time series; we recommend training with a fixed number of training steps (`max_steps`). See the sketch after this list.
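A minimal sketch of how these parameters fit together (assuming `NHITS` as the model; the specific values are placeholders, not recommendations, and `Y_df` / `Y_df_new` stand for hypothetical long-format frames like the one above):

```python
from neuralforecast import NeuralForecast
from neuralforecast.models import NHITS

# Each training batch randomly samples `batch_size` series, then
# `windows_batch_size` windows from those series; `max_steps` caps the
# total number of gradient steps instead of iterating over every window.
model = NHITS(
    h=12,                    # forecast horizon
    input_size=24,           # length of the lookback window
    max_steps=1000,          # fixed number of training steps
    batch_size=32,           # series sampled per batch
    windows_batch_size=256,  # windows sampled from those series
)

nf = NeuralForecast(models=[model], freq="M")
nf.fit(df=Y_df)              # Y_df: long-format training frame

# Transfer learning: forecast unseen series by passing their history at
# predict time; the new unique_ids do not need to match the training IDs.
forecasts = nf.predict(df=Y_df_new)
```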
r
Regarding 2: my full dataset comprises multiple small datasets, and each small dataset has multiple time series. So rather than combining the small datasets, I could train the model separately on each small dataset, one by one, using max_steps so that the model does not learn too much from any one dataset. Does this approach make sense to you?
c
This is a very interesting question. I believe it would be better to pool all the data together, so that the model does not end up specializing on the latest datasets it sees. It is usually better to have variety in each batch during training than to group potentially similar time series together in each batch. In competitions such as the M4, high-performing models such as ESRNN and NBEATS were trained simultaneously on many time series from different domains and of different sizes.

In the near future we will also add the possibility of assigning weights to each unique_id and timestamp for the gradient computation.
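Pooling in this context can be as simple as concatenating the individual long-format frames before fitting once. A rough sketch, where `small_dfs` is a hypothetical dict mapping a dataset name to its long-format frame (the name prefix just keeps unique_ids from colliding across sources):

```python
import pandas as pd

# Prefix each unique_id with its source dataset so IDs stay globally unique,
# then stack everything into one long-format frame and fit a single model.
pooled_df = pd.concat(
    [df.assign(unique_id=name + "/" + df["unique_id"].astype(str))
     for name, df in small_dfs.items()],
    ignore_index=True,
)

nf.fit(df=pooled_df)  # one model trained on all series at once
```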
r
I was hoping to do that as well, but some datasets have significantly longer series and many more time series than others, so they contribute far more samples and would make the pooled dataset imbalanced. I could randomly sample a fixed number of time series from the larger datasets and then combine them with the smaller datasets to form one larger dataset. Would this be a reasonable approach, or should I think of something else? Your thoughts are always appreciated.
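One way to sketch that balancing step (the cap of 500 series per dataset is arbitrary and purely illustrative; `small_dfs` is the same hypothetical dict of long-format frames as above, and the idea is to sample whole unique_ids rather than individual rows):

```python
import pandas as pd

MAX_SERIES_PER_DATASET = 500  # arbitrary cap, tune to your data


def subsample_series(df, max_series=MAX_SERIES_PER_DATASET, seed=0):
    """Keep at most `max_series` whole series (by unique_id) from one dataset."""
    ids = pd.Series(df["unique_id"].unique())
    if len(ids) > max_series:
        ids = ids.sample(max_series, random_state=seed)
    return df[df["unique_id"].isin(ids)]


# Apply the cap to each source dataset, then pool them as in the earlier sketch.
balanced_parts = [subsample_series(df) for df in small_dfs.values()]
balanced_df = pd.concat(balanced_parts, ignore_index=True)
```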