Nixtla's Open Source Time Series Ecosystem.

Nixtla Community

Hi <@U03DDFKMT0X>! We are working on improving the library to scale training in multiple GPUs to improve training times (data parallel). We usually observed much more GPU usage, so I wonder if something else might be happening.

hmm okay I’ll do a bit more digging and see if I can add more details

have you noticed any bottlenecks on the data loaders when you’re using very large datasets?

The current limitation is that the model fits in the GPU, we do not have the capability right now to parallelize the model. We have trained NHITS with almost 100M parameters in one (large) GPU without any issue.

Yes, the dataset needs to fit in the memory. And then each training batch with the windows must fit in the GPU memory.

okay thanks. do you mind me asking what the memory footprint of the 100M is? we’re about a tenth that but also using the NHITS backbone

I think we’d expect the worker to crash if we were out of memory so we're probably fine there