I'd like to better understand how the lag transforms work. In particular, how are they applied at prediction time, when the future target values they are computed from are unknown, yet the model still requires those features in the future dataframe?
To help me understand, let's say I was not using Nixtla but trying to replicate MLForecast myself using lightgbm. I could calculate rolling stats using pandas, similar to what is now natively built into MLForecast. But how would the feature values for the future rows be created?
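
To make the question concrete, here is a minimal sketch of what I imagine the manual version would look like. It assumes a recursive strategy: predict one step ahead, append that prediction to the history, recompute the lag and rolling features, and repeat. I'm not certain this matches what MLForecast does internally. The names `make_features`, `recursive_forecast`, and `LinearStandIn` are hypothetical, and `LinearStandIn` is only a least-squares placeholder for a fitted `lightgbm.LGBMRegressor` (anything with `fit`/`predict` should slot in).

```python
import numpy as np
import pandas as pd


def make_features(y: pd.Series) -> pd.DataFrame:
    """Lag and rolling features built only from values before each timestamp."""
    return pd.DataFrame({
        "lag1": y.shift(1),
        # Rolling mean over the 3 values preceding the current row.
        "rolling_mean_lag1_w3": y.shift(1).rolling(3).mean(),
    })


class LinearStandIn:
    """Hypothetical placeholder for lightgbm.LGBMRegressor (ordinary least squares)."""

    def fit(self, X, y):
        A = np.column_stack([np.ones(len(X)), np.asarray(X, dtype=float)])
        self.coef_, *_ = np.linalg.lstsq(A, np.asarray(y, dtype=float), rcond=None)
        return self

    def predict(self, X):
        A = np.column_stack([np.ones(len(X)), np.asarray(X, dtype=float)])
        return A @ self.coef_


def recursive_forecast(model, history: pd.Series, horizon: int) -> list:
    """Assumed strategy: feed each prediction back in as if it were observed."""
    values = history.tolist()
    preds = []
    for _ in range(horizon):
        # Append a placeholder for the unknown next value, then take its feature row.
        y_ext = pd.Series(values + [np.nan])
        x_next = make_features(y_ext).iloc[[-1]]
        y_hat = float(model.predict(x_next)[0])
        preds.append(y_hat)
        values.append(y_hat)  # the prediction becomes part of the history
    return preds


# Toy series with a simple linear trend.
y = pd.Series(np.arange(1.0, 21.0))
train = make_features(y).assign(y=y).dropna()
model = LinearStandIn().fit(train[["lag1", "rolling_mean_lag1_w3"]], train["y"])
forecast = recursive_forecast(model, y, horizon=3)
print(forecast)
```

In this sketch the rolling mean for step 2 of the horizon is computed partly from the step 1 prediction, so errors would compound over the horizon. Is that the intended behaviour, or are the future feature values produced some other way?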