Brandon Barber
10/24/2024, 2:53 PMOlivier
10/24/2024, 6:50 PMBrandon Barber
10/24/2024, 7:08 PMbooked1
, but then not know what to do when h=2 and it is nullOlivier
10/25/2024, 7:42 AMhist_exog_list = ['booked1', 'booked2', ....]
in a NF model that supports historical exogenous variables (see the first link I shared above)
2. Consider the future values known by treating them as future exogenous variables, setting futr_exog_list = ['booked1', 'booked2', ....]
in a NF model that supports future exogenous variables. For the prediction phase, you will need to make predictions for the unknown values of these features, i.e. fill up the blank spots of the red part in your picture. Just start with something simple, e.g. (seasonal) naive prediction (I'd just start with forward-filling the unknown values, for example).
3. For each exogenous column, randomly set values to Null with probability n / 6, with n the amount missing in the prediction horizon. So for booked1
, you'd set 5/6 values in the training data Null. Next, add an indicator column booked1_missing
that indicates whether the value in booked1
is Null or not. Do the same for all exogenous columns but change the missingness amount based on the probability of the value appearing in the prediction horizon. Now assign a dummy value to the Null values, e.g. -1. Add all the exogenous variables (so both the booked1
and booked1_missing
) as a future exogenous (i.e. futr_exog_list = [ ]
). During prediction, use the values you have available, and don't forget to convert the prediction values (i.e. set Null to -1, and add the missingness column). This approach tries to simulate the availability of these exogenous of the prediction phase during training. Note that booked6
seems always available, so you don't have to do anything for that variable.
Hope this helpsBrandon Barber
10/25/2024, 8:54 PMOlivier
10/28/2024, 7:14 PM