# mlforecast
i
I think the new `forecast_fitted_values` function in MLForecast is broken. It's producing the transformed predictions rather than the inverse-transformed ones. I've written a GitHub issue.
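A minimal sketch of the call pattern under discussion, assuming an already-configured `MLForecast` instance `fcst`:
```python
# Fit storing the in-sample predictions, then retrieve them; the expectation
# is that forecast_fitted_values returns them on the original scale of y.
fcst.fit(df, fitted=True)
insample_df = fcst.forecast_fitted_values()
```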
j
Thanks. We'll work on a fix soon
Hey. Looking at this, I think it may be best not to apply the inverse transformations at all, because some of the transformations can't recover the original values of the target for the training set. For example, if you use Differences([24]), the last 24 values of each series are stored and then added back when predicting, so for the training set this produces wrong values. WDYT? The fix would consist in having the target in the same scale as the model predictions (right now it's in the original scale)
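A toy illustration (numbers invented, not from the thread) of why inverting a difference is only well-defined going forward:
```python
import pandas as pd

y = pd.Series([10.0, 12.0, 15.0, 19.0])
diffed = y.diff()  # the model trains on [NaN, 2.0, 3.0, 4.0]
# A forecast is inverted by adding back the stored last value of y (19.0):
next_pred_diffed = 3.0               # hypothetical model output
next_pred = 19.0 + next_pred_diffed  # valid only for the forecast horizon
# Fitted values for the training rows would each need a *different* earlier
# value of y, which the transform does not keep around.
```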
i
I noticed the issues with Differences as well. I'm not sure what a good fix would be, but it seems pretty important to have. Otherwise, I'm not sure how you'd use mlforecast with hierarchicalforecast.
j
Hey. We just released 0.9.3 with a couple of bug fixes related to restoring the fitted values with target_transformations. Please let us know if you run into any more issues; it turned out to be more complex than I expected
i
I'll take a look. Thanks for the update!
Do you know when you'll be pushing the new version of hierarchicalforecast? It was last updated in March, so I've been using the git version since then.
j
I have to fix an issue with the aggregation; after that I think we can make a release. I think next week
i
Sweet - it's great how fast and responsive your team is.
@José Morales I'm taking a look at the updated `forecast_fitted_values` and still having some trouble. If I fit MLForecast with `dropna=True`, I get an error (`Found different number of groups in fitted differences.`) because the fitted values will have groups that were dropped in training but should be used. But if I use `dropna=False`, then I'm unable to fit, since some of the models can't handle NA values. What do you recommend I do?
j
Damn haha. Can you provide an example? I'm guessing this happens for series shorter than the max difference, right?
i
Yep. I think that's the case
Let me come up with some example data
j
Can you try removing those series? They'd be removed from the training anyway, and I think the fitted values take this into account, so it wouldn't compute predictions for them
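A hedged sketch of that filtering, assuming `Differences([24])` is the largest difference as in the earlier example:
```python
# keep only series with more observations than the largest difference
max_diff = 24
sizes = df.groupby('unique_id')['ds'].count()
long_enough = sizes[sizes > max_diff].index
df = df[df['unique_id'].isin(long_enough)].copy()
```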
i
Should I put the example in github or post it here?
j
Either one is fine, whichever you prefer
i
```python
import pandas as pd
from sklearn.linear_model import LinearRegression

from mlforecast import MLForecast
from mlforecast.utils import generate_daily_series
from mlforecast.target_transforms import Differences
from window_ops.rolling import rolling_mean


horizon = 10

# Create a basic series
df = generate_daily_series(
    n_series=10,
    min_length=10,
    max_length=25,
    equal_ends=True,
)
# inspect the date span of each series
df.groupby(['unique_id'])['ds'].agg(['min', 'max'])

test_end = df['ds'].max()
train_end = test_end - pd.Timedelta(days=horizon)

# train/test split (note: not used below; the failing calls fit on the full df)
train_df = df.query("ds <= @train_end")


# Differences([1]) plus the lag/rolling features drops early rows of each series
fcst = MLForecast(
    models={'lr': LinearRegression()},
    freq='D',
    lags=range(1, 7),
    lag_transforms={
        i: [(rolling_mean, 3), (rolling_mean, 4)]
        for i in range(1, 7)
    },
    target_transforms=[Differences([1])],
    date_features=['day', 'dayofweek', 'week', 'month', 'quarter', 'year'],
)

# THIS FAILS WHEN COMPUTING THE FITTED VALUES (dropna=True drops short series)
fcst.fit(
    df,
    fitted=True,
    dropna=True,
)

# THIS FAILS WHEN TRAINING (NaN rows are kept and the model can't handle them)
fcst.fit(
    df,
    fitted=True,
    dropna=False,
)
```
If I remove the series with NAs, how would you go about reconciling them after the fact? I need `forecast_fitted_values` for `aggregate` in hierarchicalforecast.
j
In order to unblock you at least partially, would it be OK for you to impute the null values?
If you use, as the model, a pipeline that imputes first, you could keep the linear regression and use dropna=False
```python
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline

lr = make_pipeline(SimpleImputer(strategy='constant', fill_value=0), LinearRegression())
fcst = MLForecast(
    models={'lr': lr},
    ...  # remaining arguments as in the earlier snippet
)
```
i
I don't mind trying that at all, but it looks like it'll fail because it doesn't impute `y`.
```python
lr = make_pipeline(SimpleImputer(strategy='constant', fill_value=0), LinearRegression())
fcst = MLForecast(
    models={'lr': lr},
    freq='D',
    lags=range(1, 7),
    lag_transforms={
        i: [(rolling_mean, 3), (rolling_mean, 4)]
        for i in range(1, 7)
    },
    target_transforms=[Differences([1])],
    date_features=['day', 'dayofweek', 'week', 'month', 'quarter', 'year'],
)
fcst.fit(
    df,
    fitted=True,
    dropna=False,
)
```
j
Ah, right. My bad. Let me take a look at a possible fix
In the meantime you can use an approach like the one here and manually impute X and y
In that example I call `MLForecast.ts.predict`, but you can also override the `MLForecast.models_` attribute with a dict from str to model.
Nevermind that, I think that won't work either for the fitted values haha. Sorry
i
How about removing the offending series from MLForecast entirely? Then, when it comes to `aggregate`, filling in the missing series with the fitted and predicted values from `statsforecast.HistoricAverage`? It may mess with reconciliation, but since we have so little data about the missing series, it may help nevertheless
j
Yeah sure, if you can model those series in some other way that'd work
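A hedged sketch of that workaround, assuming the statsforecast `forecast(..., fitted=True)` / `forecast_fitted_values()` API; `dropped_ids` and `horizon` are placeholders:
```python
from statsforecast import StatsForecast
from statsforecast.models import HistoricAverage

# model the series MLForecast dropped, so every unique_id has both fitted
# and predicted values before reconciliation
short_df = df[df['unique_id'].isin(dropped_ids)]
sf = StatsForecast(models=[HistoricAverage()], freq='D')
preds = sf.forecast(df=short_df, h=horizon, fitted=True)
insample = sf.forecast_fitted_values()
```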
i
It'll mess with the reconciliation, but code-wise it should work. Should I write a GitHub issue to track this going forward?
j
Yes, please.
j
So you need to have the fitted values even if the data wasn't actually seen by the model, right?
i
Let me take a step back and describe my problem. I have N time series I need to predict. They follow a hierarchical setup: A -> B -> C -> D. (Separately, is there a good way to include A, B, C, and D as features in MLForecast rather than just as the unique id?) They all share an end date but have different start dates. I'd like to predict all of them using statsforecast and mlforecast, reconcile them with hierarchicalforecast, and ensemble the best predictions per time series. Therefore, I need the fitted values for all the time series. Is there anything I could be doing to avoid the issue?
j
You can add them as static features. Another thing you could try for the short-series problem is adding leading zeros or backfilling to get the minimum number of samples
I'm going to work on raising an error sooner for those cases, but I don't think there's much that can be done if we aren't able to get a single row with all the features to predict
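A minimal sketch of the leading-zeros idea, assuming daily frequency and an invented `min_len` threshold:
```python
import pandas as pd

def pad_with_zeros(g: pd.DataFrame, min_len: int) -> pd.DataFrame:
    # prepend zero-valued rows so the series reaches min_len observations
    n_missing = min_len - len(g)
    if n_missing <= 0:
        return g
    pad = pd.DataFrame({
        'unique_id': g['unique_id'].iloc[0],
        'ds': pd.date_range(end=g['ds'].min() - pd.Timedelta(days=1),
                            periods=n_missing, freq='D'),
        'y': 0.0,
    })
    return pd.concat([pad, g], ignore_index=True)

df = df.groupby('unique_id', group_keys=False).apply(pad_with_zeros, min_len=30)
```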
i
Addressing "You can add them as static features": how do you go about adding static features with a hierarchy? Given the unique id is A/B/C, we have rows that are A, A/B, and A/B/C; what values do B and C get for the A rows?
Addressing "trying to raise an error": is there a default behavior you can put in? For instance, if a unique_id would be totally dropped, default to historic average so at least we'll have results for all unique_ids?
j
I think you could try giving the rows that only have A the same value for B; it could be something like "B" or "Aggregate"
i
That's a good idea. It'd be great to have that handled automatically in `aggregate`. Would it help if I added a GitHub issue proposing that?
Just to be clear on what you're suggesting, would it look something like this (assuming a strict hierarchy of Country -> State -> Purpose):
Made with this function:
```python
import warnings
from typing import List

import numpy as np
import pandas as pd
from sklearn.preprocessing import OneHotEncoder
# private helper used by hierarchicalforecast's own aggregate
from hierarchicalforecast.utils import _to_upper_hierarchy

def aggregate2(
    df: pd.DataFrame,
    spec: List[List[str]],
    is_balanced: bool = False,
    sparse_s: bool = False,
):
    """Utils Aggregation Function.
    Aggregates bottom level series contained in the pandas DataFrame `df` according
    to levels defined in the `spec` list.

    Parameters
    ----------
    df : pandas DataFrame
        Dataframe with columns `['ds', 'y']` and columns to aggregate.
    spec : list of list of str
        List of levels. Each element of the list should contain a list of columns of `df` to aggregate.
    is_balanced : bool (default=False)
        Deprecated.
    sparse_s : bool (default=False)
        Return `S_df` as a sparse dataframe.

    Returns
    -------
    Y_df : pandas DataFrame
        Hierarchically structured series.
    S_df : pandas DataFrame
        Summing dataframe.
    tags : dict
        Aggregation indices.
    """
    # Checks
    if df.isnull().values.any():
        raise ValueError('`df` contains null values')
    if is_balanced:
        warnings.warn(
            "`is_balanced` is deprecated and will be removed in a future version. "
            "Don't set this argument to suppress this warning.",
            category=DeprecationWarning,
        )
    spec = sorted(spec, key=len)
    # ordered de-duplication of the columns that appear across all levels
    seen = set()
    all_spec = [x for x in [item for sublist in spec for item in sublist] if not (x in seen or seen.add(x))]
    all_dummies = [f'{i}_{j}' for i in all_spec for j in df[i].unique()]
    all_agg = [f'agg_{i}' for i in all_spec]
    bottom = spec[-1]
    aggs = []
    tags = {}
    for i, levels in enumerate(spec):
        agg = df.groupby(levels + ['ds'])['y'].sum().reset_index('ds')
        group = agg.index.get_level_values(0)
        for j, level in enumerate(levels):
            if j > 0:
                group = group + '/' + agg.index.get_level_values(level).str.replace('/', '_')
            # keep the raw level column and add one-hot dummies for it
            agg[level] = agg.index.get_level_values(level).values
            agg = pd.concat([agg, pd.get_dummies(agg[level], prefix=level, dtype=int)], axis=1)
        agg.index = group
        agg.index.name = 'unique_id'
        tags['/'.join(levels)] = group.unique().values

        # levels absent at this aggregation get NaN plus an indicator column
        for j in all_spec:
            if j not in agg:
                agg[j] = np.nan
            agg[f'agg_{j}'] = agg[j].isna().astype(int)

        for j in all_dummies:
            if j not in agg:
                agg[j] = 0
        aggs.append(agg)

    Y_df = (
        pd.concat(aggs)
        [['ds', 'y'] + all_spec + all_dummies + all_agg]
    )

    # construct S
    bottom_key = '/'.join(bottom)
    bottom_levels = tags[bottom_key]
    S = np.empty((len(bottom_levels), len(spec)), dtype=object)
    for j, levels in enumerate(spec[:-1]):
        S[:, j] = _to_upper_hierarchy(bottom, bottom_levels, '/'.join(levels))
    S[:, -1] = tags[bottom_key]
    categories = list(tags.values())
    try:
        encoder = OneHotEncoder(categories=categories, sparse_output=sparse_s, dtype=np.float32)
    except TypeError:  # sklearn < 1.2
        encoder = OneHotEncoder(categories=categories, sparse=sparse_s, dtype=np.float32)    
    S = encoder.fit_transform(S).T
    if sparse_s:
        df_constructor = pd.DataFrame.sparse.from_spmatrix
    else:
        df_constructor = pd.DataFrame
    S_df = df_constructor(S, index=np.hstack(categories), columns=bottom_levels)
    return Y_df, S_df, tags
```
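A hypothetical usage, with column names assuming the Country -> State -> Purpose hierarchy from above:
```python
spec = [['Country'], ['Country', 'State'], ['Country', 'State', 'Purpose']]
Y_df, S_df, tags = aggregate2(df, spec)
```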
j
Is that for the dummies?
i
Yep yep
j
I was thinking of using str.split and then maybe a fillna. For example:
```python
Y_df, S_df, tags = aggregate(df, spec)
Y_df = Y_df.reset_index()
# n=2 caps the split at the three expected columns
Y_df[['country', 'cat1', 'cat2']] = Y_df['unique_id'].str.split('/', n=2, expand=True)
for col in ('cat1', 'cat2'):
    Y_df[col] = Y_df[col].fillna(col)
```
You could then use these directly in LightGBM, or use pd.get_dummies or a one-hot encoder
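A hedged sketch of that last step, one-hot encoding the recovered columns and passing them to `MLForecast.fit` via its `static_features` argument:
```python
import pandas as pd

Y_df = pd.get_dummies(Y_df, columns=['country', 'cat1', 'cat2'], dtype=int)
static_cols = [c for c in Y_df.columns if c not in ('unique_id', 'ds', 'y')]
fcst.fit(Y_df, fitted=True, static_features=static_cols)
```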
Oh btw you were interested in a new release of hierarchicalforecast, right? We released 0.4.0 yesterday
i
I added an enhancement idea on GitHub. The code I wrote is slightly more complex because, with non-strictly-hierarchical datasets, you're not guaranteed to get the same variable in 'cat1' every time. I included the Tourism example in my issue.
Let me know what you think