Fiveable

๐Ÿ“ŠBusiness Forecasting Unit 7 Review

QR code for Business Forecasting practice questions

7.2 Integrated (I) processes and differencing

๐Ÿ“ŠBusiness Forecasting
Unit 7 Review

7.2 Integrated (I) processes and differencing

Written by the Fiveable Content Team โ€ข Last updated September 2025
Written by the Fiveable Content Team โ€ข Last updated September 2025
๐Ÿ“ŠBusiness Forecasting
Unit & Topic Study Guides

ARIMA models are key tools for forecasting time series data. They rely on understanding integrated processes, which are non-stationary series that need differencing to become stationary. This concept is crucial for proper model selection and accurate predictions.

Differencing is a technique used to transform non-stationary data into stationary data. It involves subtracting lagged values from current values to remove trends. The Augmented Dickey-Fuller test helps determine if differencing is necessary and how many times to apply it.

Integrated Processes

Understanding Integrated Processes and Their Order

  • Integrated process refers to a time series that requires differencing to achieve stationarity
  • Order of integration denotes the number of times a series must be differenced to become stationary
  • I(0) process represents a stationary series, requiring no differencing
  • I(1) process becomes stationary after first differencing
  • I(2) process requires second-order differencing to achieve stationarity
  • Unit root indicates non-stationarity in a time series, causing persistent effects of shocks
  • Random walk constitutes a common example of an I(1) process (stock prices, exchange rates)
    • Follows the equation: Yt=Ytโˆ’1+ฯตtY_t = Y_{t-1} + \epsilon_t
    • Current value depends entirely on the previous value plus a random shock

Distinguishing Between Trend-Stationary and Difference-Stationary Processes

  • Trend-stationary process becomes stationary after removing a deterministic trend
    • Can be modeled as: Yt=ฮฑ+ฮฒt+ฯตtY_t = \alpha + \beta t + \epsilon_t
    • Detrending involves subtracting the trend component
  • Difference-stationary process achieves stationarity through differencing
    • Modeled as: Yt=Ytโˆ’1+ฮผ+ฯตtY_t = Y_{t-1} + \mu + \epsilon_t
    • First difference removes the stochastic trend: ฮ”Yt=ฮผ+ฯตt\Delta Y_t = \mu + \epsilon_t
  • Importance of correctly identifying the process type
    • Misspecification leads to incorrect model selection and unreliable forecasts
    • Trend-stationary series require detrending, while difference-stationary series need differencing

Differencing and Testing

Applying Differencing Techniques

  • Differencing involves subtracting lagged values from current values to remove trends and achieve stationarity
  • First-order differencing calculates the change between consecutive observations
    • Equation: ฮ”Yt=Ytโˆ’Ytโˆ’1\Delta Y_t = Y_t - Y_{t-1}
  • Second-order differencing applies the first-difference operator twice
    • Equation: ฮ”2Yt=ฮ”(ฮ”Yt)=(Ytโˆ’Ytโˆ’1)โˆ’(Ytโˆ’1โˆ’Ytโˆ’2)\Delta^2 Y_t = \Delta(\Delta Y_t) = (Y_t - Y_{t-1}) - (Y_{t-1} - Y_{t-2})
  • Seasonal differencing removes seasonal patterns by subtracting values from previous seasons
    • For monthly data with annual seasonality: ฮ”12Yt=Ytโˆ’Ytโˆ’12\Delta_{12} Y_t = Y_t - Y_{t-12}
  • Overdifferencing can introduce unnecessary complexity and reduce forecast accuracy
    • Symptoms include increased variance and oscillatory behavior in the differenced series

Conducting and Interpreting the Augmented Dickey-Fuller Test

  • Augmented Dickey-Fuller (ADF) test determines the presence of a unit root in a time series
  • Null hypothesis assumes the presence of a unit root (series is non-stationary)
  • Alternative hypothesis suggests the series is stationary
  • Test statistic compared to critical values determines rejection or failure to reject the null hypothesis
  • P-value interpretation guides decision-making
    • P-value < significance level (typically 0.05) rejects the null hypothesis, indicating stationarity
    • P-value > significance level fails to reject the null hypothesis, suggesting non-stationarity
  • ADF test equation includes lagged differences to account for serial correlation
    • ฮ”Yt=ฮฑ+ฮฒt+ฮณYtโˆ’1+ฮด1ฮ”Ytโˆ’1+...+ฮดpฮ”Ytโˆ’p+ฯตt\Delta Y_t = \alpha + \beta t + \gamma Y_{t-1} + \delta_1 \Delta Y_{t-1} + ... + \delta_p \Delta Y_{t-p} + \epsilon_t
  • Multiple versions of the ADF test accommodate different trend assumptions (no constant, constant, trend)