library(tidymodels)
library(modeldata)
data("drinks")
glimpse(drinks)
#> Rows: 309
#> Columns: 2
#> $ date <date> 1992-01-01, 1992-02-01, 1992-03-01, 1992-04-01, 1992-0…
#> $ S4248SM144NCEN <dbl> 3459, 3458, 4002, 4564, 4221, 4529, 4466, 4137, 4126, 4…
Modeling time series with tidy resampling
Calculate performance estimates for time series forecasts using resampling.
Introduction
To use code in this article, you will need to install the following packages: forecast, sweep, tidymodels, timetk, and zoo.
“Demo Week: Tidy Forecasting with sweep” is an excellent article that uses tidy methods with time series. This article uses their analysis with rsample to find performance estimates for future observations using rolling forecast origin resampling.
Example data
The data for this article are sales of alcoholic beverages originally from the Federal Reserve Bank of St. Louis website.
Each row represents one month of sales (in millions of US dollars).
Time series resampling
Suppose that we need predictions for one year ahead and our model should use the most recent data from the last 20 years. To set up this resampling scheme:
<- rolling_origin(
roll_rs
drinks, initial = 12 * 20,
assess = 12,
cumulative = FALSE
)
nrow(roll_rs)
#> [1] 58
roll_rs#> # Rolling origin forecast resampling
#> # A tibble: 58 × 2
#> splits id
#> <list> <chr>
#> 1 <split [240/12]> Slice01
#> 2 <split [240/12]> Slice02
#> 3 <split [240/12]> Slice03
#> 4 <split [240/12]> Slice04
#> 5 <split [240/12]> Slice05
#> 6 <split [240/12]> Slice06
#> 7 <split [240/12]> Slice07
#> 8 <split [240/12]> Slice08
#> 9 <split [240/12]> Slice09
#> 10 <split [240/12]> Slice10
#> # ℹ 48 more rows
Each split
element contains the information about that resample:
$splits[[1]]
roll_rs#> <Analysis/Assess/Total>
#> <240/12/309>
For plotting, let’s index each split by the first day of the assessment set:
<- function(x) {
get_date min(assessment(x)$date)
}
<- map(roll_rs$splits, get_date)
start_date $start_date <- do.call("c", start_date)
roll_rshead(roll_rs$start_date)
#> [1] "2012-01-01" "2012-02-01" "2012-03-01" "2012-04-01" "2012-05-01"
#> [6] "2012-06-01"
This resampling scheme has 58 splits of the data so that there will be 58 ARIMA models that are fit. To create the models, we use the auto.arima()
function from the forecast package. The rsample functions analysis()
and assessment()
return a data frame, so another step converts the data to a ts
object called mod_dat
using a function in the timetk package.
library(forecast) # for `auto.arima`
library(timetk) # for `tk_ts`
library(zoo) # for `as.yearmon`
<- function(x, ...) {
fit_model # suggested by Matt Dancho:
%>%
x analysis() %>%
# Since the first day changes over resamples, adjust it
# based on the first date value in the data frame
tk_ts(start = .$date[[1]] %>% as.yearmon(),
frequency = 12,
silent = TRUE) %>%
auto.arima(...)
}
Save each model in a new column:
$arima <- map(roll_rs$splits, fit_model)
roll_rs
# For example:
$arima[[1]]
roll_rs#> Series: .
#> ARIMA(4,1,1)(0,1,2)[12]
#>
#> Coefficients:
#> ar1 ar2 ar3 ar4 ma1 sma1 sma2
#> -0.1852 -0.0238 0.3577 -0.1517 -0.8311 -0.193 -0.3244
#> s.e. 0.1466 0.1656 0.1440 0.0809 0.1377 0.067 0.0640
#>
#> sigma^2 = 72198: log likelihood = -1591.15
#> AIC=3198.3 AICc=3198.97 BIC=3225.7
(There are some warnings produced by these regarding extra columns in the data that can be ignored.)
Model performance
Using the model fits, let’s measure performance in two ways:
- Interpolation error will measure how well the model fits to the data that were used to create the model. This is most likely optimistic since no holdout method is used.
- Extrapolation or forecast error evaluates the performance of the model on the data from the following year (that were not used in the model fit).
In each case, the mean absolute percent error (MAPE) is the statistic used to characterize the model fits. The interpolation error can be computed from the Arima
object. To make things easy, let’s use the sweep package’s sw_glance()
function:
library(sweep)
$interpolation <- map_dbl(
roll_rs$arima,
roll_rsfunction(x)
sw_glance(x)[["MAPE"]]
)
summary(roll_rs$interpolation)
#> Min. 1st Qu. Median Mean 3rd Qu. Max.
#> 2.841 2.921 2.950 2.947 2.969 3.135
For the extrapolation error, the model and split objects are required. Using these:
<- function(split, mod) {
get_extrap <- nrow(assessment(split))
n # Get assessment data
<- assessment(split) %>%
pred_dat mutate(
pred = as.vector(forecast(mod, h = n)$mean),
pct_error = ( S4248SM144NCEN - pred ) / S4248SM144NCEN * 100
)mean(abs(pred_dat$pct_error))
}
$extrapolation <-
roll_rsmap2_dbl(roll_rs$splits, roll_rs$arima, get_extrap)
summary(roll_rs$extrapolation)
#> Min. 1st Qu. Median Mean 3rd Qu. Max.
#> 2.371 3.231 3.629 3.654 4.113 5.453
What do these error estimates look like over time?
%>%
roll_rs select(interpolation, extrapolation, start_date) %>%
pivot_longer(cols = matches("ation"), names_to = "error", values_to = "MAPE") %>%
ggplot(aes(x = start_date, y = MAPE, col = error)) +
geom_point() +
geom_line()
It is likely that the interpolation error is an underestimate to some degree, as mentioned above.
It is also worth noting that rolling_origin()
can be used over calendar periods, rather than just over a fixed window size. This is especially useful for irregular series where a fixed window size might not make sense because of missing data points, or because of calendar features like different months having a different number of days.
The example below demonstrates this idea by splitting drinks
into a nested set of 26 years, and rolling over years rather than months. Note that the end result accomplishes a different task than the original example; in this new case, each slice moves forward an entire year, rather than just one month.
# The idea is to nest by the period to roll over,
# which in this case is the year.
<- drinks %>%
roll_rs_annual mutate(year = as.POSIXlt(date)$year + 1900) %>%
nest(data = c(date, S4248SM144NCEN)) %>%
rolling_origin(
initial = 20,
assess = 1,
cumulative = FALSE
)
analysis(roll_rs_annual$splits[[1]])
#> # A tibble: 20 × 2
#> year data
#> <dbl> <list>
#> 1 1992 <tibble [12 × 2]>
#> 2 1993 <tibble [12 × 2]>
#> 3 1994 <tibble [12 × 2]>
#> 4 1995 <tibble [12 × 2]>
#> 5 1996 <tibble [12 × 2]>
#> 6 1997 <tibble [12 × 2]>
#> 7 1998 <tibble [12 × 2]>
#> 8 1999 <tibble [12 × 2]>
#> 9 2000 <tibble [12 × 2]>
#> 10 2001 <tibble [12 × 2]>
#> 11 2002 <tibble [12 × 2]>
#> 12 2003 <tibble [12 × 2]>
#> 13 2004 <tibble [12 × 2]>
#> 14 2005 <tibble [12 × 2]>
#> 15 2006 <tibble [12 × 2]>
#> 16 2007 <tibble [12 × 2]>
#> 17 2008 <tibble [12 × 2]>
#> 18 2009 <tibble [12 × 2]>
#> 19 2010 <tibble [12 × 2]>
#> 20 2011 <tibble [12 × 2]>
The workflow to access these calendar slices is to use bind_rows()
to join each analysis set together.
mutate(
roll_rs_annual,extracted_slice = map(splits, ~ bind_rows(analysis(.x)$data))
)#> # Rolling origin forecast resampling
#> # A tibble: 6 × 3
#> splits id extracted_slice
#> <list> <chr> <list>
#> 1 <split [20/1]> Slice1 <tibble [240 × 2]>
#> 2 <split [20/1]> Slice2 <tibble [240 × 2]>
#> 3 <split [20/1]> Slice3 <tibble [240 × 2]>
#> 4 <split [20/1]> Slice4 <tibble [240 × 2]>
#> 5 <split [20/1]> Slice5 <tibble [240 × 2]>
#> 6 <split [20/1]> Slice6 <tibble [240 × 2]>
Session information
#> ─ Session info ─────────────────────────────────────────────────────
#> setting value
#> version R version 4.3.1 (2023-06-16)
#> os macOS Ventura 13.5.2
#> system aarch64, darwin20
#> ui X11
#> language (EN)
#> collate en_US.UTF-8
#> ctype en_US.UTF-8
#> tz America/Los_Angeles
#> date 2023-09-26
#> pandoc 3.1.1 @ /Applications/RStudio.app/Contents/Resources/app/quarto/bin/tools/ (via rmarkdown)
#>
#> ─ Packages ─────────────────────────────────────────────────────────
#> package * version date (UTC) lib source
#> broom * 1.0.5 2023-06-09 [1] CRAN (R 4.3.0)
#> dials * 1.2.0 2023-04-03 [1] CRAN (R 4.3.0)
#> dplyr * 1.1.3 2023-09-03 [1] CRAN (R 4.3.0)
#> forecast * 8.21.1 2023-08-31 [1] CRAN (R 4.3.0)
#> ggplot2 * 3.4.3 2023-08-14 [1] CRAN (R 4.3.0)
#> infer * 1.0.5 2023-09-06 [1] CRAN (R 4.3.0)
#> parsnip * 1.1.1 2023-08-17 [1] CRAN (R 4.3.0)
#> purrr * 1.0.2 2023-08-10 [1] CRAN (R 4.3.0)
#> recipes * 1.0.8 2023-08-25 [1] CRAN (R 4.3.0)
#> rlang 1.1.1 2023-04-28 [1] CRAN (R 4.3.0)
#> rsample * 1.2.0 2023-08-23 [1] CRAN (R 4.3.0)
#> sweep * 0.2.5 2023-09-25 [1] Github (business-science/sweep@8dcfa11)
#> tibble * 3.2.1 2023-03-20 [1] CRAN (R 4.3.0)
#> tidymodels * 1.1.1 2023-08-24 [1] CRAN (R 4.3.0)
#> timetk * 2.8.4 2023-09-22 [1] CRAN (R 4.3.1)
#> tune * 1.1.2 2023-08-23 [1] CRAN (R 4.3.0)
#> workflows * 1.1.3 2023-02-22 [1] CRAN (R 4.3.0)
#> yardstick * 1.2.0 2023-04-21 [1] CRAN (R 4.3.0)
#> zoo * 1.8-12 2023-04-13 [1] CRAN (R 4.3.0)
#>
#> [1] /Users/emilhvitfeldt/Library/R/arm64/4.3/library
#> [2] /Library/Frameworks/R.framework/Versions/4.3-arm64/Resources/library
#>
#> ────────────────────────────────────────────────────────────────────