Box-Cox transformation is applied to time series to stabilize variation and make the data stationary to be more appropriate for modelling. The formula is as follows:
\(\lambda\) is estimated by maximum likelihood. From the formulas, when \(\lambda\)=1, there is no transformation, and when \(\lambda\)=0, the data becomes the natural log transformation.
In the context of the Australian electricity production time series, there is an upward trend and multiplicative seasonality. The Box-Cox method compresses larger values more to make seasonal changes constant over time. We use this because models like ARIMA, ETS, and TLSM all require stationary data.
Basic Method using aus_production from fpp3
library(fpp3)
Warning: package 'fpp3' was built under R version 4.5.2
Registered S3 method overwritten by 'tsibble':
method from
as_tibble.grouped_df dplyr
The accuracy metrics for the seasonal naive were better across all metrics for box-cox. The only difference is that during modelling, GWh is no longer the correct label for output because the values have been transformed by lambda. This difference is then reversed during forecasting to put it back into the original scale, GWh.