Introduction
This assignment builds a conceptual understanding of time series
decomposition and of forecasting from a decomposed series. We compare
methods of decomposition and also look for the training sample size
that produces the best forecasting performance.
Data Description
- Month: Month of the year as a number (e.g., 1 = Jan, 2 = Feb, …)
- Year: Year of the observation
- LowTemp: Lowest temperature
- HighTemp: Highest temperature
- WarmestMin: The warmest minimum temperature
- ColdestHigh: The coldest high temperature
- AveMin: The average minimum temperature
- AveMax: The average maximum temperature
- meanTemp: The mean temperature
- TotPrecip: The total precipitation
- TotSnow: The total snowfall
- Max24hrPrecip: The maximum amount of precipitation in 24 hours
Define time series object
Since this is monthly data, frequency = 12 will be used to define the
time series object.
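As a rough illustration of what a frequency of 12 means, the sketch below orders monthly records chronologically and tags each observation with its position in the yearly cycle. It is written in Python and is not the code used for the analysis; the record layout, the function name `make_monthly_series`, and the sample values are all assumptions made for the example.

```python
# Hypothetical records: (Year, Month, meanTemp). The values are invented
# for illustration and are not taken from the assignment's data set.
records = [
    (2020, 2, 31.5), (2020, 1, 29.0), (2020, 3, 40.2),
    (2020, 4, 51.1), (2020, 5, 61.3), (2020, 6, 70.4),
    (2020, 7, 75.0), (2020, 8, 73.2), (2020, 9, 66.0),
    (2020, 10, 54.7), (2020, 11, 43.9), (2020, 12, 33.8),
    (2021, 1, 30.1), (2021, 2, 32.4),
]

FREQUENCY = 12  # observations per seasonal cycle (months per year)


def make_monthly_series(rows, frequency=FREQUENCY):
    """Sort rows chronologically and attach a cycle position to each value."""
    ordered = sorted(rows, key=lambda r: (r[0], r[1]))
    # A regular time series needs equally spaced points, so reject skipped months.
    for (y0, m0, _), (y1, m1, _) in zip(ordered, ordered[1:]):
        if (y1 * frequency + m1) - (y0 * frequency + m0) != 1:
            raise ValueError(f"gap between {y0}-{m0} and {y1}-{m1}")
    start = (ordered[0][0], ordered[0][1])
    values = [v for _, _, v in ordered]
    # Position within the cycle: 1 = January, ..., 12 = December.
    positions = [m for _, m, _ in ordered]
    return start, values, positions


start, values, positions = make_monthly_series(records)
print(start, len(values), positions[:3], positions[-2:])
```

The start point plus the frequency is enough to recover the date of every observation, which is why a regular monthly series only needs those two pieces of information alongside the values.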
Forecasting with Decomposing
The following visual representations show the different behaviors of
the two methods of decomposition.
The second model appears to show the trend of the data more clearly.
The decomposition plots show a maximum around 2015 and a minimum around
2018. Both graphs are easy to interpret, but the trend in the second is
smoother and simpler to read than in the first.
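The two methods compared above are not reproduced here. As a point of reference, a minimal sketch of one standard approach, classical additive decomposition with a centered moving average, might look like the following in Python. The function names and the synthetic series are assumptions made for the example, and the sketch assumes an even period and at least two full cycles of data.

```python
import math


def centered_moving_average(x, period=12):
    """2 x period centered moving average; None where the window does not fit."""
    half = period // 2
    trend = [None] * len(x)
    for t in range(half, len(x) - half):
        window = x[t - half : t + half + 1]  # period + 1 points
        # End points get half weight so the window spans exactly one period.
        total = 0.5 * window[0] + sum(window[1:-1]) + 0.5 * window[-1]
        trend[t] = total / period
    return trend


def decompose_additive(x, period=12):
    """Split x into trend, seasonal, and remainder with x = T + S + R."""
    trend = centered_moving_average(x, period)
    # Average the detrended values separately for each position in the cycle.
    sums = [0.0] * period
    counts = [0] * period
    for t, (xt, tt) in enumerate(zip(x, trend)):
        if tt is not None:
            sums[t % period] += xt - tt
            counts[t % period] += 1
    means = [s / c for s, c in zip(sums, counts)]
    centre = sum(means) / period
    pattern = [m - centre for m in means]  # seasonal effects sum to zero
    seasonal = [pattern[t % period] for t in range(len(x))]
    remainder = [
        None if tt is None else xt - tt - st
        for xt, tt, st in zip(x, trend, seasonal)
    ]
    return trend, seasonal, remainder


# Synthetic monthly series (invented): linear trend plus a yearly sine wave.
series = [50 + 0.1 * t + 10 * math.sin(2 * math.pi * t / 12) for t in range(48)]
trend, seasonal, remainder = decompose_additive(series)
print(round(trend[10], 3), round(seasonal[3], 3))
```

The moving average leaves the first and last six months without a trend estimate, which is one reason a smoother-based method can give a trend line that is easier to read at the ends of the series.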
We next perform error analysis.
Error comparison between forecast results with different sample sizes
| Sample size | MSE | MAPE |
|---|---|---|
| n = 144 | 0.1555844 | 0.0001788 |
| n = 109 | 0.1603712 | 0.0001822 |
| n = 73 | 0.1732466 | 0.0001917 |
| n = 48 | 0.1981214 | 0.0002088 |

The table reports the MSE and MAPE for each model. We used the same
algorithm with four different training sample sizes and compared the
resulting accuracy measures. The sample size of 144 gives both the
lowest MSE and the lowest MAPE, and both error measures increase
steadily as the training size shrinks from 144 to 48, so the n = 144
model outperforms the others. We are confident that the larger sample
(n = 144) is a good representation of the data, and because it is the
largest training set we have no particular concerns about overfitting.
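The comparison procedure described above can be sketched as follows in Python. This is an illustration under stated assumptions, not the analysis code: the forecaster is a simple per-month average standing in for the decomposition-based forecast, the 12-month test period is assumed, MAPE is returned as a fraction, and the series is synthetic, so the printed numbers will not match the table.

```python
import math
import random


def mse(actual, predicted):
    """Mean squared error."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, returned as a fraction (not x 100)."""
    return sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def seasonal_mean_forecast(train, horizon, period=12):
    """Stand-in forecaster: average of each cycle position in the training data."""
    n = len(train)
    forecasts = []
    for h in range(horizon):
        position = (n + h) % period  # cycle position of the forecast month
        same_month = [v for i, v in enumerate(train) if i % period == position]
        forecasts.append(sum(same_month) / len(same_month))
    return forecasts


def compare_training_sizes(series, sizes, horizon=12, period=12):
    """Hold out the last `horizon` points and score each training size on them."""
    history, test = series[:-horizon], series[-horizon:]
    results = {}
    for n in sizes:
        train = history[-n:]  # the most recent n observations before the test set
        predicted = seasonal_mean_forecast(train, horizon, period)
        results[n] = (mse(test, predicted), mape(test, predicted))
    return results


# Synthetic monthly series (invented): yearly cycle plus seeded random noise.
rng = random.Random(0)
series = [
    50 + 10 * math.sin(2 * math.pi * t / 12) + rng.gauss(0, 2)
    for t in range(168)
]
for n, (err_mse, err_mape) in compare_training_sizes(series, [144, 109, 73, 48]).items():
    print(f"n = {n:3d}  MSE = {err_mse:.4f}  MAPE = {err_mape:.4f}")
```

Every training size is scored on the same held-out months, so differences in MSE and MAPE reflect only how much history the model was given.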
Conclusion
We presented the initial time series, its decomposition by two methods,
and an error analysis of the forecasts. We judged the second
decomposition to show the trend more clearly, and found that the
training size n = 144 gave the lowest error.
