Kathy, VP, IOT Analytics has asked to perform an in-depth analysis of the power consumption dataset. As a part of the analysis, we explored the data via visualization and time series regression modeling. In this process.
Step: 1 :: Understand the importance of granularity and subsetting data into meaninful time periods.
Step: 2 :: Explore data using visualization techniques ans identify visualization that comtains most useful information and present toclient.
Step: 3 :: Develop atleast three time series regression models and eork with seasonal and non-seasonal forecasting.
Step: 4 :: Summarizing the analysis and make recommendations to client.
## Loading required package: lubridate
##
## Attaching package: 'lubridate'
## The following object is masked from 'package:base':
##
## date
## Loading required package: dplyr
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:lubridate':
##
## intersect, setdiff, union
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
## Loading required package: tidyr
## Warning in strptime(xx, f <- "%Y-%m-%d %H:%M:%OS", tz = tz): unknown
## timezone '%Y/%m/%d %H:%M:%S'
## Warning in as.POSIXct.POSIXlt(x): unknown timezone '%Y/%m/%d %H:%M:%S'
## Warning in strptime(x, f, tz = tz): unknown timezone '%Y/%m/%d %H:%M:%S'
## Warning in as.POSIXct.POSIXlt(as.POSIXlt(x, tz, ...), tz, ...): unknown
## timezone '%Y/%m/%d %H:%M:%S'
In this step,
I try to understnad how subsetting of data works by adjusting granularity and create visualizationsof data subsets. Visualization techniques are used to understand the data to find patterns and then perfrom regression analysis.
Data subsetting helps to adjust granularity and maximize the infromation gain.Granularity describes the frequency of observations within a time series data set.From the data description we know that the observations were taken once per minute over the period of almost 4 years. That’s over 2 million observations.
## Loading required package: plotly
## Loading required package: ggplot2
##
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
##
## last_plot
## The following object is masked from 'package:stats':
##
## filter
## The following object is masked from 'package:graphics':
##
## layout