This presentation is included in the last assignment from the online course Developing Data Products (https://www.coursera.org/learn/data-products)
The main challenge of these slides is to provide some key information regarding the course project
Due to te requirements, it has been generated using RStudio (https://www.rstudio.com) and Slidify (http://slidify.org)
The main challenge is to create an application. I decided to create an application using the dataset ‘mtcars’ to predict the consumption regarding the other factors.Hence I deciced to name my application MPG Prediction.
A Random Forest prediction model is generated, based on parameters such as the type of transmission. This application is dedicated to the users as they can play with all the parameters to predict the consumption of their car.
The application and the current presentation source codes can be found at https://github.com/maximeverges/Developing-Data-Products. 3 files are included: ui.R, server.R and RD_model.R.
The dataset which has been used by the application is the Motor Trend Car Road Tests which is called ‘mtcars’ in R. The dataset highlights the fuel consumption regarding 10 factors concerning 32 automobiles. The structure is detailed:
## 'data.frame': 32 obs. of 11 variables:
## $ mpg : num 21 21 22.8 21.4 18.7 18.1 14.3 24.4 22.8 19.2 ...
## $ cyl : num 6 6 4 6 8 6 8 4 4 6 ...
## $ disp: num 160 160 108 258 360 ...
## $ hp : num 110 110 93 110 175 105 245 62 95 123 ...
## $ drat: num 3.9 3.9 3.85 3.08 3.15 2.76 3.21 3.69 3.92 3.92 ...
## $ wt : num 2.62 2.88 2.32 3.21 3.44 ...
## $ qsec: num 16.5 17 18.6 19.4 17 ...
## $ vs : num 0 0 1 1 0 1 0 1 1 1 ...
## $ am : num 1 1 1 0 0 0 0 0 0 0 ...
## $ gear: num 4 4 4 3 3 3 3 4 4 4 ...
## $ carb: num 4 4 1 1 2 1 4 2 2 4 ...
A Random Forest prediction model is prefered. The aim is to predict the fuel consumption (mpg variable) regarding the other variables:
data("mtcars")
dataStructure <- capture.output(str(mtcars))
set.seed(12345)
training_control <- trainControl(method = "cv", number = 10)
random_forest_model <- function() {
return(
train(
mpg ~ .,
data = mtcars,
method = "rf",
trControl = training_control
)
)
}
random_forest_predicition <- function(model, parameters) {
prediction <- predict(
model,
newdata = parameters
)
return(prediction)
}