10/16/2020

Coursera Reproducible Pitch

See the Regression Models Course Project

  • URL: https://github.com/LeoEras/Developing-Data-Products
  • All the data used for this presentation, and for the first part of the Data Science project, can be found here. The assignment reads: “First, you will create a Shiny application and deploy it on Rstudio’s servers. Second, you will use Slidify or Rstudio Presenter to prepare a reproducible pitch presentation about your application.”

Iris Dataset

This famous (Fisher’s or Anderson’s) iris data set gives the measurements in centimeters of the variables sepal length and width and petal length and width, respectively, for 50 flowers from each of 3 species of iris. The species are Iris setosa, versicolor, and virginica.
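The description above can be checked directly in R. The following is a minimal sketch, assuming only a standard R installation, where `iris` is available through the built-in `datasets` package; the variable names `species_counts` and `n_obs` are illustrative and not part of the original application.

```r
# iris ships with base R (package 'datasets'), so no download is needed
data(iris)

# Number of flowers recorded for each species
species_counts <- table(iris$Species)
print(species_counts)

# Total number of observations
n_obs <- nrow(iris)
print(n_obs)
```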

Source

Fisher, R. A. (1936) The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, Part II, 179–188.

The data were collected by Anderson, Edgar (1935). The irises of the Gaspe Peninsula, Bulletin of the American Iris Society, 59, 2–5.

Iris Dataset - Format

A data frame with 150 observations on 5 variables.

Index Field
[, 1] Sepal length
[, 2] Sepal width
[, 3] Petal length
[, 4] Petal width
[, 5] Species
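As a rough illustration of the format table, the sketch below inspects the columns by index. It assumes the built-in `iris` data frame; note that R spells the column names with dots (for example `Sepal.Length`). The names `col_names`, `col_classes`, and `species_means` are hypothetical helpers introduced here for illustration.

```r
data(iris)

# Column names in index order; R spells them with dots, e.g. "Sepal.Length"
col_names <- names(iris)
print(col_names)

# Column 5 is a factor; columns 1 to 4 are numeric measurements in centimeters
col_classes <- sapply(iris, class)
print(col_classes)

# Mean of each measurement per species, using index-based column selection
species_means <- aggregate(iris[, 1:4], by = list(Species = iris[, 5]), FUN = mean)
print(species_means)
```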

Analysis - Main Code

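  library(shiny); library(caret); library(rattle)  # reactive(), train(), fancyRpartPlot()

  # Server-side excerpt; irisData is assumed to hold the built-in iris data frame.
  # Formula text for the chosen variable, e.g. "Sepal.Length ~ Species"
  formulaText <- reactive({
    paste0(input$variable, " ~ Species")
  })

  # Fit a classification tree (method "rpart") predicting Species
  modFit <- train(Species ~ ., method = "rpart", data = irisData)

  # Print the final tree's splits
  output$fit <- renderPrint({
    print(modFit$finalModel)
  })

  # Plot the final tree
  output$irisCT <- renderPlot({
    fancyRpartPlot(modFit$finalModel)
  })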
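
The model-fitting step can also be tried outside Shiny. The sketch below is not the application's code: it calls `rpart()` directly instead of going through caret's `train()`, so no resampling-based tuning takes place, and it assumes the `rpart` package is installed (it ships with standard R distributions). The names `tree_fit`, `predicted`, `confusion`, and `accuracy` are illustrative.

```r
library(rpart)  # recommended package shipped with standard R installations

data(iris)

# Fit a classification tree predicting Species from the four measurements
tree_fit <- rpart(Species ~ ., data = iris, method = "class")
print(tree_fit)

# Predict the species for the training rows and tabulate against the truth
predicted <- predict(tree_fit, newdata = iris, type = "class")
confusion <- table(Predicted = predicted, Actual = iris$Species)
print(confusion)

# Training accuracy (optimistic, since the same rows were used for fitting)
accuracy <- sum(diag(confusion)) / nrow(iris)
print(accuracy)
```

Because the accuracy is measured on the rows used for fitting, it overstates how well the tree would do on new flowers; a held-out split or cross-validation would give a fairer estimate.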