8/10/2020

R Markdown

The Iris flower data set or Fisher iris data set is a multivariate data set introduced by Ronald Fisher in his 1936 article, The use of multiple measurements in taxonomic problems as an example of linear discriminant analysis.1 Sometimes called Anderson’s Iris data set because Edgar Anderson collected the data to quantify the morphological variation of the Iris flower from three related species. Two of the three species were collected in the Gaspesian Peninsula "all from the same pasture, and all collected on the same day and measured at the same time by the same person with the same device.

For more details on the iris database see https://es.wikipedia.org/wiki/Conjunto_de_datos_flor_iris.

Content

  • Summary
  • Plot iris

Summary

summary(iris)
##   Sepal.Length    Sepal.Width     Petal.Length    Petal.Width   
##  Min.   :4.300   Min.   :2.000   Min.   :1.000   Min.   :0.100  
##  1st Qu.:5.100   1st Qu.:2.800   1st Qu.:1.600   1st Qu.:0.300  
##  Median :5.800   Median :3.000   Median :4.350   Median :1.300  
##  Mean   :5.843   Mean   :3.057   Mean   :3.758   Mean   :1.199  
##  3rd Qu.:6.400   3rd Qu.:3.300   3rd Qu.:5.100   3rd Qu.:1.800  
##  Max.   :7.900   Max.   :4.400   Max.   :6.900   Max.   :2.500  
##        Species  
##  setosa    :50  
##  versicolor:50  
##  virginica :50  
##                 
##                 
## 

Plot iris

#library(plotly)
plot_ly(iris, x = iris$Sepal.Length, y = iris$Sepal.Width)