The famous (Fisher’s or Anderson’s) iris data set contains 150 observations of iris flowers, 50 from each of three species, and includes the following variables: Sepal.Length, Sepal.Width, Petal.Length, Petal.Width, and Species.
The average sepal length, sepal width, petal length, and petal width across all observations are 5.8, 3.1, 3.8, and 1.2 cm, respectively. Additional descriptive statistics are detailed below:
| Sepal.Length | Sepal.Width | Petal.Length | Petal.Width | Species |
|---|---|---|---|---|
| Min. :4.300 | Min. :2.000 | Min. :1.000 | Min. :0.100 | setosa :50 |
| 1st Qu.:5.100 | 1st Qu.:2.800 | 1st Qu.:1.600 | 1st Qu.:0.300 | versicolor:50 |
| Median :5.800 | Median :3.000 | Median :4.350 | Median :1.300 | virginica :50 |
| Mean :5.843 | Mean :3.057 | Mean :3.758 | Mean :1.199 | |
| 3rd Qu.:6.400 | 3rd Qu.:3.300 | 3rd Qu.:5.100 | 3rd Qu.:1.800 | |
| Max. :7.900 | Max. :4.400 | Max. :6.900 | Max. :2.500 | |
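The figures above can be reproduced with a few lines of base R. This is a minimal sketch, assuming the statistics were computed from the `iris` data frame that ships with R; the variable names `measurements` and `avg` are illustrative and not taken from the original analysis.

```r
# iris ships with base R (package 'datasets'), so no extra packages are needed.
data(iris)

# Keep only the four numeric measurement columns (Species is a factor).
measurements <- iris[, c("Sepal.Length", "Sepal.Width",
                         "Petal.Length", "Petal.Width")]

# Column means, rounded to one decimal place as quoted in the text.
avg <- round(colMeans(measurements), 1)
print(avg)

# Quartiles, median, and mean for each numeric column,
# plus the number of observations per species.
print(summary(iris))
```

`summary()` reports counts rather than quantiles for a factor column, which is why the Species column lists 50 observations for each species instead of a minimum or mean.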
The following graphic plots petal length against sepal length, with points colored by iris species.
library(ggplot2)
ggplot(iris, aes(Sepal.Length, Petal.Length, color = Species)) +
  geom_point()