Introduction

The famous iris data set (Fisher’s or Anderson’s) contains 150 observations of iris flowers, 50 from each of three species, and includes the following variables: Sepal.Length, Sepal.Width, Petal.Length, Petal.Width, and Species.
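
As a quick check of the description above, the following sketch inspects the data set's dimensions and columns. It assumes the copy of iris that ships with base R's datasets package; the report does not say how the data were loaded.

```r
# iris is bundled with base R (datasets package), so no download is needed
data(iris)

dim(iris)    # number of observations and number of variables
names(iris)  # variable names
str(iris)    # type of each variable; Species is a factor
```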

Descriptive Statistics

The mean sepal length, sepal width, petal length, and petal width across all observations are 5.8, 3.1, 3.8, and 1.2 cm, respectively. Additional descriptive statistics are detailed below:

Statistic  Sepal.Length  Sepal.Width  Petal.Length  Petal.Width
Min.              4.300        2.000         1.000        0.100
1st Qu.           5.100        2.800         1.600        0.300
Median            5.800        3.000         4.350        1.300
Mean              5.843        3.057         3.758        1.199
3rd Qu.           6.400        3.300         5.100        1.800
Max.              7.900        4.400         6.900        2.500

Species counts: setosa 50, versicolor 50, virginica 50
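
The statistics above can be reproduced with a short sketch like the one below. It assumes base R's built-in iris data frame and uses only base functions; the variable name `means` is illustrative, and the report's own code for this section is not shown.

```r
# Quartiles, mean, and range for each numeric column, plus counts per species
summary(iris)

# Means of the four measurement columns, rounded to one decimal place
means <- round(colMeans(iris[, 1:4]), 1)
means
```

Rounding to one decimal place gives the figures quoted in the text, while summary() reports the means to more digits.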

Illustration

The following graphic illustrates the relationship between sepal length and petal length, with points colored by iris species.

library(ggplot2)

# Scatter plot of petal length against sepal length, one color per species
ggplot(iris, aes(x = Sepal.Length, y = Petal.Length, color = Species)) +
  geom_point()