The Iris dataset consists of measurements of sepal and petal lengths and widths for three iris flower species: Setosa, Versicolor, and Virginica. With 150 samples, each species has 50 observations. This widely-used dataset is a go-to example for illustrating classification and clustering techniques in machine learning due to its distinct species characteristics.
The dataset consists of the following four numerical attributes (features):
Sepal Length: The length of the sepal (the leaf-like structure at the base of the flower).
Sepal Width: The width of the sepal.
Petal Length: The length of the petal (the colorful part of the flower).
Petal Width: The width of the petal.
For this prediction we only use the Petal Length and Petal Width to predict the species of iris.
Sepal.Length Sepal.Width Petal.Length Petal.Width
Min. :4.300 Min. :2.000 Min. :1.000 Min. :0.100
1st Qu.:5.100 1st Qu.:2.800 1st Qu.:1.600 1st Qu.:0.300
Median :5.800 Median :3.000 Median :4.350 Median :1.300
Mean :5.843 Mean :3.057 Mean :3.758 Mean :1.199
3rd Qu.:6.400 3rd Qu.:3.300 3rd Qu.:5.100 3rd Qu.:1.800
Max. :7.900 Max. :4.400 Max. :6.900 Max. :2.500
Species
setosa :50
versicolor:50
virginica :50