Overview

Row

Project Information

Dataset: Iris Dataset
Author: FARANGIZ ABDUL
Student ID: 2307075125
Date: 2026-01-14

About the Dataset

The Iris dataset is a classic multivariate dataset commonly used in statistics and data visualization. It contains 150 observations of iris flowers from three different species: setosa, versicolor, and virginica.

For each flower, four numerical measurements are recorded: sepal length, sepal width, petal length, and petal width (all in centimeters). The dataset is well-suited for exploratory data analysis and visualization, as it includes both numerical and categorical variables.

Dataset Structure

Below is an overview of the variables and their data types in the dataset.
Rows: 150
Columns: 5
$ Sepal.Length <dbl> 5.1, 4.9, 4.7, 4.6, 5.0, 5.4, 4.6, 5.0, 4.4, 4.9, 5.4, 4.…
$ Sepal.Width  <dbl> 3.5, 3.0, 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.…
$ Petal.Length <dbl> 1.4, 1.4, 1.3, 1.5, 1.4, 1.7, 1.4, 1.5, 1.4, 1.5, 1.5, 1.…
$ Petal.Width  <dbl> 0.2, 0.2, 0.2, 0.2, 0.2, 0.4, 0.3, 0.2, 0.2, 0.1, 0.2, 0.…
$ Species      <fct> setosa, setosa, setosa, setosa, setosa, setosa, setosa, s…

Interactive Visualization

This section presents an interactive version of a scatter plot to explore the relationship between Sepal Length and Petal Length across different Iris species.

Visualizations

Row

Histogram of Sepal Length

This histogram shows the distribution of sepal length values in the Iris dataset.

Boxplot of Petal Length by Species

This boxplot compares petal length across different Iris species.

Scatter Plot: Sepal Width vs Petal Width

This scatter plot visualizes the relationship between sepal width and petal width, with points colored by species.