program9

Author

Sirisha BA

Create multiple histograms using ggplot2 :: factor_wrap() to visuaalize how a variable (e.g Sepal.Length) is distributed across different groups ( e.g. Species in a built-in R data set)

Step 1: Load necessary libraries.

library(ggplot2)
library(dplyr)

Attaching package: 'dplyr'
The following objects are masked from 'package:stats':

    filter, lag
The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

Step2 : Load the built-in dataset.

data(iris) #Load the iris dataset
head(iris) #view the first few rows of dataset.
  Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1          5.1         3.5          1.4         0.2  setosa
2          4.9         3.0          1.4         0.2  setosa
3          4.7         3.2          1.3         0.2  setosa
4          4.6         3.1          1.5         0.2  setosa
5          5.0         3.6          1.4         0.2  setosa
6          5.4         3.9          1.7         0.4  setosa

Step3: create histograms using factor_wrap

ggplot(iris, aes(x = Sepal.Length)) +
  geom_histogram(binwidth = 0.3, fill = "steelblue", color = "pink") +
  facet_wrap(~ Species) + # Wrap by Species, arrange in 2 columns
  labs(
    title = "Distribution of Sepal Length across Iris Species",
    x = "Sepal Length (cm)",
    y = "Frequency"
  ) +
  theme_minimal()