PROGRAM-2

Author

1NT23IS027-A SEC-ANKITHA

2. Write an R script to create a scatter plot, incorporating categorical analysis through color-coded data points representing different groups, using ggplot2.

Step 1: Load the libraries

library(ggplot2)
library(tidyverse)
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ lubridate 1.9.4     ✔ tibble    3.2.1
✔ purrr     1.0.4     ✔ tidyr     1.3.1
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors

Step 2: Load and inspect the dataset

# Load the iris dataset
data <- iris
# Display the first 10 rows
head(data, n = 10)
   Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1           5.1         3.5          1.4         0.2  setosa
2           4.9         3.0          1.4         0.2  setosa
3           4.7         3.2          1.3         0.2  setosa
4           4.6         3.1          1.5         0.2  setosa
5           5.0         3.6          1.4         0.2  setosa
6           5.4         3.9          1.7         0.4  setosa
7           4.6         3.4          1.4         0.3  setosa
8           5.0         3.4          1.5         0.2  setosa
9           4.4         2.9          1.4         0.2  setosa
10          4.9         3.1          1.5         0.1  setosa
# Count the observations in each species group
table(data$Species)

    setosa versicolor  virginica 
        50         50         50 
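
ggplot2 gives each level of a factor its own discrete color, so it can help to confirm that Species is stored as a factor before plotting. The following is a minimal sketch using base R only. The name `group_means` is introduced here for illustration and is not part of the original script.

```r
# Load the built-in iris dataset, as in Step 2
data <- iris

# Species should be a factor so that ggplot2 treats it as categorical
is.factor(data$Species)

# The factor levels become the legend entries, in this order
levels(data$Species)

# Per-species means of the two variables used in the scatter plot
# (group_means is an illustrative name, not from the original script)
group_means <- aggregate(cbind(Sepal.Length, Sepal.Width) ~ Species,
                         data = data, FUN = mean)
group_means
```

If the grouping column were stored as character or numeric instead, wrapping it in `factor()` before plotting would be one way to get a discrete color legend.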

Step 3: Create a scatter plot

ggplot(data, aes(x = Sepal.Length, y = Sepal.Width, color = Species))+
  geom_point(size = 3, alpha = 0.7) + 
  labs(title = "Scatter Plot of Sepal Dimensions",
       x = "Sepal Length",
       y = "Sepal Width",
       color = "Species") + 
  theme_minimal() +
  theme(legend.position = "top")
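
The plot above relies on ggplot2's default palette. As a possible variation, the sketch below assigns an explicit color to each species with `scale_color_manual()`. The specific colors and the names `species_colors` and `p` are assumptions chosen for illustration, not part of the original program.

```r
library(ggplot2)

# Hypothetical color choices; any valid R color names or hex codes would work.
# The vector names must match the levels of iris$Species.
species_colors <- c(setosa = "darkorange",
                    versicolor = "purple",
                    virginica = "steelblue")

# Same scatter plot as above, stored in an object and given a manual palette
p <- ggplot(iris, aes(x = Sepal.Length, y = Sepal.Width, color = Species)) +
  geom_point(size = 3, alpha = 0.7) +
  scale_color_manual(values = species_colors) +
  labs(title = "Scatter Plot of Sepal Dimensions",
       x = "Sepal Length",
       y = "Sepal Width",
       color = "Species") +
  theme_minimal() +
  theme(legend.position = "top")

# Draw the plot
print(p)
```

Naming the color vector by species keeps each color tied to its group even if the order of the factor levels changes.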