Program - 2

Author

Atharsh

Write an R script that creates a scatter plot incorporating categorical analysis, using color-coded data points to represent different groups, with ggplot2.

Step 1: Load the necessary libraries

library(ggplot2)
library(dplyr)

Attaching package: 'dplyr'
The following objects are masked from 'package:stats':

    filter, lag
The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

Step 2: Load the dataset

data <- iris
head(data)
  Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1          5.1         3.5          1.4         0.2  setosa
2          4.9         3.0          1.4         0.2  setosa
3          4.7         3.2          1.3         0.2  setosa
4          4.6         3.1          1.5         0.2  setosa
5          5.0         3.6          1.4         0.2  setosa
6          5.4         3.9          1.7         0.4  setosa
table(data$Species)

    setosa versicolor  virginica 
        50         50         50 
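
The counts above show three equally sized groups. Since dplyr is already loaded, a per-species summary can complement the plot by showing how the groups differ numerically. The following is a minimal sketch, assuming the same built-in iris dataset; the names species_means, mean_sepal_length, and mean_sepal_width are illustrative and do not appear in the original exercise.

```r
library(dplyr)

# Built-in iris dataset, as in Step 2
data <- iris

# Mean sepal length and width for each species (illustrative column names)
species_means <- data %>%
  group_by(Species) %>%
  summarise(
    mean_sepal_length = mean(Sepal.Length),
    mean_sepal_width  = mean(Sepal.Width),
    .groups = "drop"
  )

print(species_means)
```

The result has one row per species, which gives a quick numerical check on the separation between groups that the color-coded scatter plot shows visually.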

Step 3: Create the scatter plot

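# Map Species to color so each group gets its own color and legend entry
ggplot(data, aes(x = Sepal.Length, y = Sepal.Width, color = Species)) +
  # Slightly transparent points make overlapping observations easier to see
  geom_point(size = 3, alpha = 0.7) +
  labs(title = "Scatter Plot of Sepal Dimensions",
       x = "Sepal Length",
       y = "Sepal Width",
       color = "Species") +
  theme_minimal() +
  theme(legend.position = "top")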
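
As an optional extension of the categorical analysis, the plot can be stored in an object and given one linear trend line per species. This is only a sketch under assumptions that are not part of the original exercise: the object name p, the added geom_smooth() layer, and the manually chosen colors (taken from the Okabe-Ito colorblind-friendly palette).

```r
library(ggplot2)

# Built-in iris dataset, as in Step 2
data <- iris

# Store the plot in an object (p is an illustrative name) so it can be reused
p <- ggplot(data, aes(x = Sepal.Length, y = Sepal.Width, color = Species)) +
  geom_point(size = 3, alpha = 0.7) +
  # Because color is mapped to Species in the global aes(), geom_smooth()
  # fits a separate linear trend line for each species
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE) +
  # Assumed palette: three Okabe-Ito colors, one per species
  scale_color_manual(values = c(setosa     = "#E69F00",
                                versicolor = "#56B4E9",
                                virginica  = "#009E73")) +
  labs(title = "Scatter Plot of Sepal Dimensions",
       x = "Sepal Length",
       y = "Sepal Width",
       color = "Species") +
  theme_minimal() +
  theme(legend.position = "top")

print(p)
```

Keeping the plot in an object rather than printing it directly makes it easy to add further layers later or pass it to ggsave().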