R BOX PLOT

data(airquality)
str(airquality)
## 'data.frame':    153 obs. of  6 variables:
##  $ Ozone  : int  41 36 12 18 NA 28 23 19 8 NA ...
##  $ Solar.R: int  190 118 149 313 NA NA 299 99 19 194 ...
##  $ Wind   : num  7.4 8 12.6 11.5 14.3 14.9 8.6 13.8 20.1 8.6 ...
##  $ Temp   : int  67 72 74 62 56 66 65 59 61 69 ...
##  $ Month  : int  5 5 5 5 5 5 5 5 5 5 ...
##  $ Day    : int  1 2 3 4 5 6 7 8 9 10 ...
boxplot(airquality$Ozone)

INTERPRETATIONS FROM THE DATA PLOT

  • We can see that data above the median is more dispersed.
  • We can also notice two outliers at the higher extreme.

MAKING THE PLOT MORE MEANINGFUL

boxplot(airquality$Ozone,main="Mean ozone in parts per billion",xlab="Parts per Billion (ppb)",ylab="Ozone",col="RED",border="BLACK",horizontal=T,notch=T)