Questions

Find the mtcars data in R. This is the dataset that you will use to create your graphics. Use the dataset to draw graphics.

data("mtcars")
summary(mtcars)
##       mpg             cyl             disp             hp       
##  Min.   :10.40   Min.   :4.000   Min.   : 71.1   Min.   : 52.0  
##  1st Qu.:15.43   1st Qu.:4.000   1st Qu.:120.8   1st Qu.: 96.5  
##  Median :19.20   Median :6.000   Median :196.3   Median :123.0  
##  Mean   :20.09   Mean   :6.188   Mean   :230.7   Mean   :146.7  
##  3rd Qu.:22.80   3rd Qu.:8.000   3rd Qu.:326.0   3rd Qu.:180.0  
##  Max.   :33.90   Max.   :8.000   Max.   :472.0   Max.   :335.0  
##       drat             wt             qsec             vs        
##  Min.   :2.760   Min.   :1.513   Min.   :14.50   Min.   :0.0000  
##  1st Qu.:3.080   1st Qu.:2.581   1st Qu.:16.89   1st Qu.:0.0000  
##  Median :3.695   Median :3.325   Median :17.71   Median :0.0000  
##  Mean   :3.597   Mean   :3.217   Mean   :17.85   Mean   :0.4375  
##  3rd Qu.:3.920   3rd Qu.:3.610   3rd Qu.:18.90   3rd Qu.:1.0000  
##  Max.   :4.930   Max.   :5.424   Max.   :22.90   Max.   :1.0000  
##        am              gear            carb      
##  Min.   :0.0000   Min.   :3.000   Min.   :1.000  
##  1st Qu.:0.0000   1st Qu.:3.000   1st Qu.:2.000  
##  Median :0.0000   Median :4.000   Median :2.000  
##  Mean   :0.4062   Mean   :3.688   Mean   :2.812  
##  3rd Qu.:1.0000   3rd Qu.:4.000   3rd Qu.:4.000  
##  Max.   :1.0000   Max.   :5.000   Max.   :8.000
# import ggplot2 library
library(ggplot2)
# proportions of cars by 'carb' value
ggplot(data = mtcars, aes(x = factor(1), fill = factor(carb))) + 
      ylab("Proportions of cars by 'carb' value") + 
      xlab("") + 
      geom_bar(width = 1) + 
      coord_polar(theta = "y")

Observed from pie chart, it shows that cars with two and four carburetors have the highest proportion of 31.25%. Cars with one carburetor has proportion of 21.88%, cars with three carburetors has proportion of 9.37% and cars with six and eight carburetors have the smallest proportion of 3.12%.

# show the gear type in mtcars
ggplot(data = mtcars, aes(x = gear)) + geom_bar(stat = "count")

Observed from bar chart by gear type, it shows that 15 cars have three gears, 12 cars have four gears and 5 cars have five gears.

# show a stacked bar graph of the number of each gear type and divided out by cyl.
ggplot(mtcars, aes(x = factor(cyl), fill = factor(gear))) + 
    xlab("Values of 'cyl'") + 
    ylab("Values of 'count of gear'") + 
    geom_bar(color = "black")

Observed from the stached bar graph, it shows that for four cylinder cars, only 1 car has three gears, 8 cars have four gears and 2 cars have five gears; For six cyliner cars, 2 cars have three gears, 4 cars have four gears and 1 car has five years; For 8 cylinder cars, 12 cars have three gears, 2 cars have five gears and no car has four gears.

ggplot(mtcars, aes(x = wt, y = mpg)) +
    xlab("wt") + 
    ylab("mpg") +
    geom_point() +
    geom_line() +
    ggtitle("Relationship between 'wt' and 'mpg'") +
    stat_smooth(method = "loess", formula = y ~ x, size = 1, col = "red")

Observed from scatter plot, it shows that it is a negative relationship between weight and mpg. As weight increase, miles per gallon decreases.

I chose ggplot for the data visualization for this problem set, because the graphics are widely used to draw graphs and it is easy to use to plot a different graph, it has higher aesthetic values.