Scenario

We are interested in the Gapminder data set, which records measurements (such as life expectancy, GDP per capita, and population) for different countries over different years. Specifically, we will focus on the values from the year 2007. This will require us to create a new data set, gap_2007, which we will do here:

gap_2007 <- gap %>% filter(year == 2007)

Exploring the Data

Here, we calculate the dimensions of the data set and identify the names of the different variables in our gap_2007 data set. The results are recorded below:

We are interested in looking at histograms of each of our quantitative variables. The results are below:

Calculating Statistics for one Variable

We decide to hone in on one of our variables, namely gdpPercap/pop/lifeExp (choose one and erase the others). For this variable, we calculate the mean, median, IQR, and standard deviation in the space below:

Summary

(Here, write a bit about the shape of the data – skewed right or left – and discuss the relationship between the mean and the median)