Data Description

The coasters data set contains information on 766 roller coasters. It has twelve variables (columns); the relevant ones are:

  1. name: Name of the roller coaster
  2. type: The type of roller coaster (wood/steel/other)
  3. lift: The way the coaster is lifted up the first hill (chain/launch/other)
  4. length_ft & height_ft: The length and height of the roller coaster, in feet
  5. speed_mph: The max speed of the coaster, in miles per hour

Question 1: Basic GGPlot Scatterplot

Create a scatterplot of speed_mph by height_ft, with the points colored by length_ft, shaped by lift, and partially transparent. The finished graph should match the example in Brightspace.

Save the graph as gg_q1 and display the graph in the knitted document.
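
As a rough sketch of how these aesthetics map onto ggplot2 code, the example below uses a small made-up data frame (coasters_demo is hypothetical, and its values are invented stand-ins for the real coasters data). The alpha value of 0.5 is also an assumption; adjust it until the transparency matches the Brightspace example.

```r
library(ggplot2)

# Hypothetical stand-in rows; the real assignment uses the full coasters data set.
coasters_demo <- data.frame(
  height_ft = c(70, 205, 110, 325, 45, 150),
  speed_mph = c(50, 74, 55, 93, 35, 65),
  length_ft = c(2500, 5100, 3200, 6600, 1200, 4000),
  lift      = c("chain", "chain", "other", "launch", "chain", "launch")
)

# "speed_mph by height_ft" puts speed on the y-axis and height on the x-axis.
# color and shape are mapped inside aes() because they depend on columns;
# alpha is set outside aes() because it is a single fixed value.
gg_q1 <- ggplot(
  data = coasters_demo,
  mapping = aes(x = height_ft, y = speed_mph, color = length_ft, shape = lift)
) +
  geom_point(alpha = 0.5)

# Typing the object's name displays the graph when the document is knitted.
gg_q1
```

Swapping coasters_demo for the real data set should be the only change needed, assuming the column names match the list above.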

Does something seem wrong with the data? Describe what stands out as likely incorrect.

Question 2: Theme and Text

Create the graph shown in Brightspace for question 2. Save the graph as gg_q2 and display it in the knitted document.

Question 3: Changing axis and colors

Using the graph created in question 2, create the graph shown in Brightspace for question 3. Pay close attention to the labels used for all 4 aesthetics!

Hint: You can use the help menu for the appropriate function to find how to get the colors to match!

Save the graph as gg_q3 and make sure to display it in the knitted document!
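
The general pattern for relabeling all four aesthetics and changing a continuous color scale might look like the sketch below. Everything specific in it is a placeholder: the data frame is made up, the label text is invented, and scale_color_gradient() with yellow and red is only one possible choice, not necessarily the function or colors used in the Brightspace graph.

```r
library(ggplot2)

# Hypothetical stand-in rows, not the real coasters data.
coasters_demo <- data.frame(
  height_ft = c(70, 205, 110, 325, 45, 150),
  speed_mph = c(50, 74, 55, 93, 35, 65),
  length_ft = c(2500, 5100, 3200, 6600, 1200, 4000),
  lift      = c("chain", "chain", "other", "launch", "chain", "launch")
)

gg_base <- ggplot(
  coasters_demo,
  aes(x = height_ft, y = speed_mph, color = length_ft, shape = lift)
) +
  geom_point(alpha = 0.5)

# labs() takes one argument per aesthetic; the label text here is a placeholder.
# scale_color_gradient() sets the two ends of a continuous color scale;
# the colors here are placeholders too.
gg_relabeled <- gg_base +
  labs(
    x = "Height (ft)",
    y = "Max speed (mph)",
    color = "Length (ft)",
    shape = "Lift type"
  ) +
  scale_color_gradient(low = "yellow", high = "red")

gg_relabeled
```

Running ?scale_color_gradient in the console opens the help page for that function, which lists its color arguments.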

Question 4: Small Multiples

Using gg_q3, create three small multiples, one for Wood, one for Steel, and one for Other.

Since this is the final question, you don’t need to save the graph.
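
Small multiples are usually built by adding a facet layer to an existing plot. The sketch below shows one way to do that with facet_wrap(), again on a made-up data frame (coasters_demo and gg_demo are hypothetical names standing in for the real data and gg_q3).

```r
library(ggplot2)

# Hypothetical stand-in rows, not the real coasters data.
coasters_demo <- data.frame(
  type      = c("steel", "steel", "wood", "steel", "other", "wood"),
  height_ft = c(70, 205, 110, 325, 45, 150),
  speed_mph = c(50, 74, 55, 93, 35, 65),
  length_ft = c(2500, 5100, 3200, 6600, 1200, 4000),
  lift      = c("chain", "chain", "other", "launch", "chain", "launch")
)

gg_demo <- ggplot(
  coasters_demo,
  aes(x = height_ft, y = speed_mph, color = length_ft, shape = lift)
) +
  geom_point(alpha = 0.5)

# facet_wrap() draws one panel per distinct value of type,
# so three coaster types give three small multiples.
gg_demo + facet_wrap(vars(type))
```

Because the faceted plot is not assigned to a name, it is displayed directly rather than saved.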

Which type of coaster is the most common?

Which type of coaster is the rarest?
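
To back up what the panels show, the types can also be counted directly. This is a minimal sketch on a made-up vector (type_demo is hypothetical), so the counts it produces say nothing about the real coasters data.

```r
# Hypothetical type values, not the real coasters data.
type_demo <- c("steel", "steel", "wood", "steel", "other", "wood")

# table() counts each distinct value; sort() orders the counts,
# so the most common type comes first and the rarest comes last.
type_counts <- sort(table(type_demo), decreasing = TRUE)
type_counts
```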