The coasters data set has information on 766 roller coasters. There are twelve variables (columns), and the relevant ones are:
name: Name of the roller coaster
type: The type of roller coaster (wood/steel/other)
lift: The way the coaster is lifted up the first hill (chain/launch/other)
length_ft & height_ft: The length and height of the roller coaster, in feet
speed_mph: The max speed of the coaster, in miles per hour
Create a scatterplot of speed_mph by height_ft with the points colored by length_ft, shaped by lift, and partially transparent. See what the graph should look like in Brightspace.
Save the graph as gg_q1 and display the graph in the knitted document.
Does something seem wrong with the data? Describe what stands out as likely incorrect.
Create the graph seen in question 2 in Brightspace. Save the graph as gg_q2 and display it in the knitted document.
Using the graph created in question 2, create the graph seen in Brightspace for question 3. Make sure to pay close attention to the labels used for all four aesthetics!
Hint: You can use the help menu for the appropriate function to find how to get the colors to match!
Save the graph as gg_q3 and make sure to display it in the knitted document!
Using gg_q3, create three small multiples, one for Wood, one for Steel, and one for Other.
Since this is the final question, you don’t need to save the graph.
Which type of coaster is the most common?
Which type of coaster is the rarest?