1 Group Homework

  • You will work with your group to complete this assignment. .

  • Submit your group’s shared .Rmd AND your kintted html on Collab.

  • You don’t need to upload the html file on RPubs.

  • Confirm this with the following comment included in your submission text box: “Honor Pledge: I have recreated my group submission using using the tools I have installed on my own computer”

  • Name the files with a group name and YOUR name for your submission.

2 Part 1: gganimate

  • First, get the latest data from OWID.

    • The file should likely be named “owid-covid-data.xlsx” or “owid-covid-data.csv.” (or jason file)
  • Second, choose six countries that are of interest to you, and create a subset with the six countries.

  • Third, create an animated line plot of “Covid Deaths Each Day Per 100,000 People in Countries” as in our lecture note on Animations.

  • Fourth, describe your findings from the plot.

This animated line plot shows that among the six countries we picked to show in the graph, Switzerland and the United States were the two leading countries for having the highest COVID-19 deaths each day per 100,000 people for the vast majority of the time period shown. Near July of 2021, Afghanistan exceeds both countries in this rate for a short period of time, before the United States quickly surpasses it again. Yemen, Turkmenistan, and Taiwan were the three countries with the lowest rates among the six countries shown on this graph. The highest rate of COVID-19 deaths each day per 100,000 people occured at approximately January of 2021 in Switzerland.

3 Part 2: plotly

  • Create an animated plot with the Play button from package plotly. Please refer to our lecture note on Animations, feature of plotly, and cumulative anmiations.

  • First, use an appropriate dataset that is available in data repositories (e.g., kaggle). But don’t use the built-in datasets or datasets from packages. Don’t use any datasets you used for the previous assignments.

  • Second, briefly describe the dataset you’re using for this assignment (e.g., means to access data, context, sample, variables, etc…)

    • describe your data.
  • Third, create any animated plot with the Play button.

  • Fourth, describe your findings from the animated plot. This animated plot shows the number of Python related questions that were asked on Stack Overflow from the years 2008 to 2020. The graph shows that at the beginning of 2008, the number of questions asked was significantly under 5,000, but by the end of 2020 the number exceeded 20,000 questions asked in Python. This gives insight to how the usage of this programming language in the computer science field has grown at a significant rate, and shows parallels with how Python has emerged as one of the most common programming languages in the field in current day.