This final project for JayTerm requires you to choose one of the following papers below. From your choice, you should complete the following steps.

  1. Choose your paper and download everything in the folder. Each data folder has in it 4 things - a word document to explain the paper, the paper itself, the figure picture, and a dataset. The dataset reflects the figure named in the title of the folder.

  2. Your first task is to provide the reader (me) some summary information about the data. Use whatever information and ways you would like to visualize this summary data as you would like, drawing on various strategies gained in the swirl() courses.

  3. Your second task is to replicate as perfectly as possible the Figure. You should consider each part of the Figure as something to change, regardless of whether or not we have learned it directly in class. This includes things like:

    • Backgrounds of plots
    • Formatting of any plot placement
    • Shape of plots
    • Colors of plots
    • Lines on plots
    • Type of lines
    • Capitalization of words.
  4. Produce a different visualization of the data. This visualization should:

    • Be of a different geom_() then the current data.
    • Utilize at least four different options.
    • Utilize an additionaal two options in theme().
    • Utilize a color palette from either the wesanderson, harrypotter, or RColorBrewer packages.
    • Switch the order or placement of one of the grouping variables.
      • Note: If there is not a second grouping variable in the plot, you should create a new variable in the data and vary it in some way.
  5. All code should be presented and submitted via an .Rmd (RMarkdown file) so that I can read your code and see the output right below it.

  6. Below this code, you should also explain in written form what your code is doing at each step (“I used geom_box to create barplots, and I did this option to do this thing…”).

  7. You should include a brief write up (one paragraph) on why your visualization is better than the paper’s based on data visualization suggestions learned in class or through the Swirl() modules (Principles in Data Visualization may be helpful to review).

  8. This should be turned in both as a a.Rmd and a .pdf. To turn the .Rmd to the .pdf, remember to click the dropdown under Knit to turn into a .pdf.

Papers To Choose From

Suggested Other Things You Might Need

You might need the following packages or commands, depending on your choice, and depending how you wish to solve this problem.

library(cowplot) provides options for moving and placing plots. As in, you can make one plot, save it, and then make another, save the other, and then place them in specific ways.

geom_segment() is a ggplot2 function to draw tiny lines given an x and xend, and a y and yend. You can create many of these to make various shapes on graphs, including where one might draw significance lines.

Standard error is equal to the standard deviation divided by the square root of N. This comes in handy to create variables.

Packages and tools used to summarize data include, but are not limited to: