Communicating Experimental Results with R

This is Bob:

Bob is a scientist.

Bob likes science, and Bob likes doing science. Bob likes to run lots of experiments, and Bob has lots of results from these experiments. Not only that, Bob wants to share his results with the world. To do that, he has to publish papers. There is just one thing in his way…

This is Bill:

Bill is a reviewer.

Bill also likes science, but Bill hates bad science. Bob has to prove to Bill that his science is good science, and that people in the world deserve to read his science.

Bob's Science

One day, Bob did some science. Bob is a Taco Scientist. He came up with some great new taco fillings and wanted to compare them to the traditional Beef, Chicken, Steak, Pork, Fish, and Shrimp tacos. He also compared them using hard and soft taco shells, and asked different age groups what they thought of each taco. He sent his paper to Bill for review, along with this table summarizing his results:

Bob's Table

Bill's Review

Bill did not like this at all. While he is an avid taco fan, Bill is a very busy man, and he does not have time to stare at a large table full of numbers to try to understand Bob's results. Bill sent the paper back, and Bob was sad. Bob was sad because he knew his experiments were good, but he did not communicate the results effectively.

To convince Bill that he has done good science, Bob must communicate his ideas and experiments effectively. The first step in this process is actually doing good science. The experiments must be well designed, the impact clearly articulated, and the results well documented. Today, we focus on the latter, and more specifically, how to document and communicate large amounts of results in bite-sized pieces called figures.

Bob's Ultimatum

So, after doubting his scientific abilities and questioning his purpose in life, Bob set out to change his destiny. Bob was determined to get his paper accepted. To do this, he knew he had to summarize his results better. After much soul and Internet searching, Bob discovered R.

This is R:

R is a letter.

R is also a free software environment for statistical computing and graphics. Bob fell in love. He found that R was able to take his horrendous tables and turn them into wonderful representations of his beautiful experimental results.

Let's Get Started!

First, we will load Bob's taco results into a Dataframe. Dataframe's are a lot like tables, except you can perform operations on them and use them as input for figures. The kable() function is just a way of printing your Dataframes out in an easy-to-read manner, and it is part of the knitr package, so we use the library() function to load it. The head() function prints just the top few rows of the dataframe. To see how any function or library works, just do ??my_func to bring up the help content for it.

library(knitr)
taco.results <- read.csv('taco_results.csv')
kable(head(taco.results))

Let's Get Started!

library(knitr)
taco.results <- read.csv('taco_results.csv')
kable(head(taco.results, 4))

ShellType	Filling	AgeGroup	Rating
Hard	Beef	<13	0.983
Hard	Beef	13-20	0.918
Hard	Beef	21-39	0.866
Hard	Beef	40+	0.629

Following Along

If you would like to follow along, you can download the taco_results.csv file here

If you are following along and don't have any packages that Bob uses, just call install.packages("my_package") and R will install it for you.

Beef Tacos

First, Bob just wanted to know how people like his beef tacos. To do that, he has to subset his taco.results dataframe to only get those talking about the Beef filling. He uses the dplyr package. He can do this with the functions in R, but dplyr makes things nice and simple. Check out this quick cheat sheet on how to manipulate dataframes.

library(dplyr)
beef.ratings <- filter(taco.results, Filling == "Beef")
kable(head(beef.ratings))
mean(beef.ratings$Rating)

Beef Tacos

library(dplyr)
beef.ratings <- filter(taco.results, Filling == "Beef")
kable(head(beef.ratings, 3))

ShellType	Filling	AgeGroup	Rating
Hard	Beef	<13	0.983
Hard	Beef	13-20	0.918
Hard	Beef	21-39	0.866

mean(beef.ratings$Rating)

## [1] 0.856

Bob' First Figure

Time for some graphics! Let's see how the different age groups like both the hard and soft beef tacos. The ggplot2 package is our best friend here. You always start by calling the ggplot() function, passing your dataframe as the first argument, then using the aes() function to specify the various aestheic aspects of the graph. You then add other functions to the ggplot() call depending on what you want to do. In this case, we want a bar chart, so we use geom_bar(). Looks like older people dont seem to like beef tacos, and the ratings don't change much between the hard or soft taco shells.

library(ggplot2)
ggplot(beef.ratings, aes(x = AgeGroup, y = Rating, fill = ShellType)) + 
  geom_bar(stat = "identity", position = "dodge")

Bob's First Figure

library(ggplot2)
ggplot(beef.ratings, aes(x = AgeGroup, y = Rating, fill = ShellType)) + 
  geom_bar(stat = "identity", position = "dodge")

ggplot2

For a full overview of the ggplot2 package, check out the R Graphics Cookbook, written by the author of the package. There is a lot more that ggplot can do that we do not discuss here.

Facets

Bob is feeling good about his ggplot skills, let's try comparing all the different taco fillings. To make multiple plots, we can use the facet_grid() or facet_wrap() functions. We do the same thing as before, except we are using the original taco.results dataframe to compare every filling.

ggplot(taco.results, aes(x = AgeGroup, y = Rating, fill = ShellType)) + 
  geom_bar(stat = "identity", position = "dodge") +
  facet_wrap( ~ Filling, ncol = 3, scales = "free_x")

Facets

Histograms

Bob wondered what the distribution of his taco ratings were. A simple histogram will suffice.

ggplot(taco.results, aes(x = Rating)) + geom_histogram(binwidth=0.01)

Horizontal Bar Chart

Bob wants to see how each filling is rated overall. To do this, we take the average rating for each filling across all age groups and shell types. Notice that we flip the axis and do some theme-related things to make the plot look nicer.

filling.results <- taco.results %>%
  group_by(Filling) %>%
  summarise(Rating = mean(Rating))

ggplot(filling.results, aes(x = Filling, y = Rating, fill = Filling)) + 
  geom_bar(stat = "identity", position = "dodge", alpha = 0.7) + 
  coord_flip(ylim=c(0.8,0.875)) +
  theme_bw(base_size = 18)  + guides(fill=FALSE)

Horizontal Bar Chart

ggplot(filling.results, aes(x = Filling, y = Rating, fill = Filling)) + 
  geom_bar(stat = "identity", position = "dodge", alpha = 0.7) + 
  coord_flip(ylim=c(0.8,0.875)) +
  theme_bw(base_size = 18)  + guides(fill=FALSE)

Theme

Let's keep this simple theme for all the following graphs using theme_set():

theme_set(theme_bw(base_size = 18))

Boxplots

Remember Bob's big table? The second half of it is just descriptive stats about each row (Shell Type / Filling). A boxplot can summarize these numbers with ease (quick refresher on boxplots). Let's replicate that part of the table with a facet.

ggplot(taco.results, aes(x = Filling, y = Rating, fill = Filling)) + 
  geom_boxplot(alpha = 0.5) + guides(fill=FALSE) +
  facet_grid(. ~ ShellType) + coord_flip()

Boxplots

ggplot(taco.results, aes(x = Filling, y = Rating, fill = Filling)) + 
  geom_boxplot(alpha = 0.5) + guides(fill=FALSE) +
  facet_grid(. ~ ShellType) + coord_flip()

Something is still missing.

Bob can see that there is deviation between ratings, but he doesn't know why. This is when things tend to get complicated. Bob's experiment has a lot of factors, so he can't just do a simple xy plot to see all the trends in his data. Let's try different ways of grouping the data with different plots to find the best one. We've already seen that bar charts faceted by Filling do not work.

Jitter

Let's say Bob wants a quick overview of his results in just one plot. He uses geom_jitter() to make a scatter plot that jitters the x values to better visualize the different values in the data.

ggplot(taco.results, aes(x = AgeGroup, y = Rating,  color = ShellType)) + 
  geom_jitter(size=2)

Density Curves

Bob starts thinking about his histogram again, what are the distributions for each age group?

ggplot(taco.results, aes(x = Rating, fill=AgeGroup)) + 
  geom_density(alpha=0.3)

Violin

Bob's getting fancy now. Violin plots allow us to visualize the multiple distributions on one plot. Each violin is a density curve flipped and mirrored for each of the groups.

ggplot(taco.results, aes(x = AgeGroup, y = Rating, fill = AgeGroup)) + 
  geom_violin(color = "black", alpha = 0.3)

These two plots show some trends between the age groups and shell types, but what about the fillings?

Line Graphs

Bob had an idea. He wants to see how ratings for each filling change as the age group increases. Perfect time for a line graph, wouldn't you say, Bob?

ggplot(taco.results, aes(x = AgeGroup, 
                         y = Rating, group=ShellType, color=ShellType)) + 
  geom_line(size=1) + geom_point(size=3) + 
  theme(axis.text.x  = element_text(size=12)) + 
  facet_wrap( ~ Filling, ncol = 3,scales = "free_x")

Line Graphs

Heatmaps

The more mathemtical folks in the audience might have trouble with Bob's last plot. Bob did a line plot without having a continous x variable. Let's see if a heatmap will show the same trend while keeping the math people happy.

ggplot(taco.results, aes(AgeGroup, Filling)) + 
  geom_tile(aes(fill = Rating), color = "white") + 
  scale_fill_gradient(low = "white", high = "steelblue") +
  facet_grid(. ~ ShellType)

Heatmaps

ggplot(taco.results, aes(AgeGroup, Filling)) + 
  geom_tile(aes(fill = Rating), color = "white") + 
  scale_fill_gradient(low = "white", high = "steelblue") +
  facet_grid(. ~ ShellType)

It seems that as people get older, they tend to rate tacos lower, regardless of the filling or shell type. Bob smells a bias.

ANOVA & HSD

This isn't a tutorial on statistical analysis with R (for that, check this out), but Bob would be remiss without doing some HSD tests and plotting them with ggplot. Bob does a quick ANOVA test (please note the horrible statistics of this, there is only one sample for each observation):

library(agricolae)
taco.anova <- aov(Rating~ShellType*AgeGroup,data = taco.results)
summary(taco.anova)
HSD.test(taco.anova, "AgeGroup", alpha = 0.5, console = TRUE)

taco.hsd <- data.frame(
  TukeyHSD(taco.anova,"AgeGroup", conf.level=.95)$AgeGroup)
taco.hsd$Comparison <- row.names(taco.hsd)

ggplot(taco.hsd, aes(x = Comparison, y = diff, 
                     ymin = lwr, ymax = upr, color=Comparison)) +
  geom_pointrange(size=1.2) + coord_flip() + guides(color=FALSE) +
  ylab("Difference in Mean Rating by Age Groups")

ANOVA & HSD

library(agricolae)
taco.anova <- aov(Rating~ShellType*AgeGroup,data = taco.results)
summary(taco.anova)

##                     Df Sum Sq Mean Sq  F value  Pr(>F)    
## ShellType            1 0.0077  0.0077   12.042 0.00071 ***
## AgeGroup             3 2.5275  0.8425 1318.430 < 2e-16 ***
## ShellType:AgeGroup   3 0.0026  0.0009    1.348 0.26183    
## Residuals          128 0.0818  0.0006                     
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

ANOVA & HSD

HSD.test(taco.anova, "AgeGroup", alpha = 0.5, console = TRUE)

Groups, Treatments and means:
a    <13     0.9832 
b    13-20   0.9267 
c    21-39   0.8626 
d    40+     0.6252

taco.hsd <- data.frame(
  TukeyHSD(taco.anova,"AgeGroup", conf.level=.95)$AgeGroup)
taco.hsd$Comparison <- row.names(taco.hsd)

ANOVA & HSD

ggplot(taco.hsd, aes(x = Comparison, y = diff, 
                     ymin = lwr, ymax = upr, color=Comparison)) +
  geom_pointrange(size=1.2) + coord_flip() + guides(color=FALSE) +
  ylab("Difference in Mean Rating by Age Groups")

Bob's Victory

Armed with his new visualization skills, Bob marched his paper back to Bill and got it accepted.

Congrats, Bob!

Moral of the Story

Use figures in your papers and you will become rich and famous just like Bob.