Tuesday Lecture: Permutation test for two group means

Sleep vs Caffeine experiment

In an experiment on memory (Mednick et al., 2008), students were given lists of 24 words to memorize. After hearing the words, they were assigned at random to one of two groups. One group of 12 students took a nap for 1.5 hours, while a second group of 12 students stayed awake and was given a caffeine pill.

These data contain the number of words each participant was able to recall after the break:

Memory <- read.csv("http://www.mosaic-web.org/go/datasets/SleepCaffeine.csv")

You want to test whether the data indicate a difference in mean number of words recalled between the two treatments. Let’s use a permutation test to do so.

Interlude: Permutation Tests

In testing a null hypothesis we need a test statistic that will have different values under the null hypothesis (the means of the two groups are the same) and the alternative hypothesis we care about (i.e., the means are different).

To test the hypothesis, we need to know the sampling distribution of the test statistic when the null hypothesis is true. For some test statistics and some null hypotheses, this distribution cannot be derived analytically. The p-value tells us how likely it is, if the null hypothesis is true, for the test statistic to be at least as extreme as the one we observed.

A permutation test rests on one idea: if the treatment has absolutely no effect on the outcome, the group labels are arbitrary, so data sets with shuffled labels should look like the real data; otherwise they should look different from the real data. Shuffling the labels many times gives a simple way to compute the sampling distribution of any test statistic under this null hypothesis, and the ranking of the real test statistic among the shuffled test statistics allows us to obtain a p-value.
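As a minimal sketch of this idea, the example below enumerates every possible relabelling of a tiny data set, which gives the exact null distribution rather than a simulated one. The values in `words` are made up for illustration (they are not the SleepCaffeine data), and the names `splits`, `null_diffs` and `p_exact` are hypothetical.

```r
# Hypothetical toy data (NOT from the SleepCaffeine study): 4 scores per group.
words <- c(14, 16, 15, 17, 11, 13, 12, 15)
group <- rep(c("Sleep", "Caffeine"), each = 4)

# Observed statistic: mean(Sleep) - mean(Caffeine).
obs <- mean(words[group == "Sleep"]) - mean(words[group == "Caffeine"])

# Every way to choose which 4 of the 8 positions receive the "Sleep" label.
splits <- combn(length(words), 4)

# The statistic for each relabelling: this is the exact null distribution.
null_diffs <- apply(splits, 2, function(idx) mean(words[idx]) - mean(words[-idx]))

# Two-sided p-value: share of relabellings at least as extreme as the observed one.
p_exact <- mean(abs(null_diffs) >= abs(obs))
print(p_exact)
```

Full enumeration is only practical for very small groups; with larger groups, randomly sampled shuffles are used to approximate the same distribution.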

From theory, we know the distribution of a difference in means, so we could just do a t-test. For the t-test to be valid, we need enough samples for the Central Limit Theorem to kick in. In our case, a t-test might not be reliable since we have only 12 subjects per group. More information on permutation tests here.
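For comparison, a two-sample t-test in R might look like the sketch below. The two vectors are hypothetical stand-ins, not the SleepCaffeine data, and `t.test` is called with its defaults, which means the Welch version that does not assume equal variances.

```r
# Hypothetical stand-in scores (NOT the SleepCaffeine data).
sleep_words <- c(14, 16, 15, 17, 18, 13)
caffeine_words <- c(11, 13, 12, 15, 14, 10)

# Welch two-sample t-test (R's default: var.equal = FALSE, two-sided).
tt <- t.test(sleep_words, caffeine_words)

# The t statistic and its p-value, which rely on approximate normality
# of the difference in sample means.
print(tt$statistic)
print(tt$p.value)
```

With groups this small, the p-value depends on the normality assumption, which is the motivation for the permutation approach used in the questions.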

Questions

Calculate the observed difference (Sleep - Caffeine) between the group means

sleep <- mean(Memory$Words[Memory$Group == "Sleep"])       # mean words recalled, Sleep group
caffeine <- mean(Memory$Words[Memory$Group == "Caffeine"]) # mean words recalled, Caffeine group
observed_diff <- sleep - caffeine                          # Sleep minus Caffeine
print(observed_diff)

## [1] 3

Create a function (call it perm_diff) to randomly permute the 12 Sleep and 12 Caffeine treatment labels and calculate the difference between the group means under this label assignment

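perm_diff <- function(){
  # shuffle the treatment labels; the 12/12 group sizes are preserved
  perm_sample <- sample(Memory$Group)
  sample_smean <- mean(Memory$Words[perm_sample == "Sleep"])
  sample_cmean <- mean(Memory$Words[perm_sample == "Caffeine"])
  # difference in means (Sleep - Caffeine) under the shuffled labels
  return(sample_smean - sample_cmean)
}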
perm_diff()

## [1] 1.166667

Use perm_diff to generate 10000 permutations and store the difference between the means in a vector called diff_vector.

diff_vector <- rep(0,10000)
for(x in 1:10000){
  diff_vector[x] <- perm_diff()
}
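The loop above can also be written with `replicate`, which calls a function repeatedly and collects the results in a vector. The sketch below is self-contained, so it uses a hypothetical stand-in data frame `toy` with the same column names as `Memory` but made-up values; `toy_perm_diff` and the seed value are likewise assumptions for illustration. It also shows why the 10000 shuffles are an approximation: with 12 labels of each kind there are far more distinct assignments than that.

```r
set.seed(1)  # assumption: any fixed seed, used only to make the run reproducible

# Hypothetical stand-in for Memory (same column names, made-up values).
toy <- data.frame(
  Group = rep(c("Sleep", "Caffeine"), each = 6),
  Words = c(14, 16, 15, 17, 18, 13, 11, 13, 12, 15, 14, 10)
)

# Same logic as perm_diff, applied to the stand-in data.
toy_perm_diff <- function() {
  shuffled <- sample(toy$Group)
  mean(toy$Words[shuffled == "Sleep"]) - mean(toy$Words[shuffled == "Caffeine"])
}

# replicate() runs the function 10000 times and returns a numeric vector.
toy_diffs <- replicate(10000, toy_perm_diff())

# Number of distinct ways to assign 12 Sleep labels among 24 students.
n_assignments <- choose(24, 12)
print(n_assignments)
```

Setting a seed before the simulation is optional, but it makes the quantiles and proportions identical from one run to the next.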

Use the function quantile with the vector diff_vector to see a quick summary of the permuted differences

quantile(diff_vector, probs=c(0.025, 0.5, 0.975))

##      2.5%       50%     97.5% 
## -2.833333  0.000000  3.000000

Plot the sampling distribution of the differences using the function hist

hist(diff_vector)

Use the function abline to see where the observed difference falls in the sampling distribution you plotted above (by default abline is added to your last plot)

hist(diff_vector)
abline(v=observed_diff, col = "blue")

Calculate the proportion of times that the observed difference is smaller than the simulated ones (then try greater)

# smaller than
mean(observed_diff<diff_vector)

## [1] 0.0198

# greater than
mean(observed_diff>diff_vector)

## [1] 0.9738
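The two proportions above use strict inequalities and sum to 0.9936, so about 0.6% of the shuffles tied the observed difference exactly. Since a p-value counts outcomes at least as extreme as the observed one, ties are usually included. The sketch below shows one way to do that; `perm_pvalue` is a hypothetical helper and `toy_diffs` holds made-up permuted differences, not the values simulated above.

```r
# Hypothetical helper: permutation p-values that count ties as
# "at least as extreme" as the observed statistic.
perm_pvalue <- function(observed, perm_diffs) {
  c(
    upper = mean(perm_diffs >= observed),                # one-sided, large values extreme
    lower = mean(perm_diffs <= observed),                # one-sided, small values extreme
    two_sided = mean(abs(perm_diffs) >= abs(observed))   # either direction
  )
}

# Made-up permuted differences, for illustration only.
toy_diffs <- c(-3, -2, -1, -1, 0, 0, 1, 1, 2, 3)
p_values <- perm_pvalue(3, toy_diffs)
print(p_values)
```

Because the question asks only whether the treatments differ, without naming a direction, the two-sided value is the natural one to report.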

State your conclusions: is there a difference between the two treatments?

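My conclusion is yes: the data suggest a difference between the two treatments. In the histogram, the blue line marking the observed difference (3 words, Sleep minus Caffeine) sits far from the middle of the permutation distribution, in its upper tail, and only about 2% of the shuffled differences were larger. A difference this large would be unusual if the treatment had no effect, so the Sleep group appears to recall more words on average than the Caffeine group.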

In-class Exercise Week 3

Saeah Go
