This report captures work done for the 5th flipped assignment, determining sample sizes for various statistical problems. R code along with the results are provided. Answers to the questions are in blue.


Problem 1

Question

One method for assessing the bioavailability of a drug is to note its concentration in blood and/or urine samples at certain periods of time after the drug is given. Suppose we want to compare the concentrations of two types of aspirin (types A and B) in urine specimens taken from the same person 1 hour after he/ she has taken the drug. In this study protocol, a specific dosage of either type A or type B aspirin is given and the urine concentration after 1-hour is measured. One week later, after the first aspirin has presumably been cleared from the system, the same dosage of the other aspirin is given to the same person and the urine concentration after 1-hour is noted. Because the order of giving the drugs may affect the results, a table of random numbers is used to decide which of the two types of aspirin to give first. The concentration will be measured as a percentage, and it is presumed that the standard deviation of the difference between subjects is approximately 3%. Suppose we would like to test whether the mean urine concentration for Aspirin B is less than Aspirin A at an alpha=0.05 level of significance. How many samples would we need to collect such that there would be a 75% chance of correctly rejecting the null hypothesis if the urine concentration of Aspirin B is 1.5% less than that of Aspirin A?

Problem Setup

Statistical Test and Rationale:  Paired t-Test


Hypothesis Test:        Null:     H0:μA = μB
               Alternate: H1:μA > μB
               Where μA & μB = mean concentrations of Aspirin A & B (respectively) in a subject’s blood stream after 1 hour.
One or Two Sided:      One-sided
Alpha:            0.05
Beta:            0.25
Difference:          1.5%
Standard Deviation:      3.0%

Statistical Test

pwr.t.test(n=NULL, d= 0.5, sig.level = 0.05,power = 0.75, type="paired", alternative="greater")
## 
##      Paired t test power calculation 
## 
##               n = 22.92958
##               d = 0.5
##       sig.level = 0.05
##           power = 0.75
##     alternative = greater
## 
## NOTE: n is number of *pairs*

Conclusion

It will take 23 participants in the study to detect a 1.5% difference in means of the two Aspirin types to a significance level of 95%.


Problem 2

Question

Can active exercise shorten the time that it takes an infant to learn how to walk alone? Researchers would at a major university would like to design an experiment to test this hypothesis. Specifically, they would like to use one-week old male infants from white middle-class white families as test subjects and allocate into to one of two treatment groups. Those in the active exercise group will receive stimulation of the walking reflexes for four 3-minute sessions each day from the beginning of the second week through the end of the eighth week following birth; those in the other group will receive no such stimulation. Following this, the time (in months) that it takes subjects from each group to walk independently will then be measured. Since this is a preliminary project, the researchers would like to use an alpha=0.10 level of significance for this test, but would like to achieve a power of .85 at detecting a mean difference that is half of the (pooled) standard deviation of walking times. How many infants do they need to enroll in each group? How many total?

Problem Setup

Statistical Test and Rationale:  2-Sample t-Test
Hypothesis Test:        Null:     H0:μ1 = μ2
               Alternate: H1:μ1 μ2
               Where μ1 = mean time to walk in months for the ‘walking stimulated group’
                  and μ2 = mean time to walk in months for the ‘non-stimulated group’
One or Two Sided:      Two-sided
Alpha:            0.10
Beta:            0.15
Difference:          0.5S
Standard Deviation:      S

Statistical Test

pwr.t.test(n=NULL, d= 0.5, sig.level = 0.10,power = 0.85, type="two.sample", alternative="two.sided")
## 
##      Two-sample t test power calculation 
## 
##               n = 58.20169
##               d = 0.5
##       sig.level = 0.1
##           power = 0.85
##     alternative = two.sided
## 
## NOTE: n is number in *each* group

Conclusion

It will take 59 infants enrolling in each group or 118 total to detect a difference that is half of the (pooled) standard deviation in mean months to walking at a significance level of 90%.

Source Code

# setup Libraries
library(knitr)
library(dplyr)
library(pwr)

#### Statistical Test Question 1
{r, echo=TRUE}
pwr.t.test(n=NULL, d= 0.5, sig.level = 0.05,power = 0.75, type="paired", alternative="greater")

#### Statistical Test Question 2
{r, echo=TRUE}
pwr.t.test(n=NULL, d= 0.5, sig.level = 0.10,power = 0.85, type="two.sample", alternative="two.sided")