This homework has two parts. Part 1 asks you to write your own functions. Part 2 applies hypothesis tests to two real scenarios.
# Q1. Write a function called calculate_area_of_rectangle that takes two parameters
# (length, width) and returns the area (area = length * width).
# Test it with 2 different inputs.
calculate_area_of_rectangle <- function(length, width) {
area <- length * width
return(area)
}
# Test 1
area1 <- calculate_area_of_rectangle(7, 5)
print(area1)
## [1] 35
# Test 2
area2 <- calculate_area_of_rectangle(4, 8)
print(area2)
## [1] 32
# Q2. Write a function called calculate_average that takes a numeric vector and returns
# its average. Handle the case of an empty vector by printing a message.
# (Hint: use if/else and length(x) == 0)
calculate_average <- function(x) {
if (length(x) == 0) {
print("How do you do?")
} else {
return(mean(x))
}
}
calculate_average (c(2, 4, 6, 8))
## [1] 5
calculate_average(c())
## [1] "How do you do?"
# Q3. Write a function called check_even_odd that takes an integer and prints whether
# it is "Even" or "Odd".
# Test it on 14 and 27.
# (Hint: use the %% modulus operator)
check_even_odd <- function(x) {
if(x %% 2 ==0)
print("Even")
else
print("Odd")
}
check_even_odd(14)
## [1] "Even"
check_even_odd(27)
## [1] "Odd"
In 2017, of the 144,790 students who took the AP Biology exam, 84,200 were female. That same year, of the 211,693 students who took the AP Calculus AB exam, 102,598 were female.
Is there enough evidence to show that the proportion of female students taking the Biology exam is HIGHER than the proportion taking the Calculus AB exam? Test at the 5% level.
State your hypotheses:
# Q4. Run the appropriate two-proportion test.
# (Hint: prop.test(c(84200, 102598), c(144790, 211693), alternative = "greater"))
prop.test(c(84200, 102598), c(144790, 211693), alternative = "greater")
##
## 2-sample test for equality of proportions with continuity correction
##
## data: c(84200, 102598) out of c(144790, 211693)
## X-squared = 3234.9, df = 1, p-value < 2.2e-16
## alternative hypothesis: greater
## 95 percent confidence interval:
## 0.09408942 1.00000000
## sample estimates:
## prop 1 prop 2
## 0.5815319 0.4846547
# Q5. What is the p-value? At α = 0.05, do you reject H₀?
2.2e-16
## [1] 2.2e-16
# We reject the H0.
Q6. Write your conclusion in plain English (one or two sentences): Conclusively, because we reject the null hypothesis, this means that the proportion of female students taking the biology exam is higher than the proportion taking the Calculus AB exam. —
A vitamin K shot is given to infants soon after birth. Researchers want to see if how the infants are handled can reduce the pain. They measured how long (in seconds) the infant cried after the shot. One group received the shot the conventional way; the other group received it while the mother held the infant.
Is there enough evidence to show that infants cried LESS on average when held by their mothers vs. the conventional method? Test at the 5% level.
Old <- c(63, 0, 2, 46, 33, 33, 29, 23, 11, 12, 48, 15, 33, 14, 51,
37, 24, 70, 63, 0, 73, 39, 54, 52, 39, 34, 30, 55, 58, 18)
New <- c(0, 32, 20, 23, 14, 19, 60, 59, 64, 64, 72, 50, 44, 14, 10,
58, 19, 41, 17, 5, 36, 73, 19, 46, 9, 43, 73, 27, 25, 18)
State your hypotheses:
# Q7. Run a paired t-test.
# (Hint: t.test(Old, New, paired = TRUE))
t.test(Old, New, paired = TRUE)
##
## Paired t-test
##
## data: Old and New
## t = 0.028519, df = 29, p-value = 0.9774
## alternative hypothesis: true mean difference is not equal to 0
## 95 percent confidence interval:
## -11.78558 12.11892
## sample estimates:
## mean difference
## 0.1666667
# Q8. What is the p-value? At α = 0.05, do you reject H₀?
0.9774
## [1] 0.9774
# We fail to reject the H0
Q9. Write your conclusion in plain English. Does the data support the claim that the new method reduces crying time? Conclusively, because we fail to reject the null hypothesis. In other words, yes, this means that there enough evidence to show that infants cried less on average when held by their mothers vs. the conventional method.