Three problems — one chi-square goodness-of-fit, one chi-square test of independence, and one one-way ANOVA. Each one follows the same pattern: state hypotheses, run the test, interpret the result.


Problem 1 — Chi-Square Goodness of Fit (ACTN3 alleles)

ACTN3 is a gene that encodes alpha-actinin-3, a protein in fast-twitch muscle fibers. The gene has two main alleles: R (functional) and X (non-functional). The R allele is linked to better performance in strength/speed/power sports; the X allele is associated with endurance.

A study of 436 people classified 244 as R and 192 as X. Does this provide evidence that the two options are NOT equally likely?

State your hypotheses:

# Q1. Run a chi-square goodness-of-fit test.
#     (Hint: observed <- c(244, 192); chisq.test(observed))
observed <- c(244, 192)
gof <- chisq.test(observed)
gof
## 
##  Chi-squared test for given probabilities
## 
## data:  observed
## X-squared = 6.2018, df = 1, p-value = 0.01276
# Q2. What is the p-value? At α = 0.05, do you reject H₀?
gof$p.value
## [1] 0.01276179
# p-value ≈ 0.013 < 0.05, so we REJECT H0.

Q3. Write your conclusion in plain English:

The chi-square statistic is about 6.20 with a p-value of about 0.013, which is below 0.05, so we reject the null hypothesis. There is significant evidence that the R and X alleles are not equally likely — in this sample the R allele (244) occurs more often than the X allele (192).


Problem 2 — Chi-Square Test of Independence (Vitamin Use & Gender)

The NutritionStudy.csv dataset contains data on vitamin use (VitaminUse) and gender (Sex) for many participants. Is there a significant association between these two variables?

Download NutritionStudy.csv from the Datasets folder on Blackboard.

nutrition <- read.csv("NutritionStudy.csv")

State your hypotheses:

# Q4. Build a contingency table of VitaminUse and Sex using table().
vit_table <- table(nutrition$VitaminUse, nutrition$Sex)
vit_table
##             
##              Female Male
##   No             87   24
##   Occasional     77    5
##   Regular       109   13
# Q5. Run a chi-square test of independence on that table.
ind_test <- chisq.test(vit_table)
ind_test
## 
##  Pearson's Chi-squared test
## 
## data:  vit_table
## X-squared = 11.071, df = 2, p-value = 0.003944
# Q6. What is the p-value? Do you reject H₀ at α = 0.05?
ind_test$p.value
## [1] 0.003944277
# p-value ≈ 0.004 < 0.05, so we REJECT H0.

Q7. Write your conclusion in plain English:

The chi-square test of independence gives a statistic of about 11.07 (df = 2) with a p-value of about 0.004, which is well below 0.05, so we reject the null hypothesis. There is a statistically significant association between vitamin use and gender — the pattern of vitamin use differs between males and females in this dataset.


Problem 3 — One-Way ANOVA (Fish Gills)

Researchers wanted to know how water chemistry affects fish ventilation. Fish were randomly assigned to one of three tanks with different calcium levels:

The team counted gill rates (beats per minute) for 30 fish in each tank. The data is in FishGills3.csv.

Download FishGills3.csv from the Datasets folder on Blackboard.

fish <- read.csv("FishGills3.csv")

State your hypotheses:

# Q8. Run a one-way ANOVA testing GillRate by Calcium.
#     (Hint: aov(GillRate ~ Calcium, data = fish))
fish_aov <- aov(GillRate ~ Calcium, data = fish)

# Q9. Use summary() on the result. What is the F statistic and p-value?
summary(fish_aov)
##             Df Sum Sq Mean Sq F value Pr(>F)  
## Calcium      2   2037  1018.6   4.648 0.0121 *
## Residuals   87  19064   219.1                 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
# F statistic ≈ 4.65, p-value ≈ 0.012.

# Q10. At α = 0.05, do you reject H₀?
# p-value ≈ 0.012 < 0.05, so we REJECT H0.

Q11. Write your conclusion in plain English:

The ANOVA gives an F statistic of about 4.65 with a p-value of about 0.012, which is below 0.05, so we reject the null hypothesis. There is significant evidence that the mean gill (ventilation) rate is not the same across the three calcium levels — at least one calcium level produces a different average gill rate, so water calcium concentration does appear to affect fish ventilation.