setwd("~/Data 101")hw 8 data 101
fish <- read.csv("FishGills3.csv")nutritions <- read.csv("NutritionStudy.csv")Problem 1
ACTN3 is a gene that encodes alpha-actinin-3, a protein in fast-twitch muscle fibers, important for activities like sprinting and weightlifting. The gene has two main alleles: R (functional) and X (non-functional). The R allele is linked to better performance in strength, speed, and power sports, while the X allele is associated with endurance due to a greater reliance on slow-twitch fibers. However, athletic performance is influenced by various factors, including training, environment, and other genes, making the ACTN3 genotype just one contributing factor. A study examines the ACTN3 genetic alleles R and X, also associated with fast-twitch muscles. Of the 436 people in this sample, 244 were classified as R, and 192 were classified as X. Does the sample provide evidence that the two options are not equally likely? Conduct the test using a chi-square goodness-of-fit test.
Ho: p1= p2 = 1/2
Ha:at least one pi ≠ 1/3
observed <- c(244,192)
theoretical_prop <- rep(1/2, 2)counts
expected_values <- theoretical_prop*sum(observed)
expected_values[1] 218 218
chisq.test(observed)
Chi-squared test for given probabilities
data: observed
X-squared = 6.2018, df = 1, p-value = 0.01276
The p-value is 0.01276, which is less than 0.05 so we will reject the null. There is strong evidence showing at least one proportion is not equal to 1/2.
Problem 2
Who Is More Likely to Take Vitamins: Males or Females? The dataset NutritionStudy contains, among other things, information about vitamin use and the gender of the participants. Is there a significant association between these two variables? Use the variables VitaminUse and Gender to conduct a chi-square analysis and give the results. (Test for Association)
head(nutritions) ID Age Smoke Quetelet Vitamin Calories Fat Fiber Alcohol Cholesterol
1 1 64 No 21.4838 1 1298.8 57.0 6.3 0.0 170.3
2 2 76 No 23.8763 1 1032.5 50.1 15.8 0.0 75.8
3 3 38 No 20.0108 2 2372.3 83.6 19.1 14.1 257.9
4 4 40 No 25.1406 3 2449.5 97.5 26.5 0.5 332.6
5 5 72 No 20.9850 1 1952.1 82.6 16.2 0.0 170.8
6 6 40 No 27.5214 3 1366.9 56.0 9.6 1.3 154.6
BetaDiet RetinolDiet BetaPlasma RetinolPlasma Sex VitaminUse PriorSmoke
1 1945 890 200 915 Female Regular 2
2 2653 451 124 727 Female Regular 1
3 6321 660 328 721 Female Occasional 2
4 1061 864 153 615 Female No 2
5 2863 1209 92 799 Female Regular 1
6 1729 1439 148 654 Female No 2
summary(nutritions) ID Age Smoke Quetelet
Min. : 1.0 Min. :19.00 Length:315 Min. :16.33
1st Qu.: 79.5 1st Qu.:39.00 Class :character 1st Qu.:21.80
Median :158.0 Median :48.00 Mode :character Median :24.74
Mean :158.0 Mean :50.15 Mean :26.16
3rd Qu.:236.5 3rd Qu.:62.50 3rd Qu.:28.85
Max. :315.0 Max. :83.00 Max. :50.40
Vitamin Calories Fat Fiber
Min. :1.000 Min. : 445.2 Min. : 14.40 Min. : 3.10
1st Qu.:1.000 1st Qu.:1338.0 1st Qu.: 53.95 1st Qu.: 9.15
Median :2.000 Median :1666.8 Median : 72.90 Median :12.10
Mean :1.965 Mean :1796.7 Mean : 77.03 Mean :12.79
3rd Qu.:3.000 3rd Qu.:2100.4 3rd Qu.: 95.25 3rd Qu.:15.60
Max. :3.000 Max. :6662.2 Max. :235.90 Max. :36.80
Alcohol Cholesterol BetaDiet RetinolDiet
Min. : 0.000 Min. : 37.7 Min. : 214 Min. : 30.0
1st Qu.: 0.000 1st Qu.:155.0 1st Qu.:1116 1st Qu.: 480.0
Median : 0.300 Median :206.3 Median :1802 Median : 707.0
Mean : 3.279 Mean :242.5 Mean :2186 Mean : 832.7
3rd Qu.: 3.200 3rd Qu.:308.9 3rd Qu.:2836 3rd Qu.:1037.0
Max. :203.000 Max. :900.7 Max. :9642 Max. :6901.0
BetaPlasma RetinolPlasma Sex VitaminUse
Min. : 0.0 Min. : 179.0 Length:315 Length:315
1st Qu.: 90.0 1st Qu.: 466.0 Class :character Class :character
Median : 140.0 Median : 566.0 Mode :character Mode :character
Mean : 189.9 Mean : 602.8
3rd Qu.: 230.0 3rd Qu.: 716.0
Max. :1415.0 Max. :1727.0
PriorSmoke
Min. :1.000
1st Qu.:1.000
Median :2.000
Mean :1.638
3rd Qu.:2.000
Max. :3.000
Ho = There is no association between gender and vitamin use Ha = There is a association between gender and vitamin use
observed_dataset <- table(nutritions$VitaminUse, nutritions$Sex)
observed_dataset
Female Male
No 87 24
Occasional 77 5
Regular 109 13
chisq.test(observed_dataset)
Pearson's Chi-squared test
data: observed_dataset
X-squared = 11.071, df = 2, p-value = 0.003944
Because the p-value is .003944 which is less than .05 so we will reject the null.There is strong evidence that there is a association between gender and vitamin use.
Problem 3
Most fish use gills for respiration in water, and researchers can observe how fast a fish’s gill cover beats to study ventilation, much like we might observe a person’s breathing rate. Professor Brad Baldwin is interested in how water chemistry might affect gill beat rates. In one experiment, he randomly assigned fish to tanks with different calcium levels. One tank was low in calcium (0.71 mg/L), the second tank had a medium amount (5.24 mg/L), and the third tank had water with a high calcium level (18.24 mg/L). His research team counted gill rates (beats per minute) for samples of 30 fish in each tank. The results are stored in FishGills3. Perform ANOVA test to see if the mean gill rate differs depending on the calcium level of the water. μ1=is low calcium levels μ2=is medium calcium levels μ3=is high calcium levels
Ho: μ1= μ2 = μ2
ha= one μ is different from the others
head(fish) Calcium GillRate
1 Low 55
2 Low 63
3 Low 78
4 Low 85
5 Low 65
6 Low 98
summary(fish) Calcium GillRate
Length:90 Min. :33.00
Class :character 1st Qu.:48.00
Mode :character Median :62.50
Mean :61.78
3rd Qu.:72.00
Max. :98.00
anova_answer <- aov(GillRate ~ Calcium, data = fish)
anova_answerCall:
aov(formula = GillRate ~ Calcium, data = fish)
Terms:
Calcium Residuals
Sum of Squares 2037.222 19064.333
Deg. of Freedom 2 87
Residual standard error: 14.80305
Estimated effects may be unbalanced
summary(anova_answer) Df Sum Sq Mean Sq F value Pr(>F)
Calcium 2 2037 1018.6 4.648 0.0121 *
Residuals 87 19064 219.1
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Because the p-value is .0121, we will reject the null. There is very strong evidence to suggests that at least one of the calcium levels is different from the other calcium levels.