library(readxl)
library(ggpubr)
## Loading required package: ggplot2
library(effectsize)
Research Question: Do the same participants have the same stress scores before versus after?
Dataset6.3<- read_excel("/Users/alexiaprudencio/Desktop/Applied Analytics 1/Assingment 6/Dataset6.3.xlsx")
Before <- Dataset6.3$Stress_Pre
After <- Dataset6.3$Stress_Post
Differences <- After - Before
mean(Before, na.rm = TRUE)
## [1] 65.86954
median(Before, na.rm = TRUE)
## [1] 67.33135
sd(Before, na.rm = TRUE)
## [1] 9.496524
mean(After, na.rm = TRUE)
## [1] 57.90782
median(After, na.rm = TRUE)
## [1] 59.14539
sd(After, na.rm = TRUE)
## [1] 10.1712
hist(Differences,
main = "Histogram of Difference Scores",
xlab = "Value",
ylab = "Frequency",
col = "lightblue",
border = "darkblue",
breaks = 20)
The histogram appears negatively skewed. The tail extends to the right
with a higher concentration of frequent scores on the left side of the
scale, betweem -15 and -5. The histogram kurtosis is tall. Most
Individuals saw a decrease in their stress scores from before to
after.
boxplot(Differences,
main = "Distribution of Score Differences (After - Before)",
ylab = "Difference in Scores",
col = "lightgreen",
border = "darkgreen")
There are no outliers because there are no isolated dots floating above
the top or bellow the bottoms whiskers. The data appears normal.
shapiro.test(Differences)
##
## Shapiro-Wilk normality test
##
## data: Differences
## W = 0.95612, p-value = 0.1745
The p-value = 0.1745 > .05, which shows the distribution is normal and means we should proceed with a Dependent t-test.
t.test(Before, After, paired = TRUE)
##
## Paired t-test
##
## data: Before and After
## t = 3.9286, df = 34, p-value = 0.0003972
## alternative hypothesis: true mean difference is not equal to 0
## 95 percent confidence interval:
## 3.843113 12.080317
## sample estimates:
## mean difference
## 7.961715
The p-value = 0.0003972 < .05, (less than .05), this means the results were significant and we should calculate the Effect Size.
effectsize::cohens_d(After, Before, paired = TRUE)
## For paired samples, 'repeated_measures_d()' provides more options.
## Cohen's d | 95% CI
## --------------------------
## -0.66 | [-1.03, -0.29]