New names:
• `Indicator 1` -> `Indicator 1...4`
• `Indicator 2` -> `Indicator 2...5`
• `Indicator 3` -> `Indicator 3...6`
• `Indicator 4` -> `Indicator 4...7`
• `Indicator 5` -> `Indicator 5...8`
• `Indicator 1` -> `Indicator 1...9`
• `Indicator 2` -> `Indicator 2...10`
• `Indicator 3` -> `Indicator 3...11`
• `Indicator 4` -> `Indicator 4...12`
• `Indicator 5` -> `Indicator 5...13`
• `Indicator 1` -> `Indicator 1...14`
• `Indicator 2` -> `Indicator 2...15`
• `Indicator 3` -> `Indicator 3...16`
• `Indicator 4` -> `Indicator 4...17`
• `Indicator 5` -> `Indicator 5...18`
Attaching package: 'dplyr'
The following objects are masked from 'package:stats':
filter, lag
The following objects are masked from 'package:base':
intersect, setdiff, setequal, union
The tables above provides the distributions of respondents in terms of sex, year level, and strand. It can be seen that there are 79 females and 41 males; 29 of which are from ABM, 31 from GAS, 30 from HUMSS, and 30 from STEM.
Call:
lm(formula = `Academic Track Specialization` ~ `Facilities and Resources` +
`Community and Governance`, data = Data)
Coefficients:
(Intercept) `Facilities and Resources`
2.19232 0.03515
`Community and Governance`
0.32750
`summarise()` has grouped output by 'Sex'. You can override using the `.groups`
argument.
Attaching package: 'rstatix'
The following object is masked from 'package:stats':
filter
The mean for male and female is 3.263 and 3.324, respectively.
The above graph shows the plotting of data by sex, which contains two sexes – male and female.
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ forcats 1.0.0 ✔ stringr 1.5.1
✔ lubridate 1.9.3 ✔ tibble 3.2.1
✔ purrr 1.0.2 ✔ tidyr 1.3.1
✔ readr 2.1.5
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ rstatix::filter() masks dplyr::filter(), stats::filter()
✖ dplyr::lag() masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
Warning: The following aesthetics were dropped during statistical transformation: fill
ℹ This can happen when ggplot fails to infer the correct grouping structure in
the data.
ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
variable into a factor?
It clearly shows that there is no significant difference on the variable academic track specialization when grouped according to their sex.
Loading required package: carData
Attaching package: 'car'
The following object is masked from 'package:purrr':
some
The following object is masked from 'package:dplyr':
recode
The histogram does not resemble a bell curve as seen above, means that the residuals do not have a normal distribution. Moreover, the points in the QQ-plots roughly follow the straight line, with the majority of them falling within the confidence bands. However, this does not guarantee that residuals follow a normal distribution since when based on the diagram on the left, it is the exact opposite of it. Thus, it is more convenient to observe the two.
Shapiro-Wilk normality test
data: res_aov$residuals
W = 0.90788, p-value = 5.078e-07
The Shapiro-Wilk p-value = 5.078e-07 on the residuals is less than the usual significance level of 0.05. Thus, we reject the hypothesis that residuals have a normal distribution.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 1 0.0461 0.8304
118
The p-value is greater than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is met.
Wilcoxon rank sum test
data: a and b
W = 1461, p-value = 0.366
alternative hypothesis: true location shift is not equal to 0
Since the p-value is larger than 0.05, we fail to reject the null hypothesis, that is, there is no significant difference on the variable academic track specialization when grouped according to sex.
`summarise()` has grouped output by 'Sex'. You can override using the `.groups`
argument.
The mean for male and female is 2.956 and 2.886, respectively.
The above graph shows the plotting of data by sex, which contains two sexes – male and female.
Warning: The following aesthetics were dropped during statistical transformation: fill
ℹ This can happen when ggplot fails to infer the correct grouping structure in
the data.
ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
variable into a factor?
It clearly shows that there is difference on the variable Facilities and Resources when grouped according to their sex. However, we still need to check the significance of this difference.
The histogram does not resemble a bell curve as seen above, means that the residuals do not have a normal distribution. Moreover, the points in the QQ-plots roughly follow the straight line, with the majority of them falling within the confidence bands. However, this does not guarantee that residuals follow a normal distribution since when based on the diagram on the left, it is the exact opposite of it. Thus, it is more convenient to observe the two.
Shapiro-Wilk normality test
data: res_aov$residuals
W = 0.96814, p-value = 0.006036
The Shapiro-Wilk p-value = 0.006036 on the residuals is less than the usual significance level of 0.05. Thus, we reject the hypothesis that residuals have a normal distribution.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 1 0.1105 0.7401
118
The p-value is greater than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is met.
Wilcoxon rank sum test
data: c and d
W = 1758, p-value = 0.435
alternative hypothesis: true location shift is not equal to 0
Since the p-value is larger than 0.05, we fail to reject the null hypothesis, that is, there is no significant difference on the variable facilities and resources when grouped according to sex.
`summarise()` has grouped output by 'Sex'. You can override using the `.groups`
argument.
The mean for male and female is 3.107 and 3.066, respectively.
The above graph shows the plotting of data by sex, which contains two sexes – male and female.
Warning: The following aesthetics were dropped during statistical transformation: fill
ℹ This can happen when ggplot fails to infer the correct grouping structure in
the data.
ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
variable into a factor?
It clearly shows that there is no significant difference on the variable Community and Governance when grouped according to their sex.
The histogram does not resemble a bell curve as seen above, means that the residuals do not have a normal distribution. Moreover, the points in the QQ-plots roughly follow the straight line, with the majority of them falling outside the confidence bands.
Shapiro-Wilk normality test
data: res_aov$residuals
W = 0.92752, p-value = 6.763e-06
The Shapiro-Wilk p-value = 6.763e-06 on the residuals is less than the usual significance level of 0.05. Thus, we reject the hypothesis that residuals have a normal distribution.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 1 2.092 0.1507
118
The p-value is greater than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is met.
Wilcoxon rank sum test
data: e and f
W = 1809.5, p-value = 0.273
alternative hypothesis: true location shift is not equal to 0
Since the p-value is larger than 0.05, we fail to reject the null hypothesis, that is, there is no significant difference on the variable Community and Governance when grouped according to sex.
Shapiro-Wilk normality test
data: Data$`Academic Track Specialization`
W = 0.89176, p-value = 7.608e-08
Since p-value = 7.608e-08 < 0.05, it is conclusive that we reject the null hypothesis. That is, we cannot assume normality.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 3 4.4458 0.005386 **
116
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
The p-value is less than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is not met.
Attaching package: 'gplots'
The following object is masked from 'package:stats':
lowess
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
Warning in axis(1, at = 1:length(means), labels = legends, ...): "frame" is not
a graphical parameter
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
# A tibble: 4 × 11
Strand variable n min max median iqr mean sd se ci
<fct> <fct> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 STEM Academic Track … 30 2.8 4 3.6 0.95 3.47 0.447 0.082 0.167
2 ABM Academic Track … 29 2.4 4 3.4 0.4 3.38 0.425 0.079 0.162
3 HUMSS Academic Track … 30 2.8 4 3 0.55 3.25 0.379 0.069 0.141
4 GAS Academic Track … 31 2.8 3.8 3 0.2 3.12 0.23 0.041 0.084
The mean of STEM, ABM, HUMSS, and GAS is 3.473, 3.379, 3.253, and 3.116, respectively.
# A tibble: 1 × 6
.y. n statistic df p method
* <chr> <int> <dbl> <int> <dbl> <chr>
1 Academic Track Specialization 120 13.7 3 0.0033 Kruskal-Wallis
Based on the p-value, there is significant difference was observed between the group pairs.
# A tibble: 6 × 9
.y. group1 group2 n1 n2 statistic p p.adj p.adj.signif
* <chr> <chr> <chr> <int> <int> <dbl> <dbl> <dbl> <chr>
1 Academic Tra… STEM ABM 30 29 -0.263 0.792 1 ns
2 Academic Tra… STEM HUMSS 30 30 -1.96 0.0498 0.299 ns
3 Academic Tra… STEM GAS 30 31 -3.23 0.00126 0.00754 **
4 Academic Tra… ABM HUMSS 29 30 -1.68 0.0926 0.556 ns
5 Academic Tra… ABM GAS 29 31 -2.93 0.00336 0.0202 *
6 Academic Tra… HUMSS GAS 30 31 -1.25 0.212 1 ns
There is significant difference between, STEM and HUMSS so with STEM and GAS, ABM and GAS.
Shapiro-Wilk normality test
data: Data$`Facilities and Resources`
W = 0.95209, p-value = 0.0003085
Since p-value = 0.0003085 < 0.05, it is conclusive that we reject the null hypothesis. That is, we cannot assume normality.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 3 4.0718 0.00863 **
116
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
The p-value is less than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is not met.
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
Warning in axis(1, at = 1:length(means), labels = legends, ...): "frame" is not
a graphical parameter
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
# A tibble: 4 × 11
Strand variable n min max median iqr mean sd se ci
<fct> <fct> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 STEM Facilities and … 30 1.4 3.8 2.8 0.6 2.85 0.535 0.098 0.2
2 ABM Facilities and … 29 2.4 3.6 2.8 0.2 2.91 0.3 0.056 0.114
3 HUMSS Facilities and … 30 2.4 3.6 3 0.35 2.95 0.309 0.056 0.116
4 GAS Facilities and … 31 2.2 3.6 3 0.2 2.93 0.299 0.054 0.11
The mean of STEM, ABM, HUMSS, and GAS is 2.847, 2.910, 2.953, and 2.929, respectively.
# A tibble: 1 × 6
.y. n statistic df p method
* <chr> <int> <dbl> <int> <dbl> <chr>
1 Facilities and Resources 120 1.10 3 0.778 Kruskal-Wallis
Based on the p-value, there is no significant difference was observed between the group pairs.
# A tibble: 6 × 9
.y. group1 group2 n1 n2 statistic p p.adj p.adj.signif
* <chr> <chr> <chr> <int> <int> <dbl> <dbl> <dbl> <chr>
1 Facilities and R… STEM ABM 30 29 0.274 0.784 1 ns
2 Facilities and R… STEM HUMSS 30 30 0.845 0.398 1 ns
3 Facilities and R… STEM GAS 30 31 0.863 0.388 1 ns
4 Facilities and R… ABM HUMSS 29 30 0.564 0.573 1 ns
5 Facilities and R… ABM GAS 29 31 0.579 0.562 1 ns
6 Facilities and R… HUMSS GAS 30 31 0.0109 0.991 1 ns
Shapiro-Wilk normality test
data: Data$`Community and Governance`
W = 0.91716, p-value = 1.654e-06
Since p-value = 1.654e-06 < 0.05, it is conclusive that we reject the null hypothesis. That is, we cannot assume normality.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 3 1.2218 0.305
116
The p-value is greater than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is met.
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
Warning in axis(1, at = 1:length(means), labels = legends, ...): "frame" is not
a graphical parameter
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
# A tibble: 4 × 11
Strand variable n min max median iqr mean sd se ci
<fct> <fct> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 STEM Community and G… 30 2 4 3.1 0.4 3.13 0.447 0.082 0.167
2 ABM Community and G… 29 2.2 4 3 0.2 3.08 0.353 0.065 0.134
3 HUMSS Community and G… 30 2.4 4 3 0.2 3.09 0.343 0.063 0.128
4 GAS Community and G… 31 2 4 3 0.1 3.03 0.345 0.062 0.127
The mean of STEM, ABM, HUMSS, and GAS is 3.127, 3.083, 3.087, 3.026, respectively.
# A tibble: 1 × 6
.y. n statistic df p method
* <chr> <int> <dbl> <int> <dbl> <chr>
1 Community and Governance 120 2.18 3 0.535 Kruskal-Wallis
Based on the p-value, there is no significant difference was observed between the group pairs.
# A tibble: 6 × 9
.y. group1 group2 n1 n2 statistic p p.adj p.adj.signif
* <chr> <chr> <chr> <int> <int> <dbl> <dbl> <dbl> <chr>
1 Community and Go… STEM ABM 30 29 -0.754 0.451 1 ns
2 Community and Go… STEM HUMSS 30 30 -0.760 0.447 1 ns
3 Community and Go… STEM GAS 30 31 -1.48 0.139 0.836 ns
4 Community and Go… ABM HUMSS 29 30 0.000265 1.00 1 ns
5 Community and Go… ABM GAS 29 31 -0.705 0.481 1 ns
6 Community and Go… HUMSS GAS 30 31 -0.711 0.477 1 ns
Shapiro-Wilk normality test
data: Data1$Scores
W = 0.94222, p-value = 1.242e-10
Since p-value = 1.242e-10 < 0.05, it is conclusive that we reject the null hypothesis. That is, we cannot assume normality.
Warning in leveneTest.default(y = y, group = group, ...): group coerced to
factor.
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 2 3.3326 0.03681 *
357
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
The p-value is less than the 0.05 level of significance. Thus, the homogeneity assumption of the variance is not met.
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
Warning in axis(1, at = 1:length(means), labels = legends, ...): "frame" is not
a graphical parameter
Warning in plot.xy(xy.coords(x, y), type = type, ...): "frame" is not a
graphical parameter
# A tibble: 3 × 11
Variables variable n min max median iqr mean sd se ci
<fct> <fct> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 Academic Trac… Scores 120 2.4 4 3.2 0.6 3.30 0.397 0.036 0.072
2 Facilities an… Scores 120 1.4 3.8 3 0.45 2.91 0.372 0.034 0.067
3 Community and… Scores 120 2 4 3 0.2 3.08 0.372 0.034 0.067
The mean of Academic Track Specialization, Facilities and Resources, and Community and Governance is 3.303, 2.910, and 3.080, respectively.
# A tibble: 1 × 6
.y. n statistic df p method
* <chr> <int> <dbl> <int> <dbl> <chr>
1 Scores 360 52.1 2 4.89e-12 Kruskal-Wallis
Based on the p-value, there is a significant difference was observed between the group pairs.
# A tibble: 3 × 9
.y. group1 group2 n1 n2 statistic p p.adj p.adj.signif
* <chr> <chr> <chr> <int> <int> <dbl> <dbl> <dbl> <chr>
1 Scores Academic T… Facil… 120 120 -7.22 5.31e-13 1.59e-12 ****
2 Scores Academic T… Commu… 120 120 -3.62 2.94e- 4 8.82e- 4 ***
3 Scores Facilities… Commu… 120 120 3.60 3.22e- 4 9.67e- 4 ***
Pairwaise, there is significant difference.
Based on the provided output above, we can say that it is the academic track specialization.