Does increased government expenditure on education improve youth literacy?
Each case represents a country with expenditure and literacy data for a given time period.
These cases are from the the Word Bank’s World Development Indicators.
This study will only focus on these two indicators:
The original dataset contains 415,436 rows of observations from 1960 to 2017.
There are 528 observations for the indicators mentioned above.
For this study, there are 91 complete country-period cases.
Data is collected by The World Bank. This dataset is part of the World Bank’s open data and is updated quarterly. The dataset used in this study has a release date of June 11, 2010 and was last updated on March 1, 2018.
This is an observational study.
Source: https://datacatalog.worldbank.org/dataset/world-development-indicators Release Date: June 11, 2010 Lasted Updated: March 1, 2018
The response variable is change in youth literacy rate.
The explanatory variable is the change in government expenditure in education.
A complete case is a country with available data for both selected indicators within a specific time period.
Selected indicators:
NOTE: Selecting one time period reference does not generate enough complete cases. For example, for the period 2010-2012, this only has 17 complete cases. In order to increase the number of cases for the study, several time periods are selected.
Each case is assigned into one of two groups: Less-Expenditure
or More-Expenditure
.
Size of group with increased government expenditure on education: 47 cases Size of group with decreased government expenditure on education: 44
## [1] 47
## [1] 44
describe(group_more_expend$Change.Educ_Expend)
## vars n mean sd median trimmed mad min max range skew kurtosis se
## X1 1 47 1.16 1.25 0.77 0.93 0.74 0 5.63 5.63 2.01 3.94 0.18
describe(group_more_expend$Change.Literacy)
## vars n mean sd median trimmed mad min max range skew kurtosis
## X1 1 47 0.35 0.92 0.12 0.3 0.2 -3.15 2.84 5.98 -0.15 3.91
## se
## X1 0.13
describe(group_less_expend$Change.Educ_Expend)
## vars n mean sd median trimmed mad min max range skew kurtosis
## X1 1 44 -0.8 0.62 -0.68 -0.74 0.56 -2.38 -0.07 2.31 -0.83 -0.31
## se
## X1 0.09
describe(group_less_expend$Change.Literacy)
## vars n mean sd median trimmed mad min max range skew kurtosis
## X1 1 44 0.19 0.81 0.02 0.13 0.42 -1.8 2.71 4.51 0.87 1.64
## se
## X1 0.12
ggplot(data=educ_expend_literacy, aes(x=Change.Educ_Expend, y=Change.Literacy,
colour=Group)) +
geom_point(size=3, alpha=0.7) +
labs(y="Change in literacy", x="Change in government expenditure on education", color="Group")
boxplot(educ_expend_literacy$Change.Literacy~Group, data=educ_expend_literacy, main="Change in Literacy", xlab = "Group", ylab="Change in Literacy")
hist(educ_expend_literacy$Change.Literacy)