This is my R notebook for my final project.
I first loaded my data set.
data <- read.csv("Math_Test_Results_2013-2023")
Warning: cannot open file 'Math_Test_Results_2013-2023': No such file or directoryError in file(file, "rt") : cannot open the connection
These are the necessary steps I took to clean my data set.
Now that my data set is cleaned I will begin my exploratory
analysis.
I first made a scatter plot comparing the percentage of level 1
students vs. level 4 students.

Then I made a line plot comparing those who scored in the level 1
percentile based on year

After seeing that these graphs don’t tell me very much, I knew that I
needed to look at more specific factors.
So I created a graph that compared male and female percentage
scores.

I created a similar bar graph, comparing ethnicity instead of
gender.

Another bar graph comparing students who are at an economic
disavandtage and those who are not.

This is my first linear regression model. That looks at the
correlation between gender and the percentage of students who scored in
level 1.
# Print a summary of the regression results
summary(model)
Call:
lm(formula = Pct.Level.1 ~ gender_male + gender_female, data = df)
Residuals:
Min 1Q Median 3Q Max
-35.180 -19.837 -2.587 16.714 65.513
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 34.4868 0.3474 99.259 <2e-16 ***
gender_male 0.6937 1.1048 0.628 0.530
gender_female -1.2012 1.0699 -1.123 0.262
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 23.73 on 5724 degrees of freedom
Multiple R-squared: 0.0003175, Adjusted R-squared: -3.184e-05
F-statistic: 0.9088 on 2 and 5724 DF, p-value: 0.403
This is my second linear regression model. That looks at the
correlation between ethnicity and the percentage of students who scored
in level 1.
# Print a summary of the regression results
summary(model2)
Call:
lm(formula = Pct.Level.1 ~ race_white + race_black + race_hispanic,
data = df)
Residuals:
Min 1Q Median 3Q Max
-44.949 -18.813 -2.413 16.006 66.006
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 33.9941 0.3408 99.762 < 2e-16 ***
race_white -17.0903 1.6448 -10.390 < 2e-16 ***
race_black 10.9553 1.3097 8.365 < 2e-16 ***
race_hispanic 4.8186 1.1031 4.368 1.27e-05 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 23.32 on 5723 degrees of freedom
Multiple R-squared: 0.03489, Adjusted R-squared: 0.03439
F-statistic: 68.97 on 3 and 5723 DF, p-value: < 2.2e-16
This is my final linear regression model. That looks at the
correlation between economic disadvantaged and the percentage of
students who scored in level 1.
# Print a summary of the regression results
summary(model3)
Call:
lm(formula = Pct.Level.1 ~ econ_disadv, data = df)
Residuals:
Min 1Q Median 3Q Max
-34.591 -19.891 -2.591 16.709 65.409
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 34.5912 0.3265 105.940 <2e-16 ***
econ_disadv -2.0212 1.1687 -1.729 0.0838 .
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 23.73 on 5725 degrees of freedom
Multiple R-squared: 0.0005221, Adjusted R-squared: 0.0003476
F-statistic: 2.991 on 1 and 5725 DF, p-value: 0.08379
LS0tDQp0aXRsZTogIkZpbmFsIFByb2plY3QiDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KDQpUaGlzIGlzIG15IFIgbm90ZWJvb2sgZm9yIG15IGZpbmFsIHByb2plY3QuIA0KDQpJIGZpcnN0IGxvYWRlZCBteSBkYXRhIHNldC4gDQoNCmBgYHtyfQ0KZGF0YSA8LSByZWFkLmNzdigiTWF0aF9UZXN0X1Jlc3VsdHNfMjAxMy0yMDIzIikgDQoNCiMgQmVjYXVzZSBteSBkYXRhIHNldCBoYWQgc28gbWFueSB2YWx1ZXMsIEkgbmVlZGVkIHRvIHRha2UgYSByYW5kb20gc2FtcGxlIG9mIDEwMDAgcm93cyBpbiBvcmRlciBmb3IgaXQgdG8gYmUgZWFzaWVyIHRvIHdvcmsgd2l0aCANCg0KZGYgPC0gZGF0YVtzYW1wbGUobnJvdyhkYXRhKSwgMTAwMDApLCBdDQpgYGANCg0KVGhlc2UgYXJlIHRoZSBuZWNlc3Nhcnkgc3RlcHMgSSB0b29rIHRvIGNsZWFuIG15IGRhdGEgc2V0LiANCg0KYGBge3J9IA0KIyBGaXJzdCBkZWxldGluZyB1bm5lY2Vzc2FyeSBjb2x1bW5zIA0KZGYgPC0gZGZbLCAhbmFtZXMoZGYpICVpbiUgYygiR2VvZ3JhcGhpYy5TdWJkaXZpc2lvbiIsICJTY2hvb2wuTmFtZSIsICJOdW1iZXIuVGVzdGVkIiwgIk51bS5MZXZlbC4xIiwgIk51bS5MZXZlbC4yIiwgIk51bS5MZXZlbC4zIiwgIk51bS5MZXZlbC40IiwgIk51bS5MZXZlbC4zLmFuZC40IiwgIlBjdC5MZXZlbC4zLmFuZC40IildDQoNCiMgTm93IEkgbmVlZCB0byBnZXQgcmlkIG9mIGFueSByb3cgY29udGFpbmluZyAiQWxsIGdyYWRlcyIgYW5kICJzIiBmb3IgbnVtZXJpYyB2YWx1ZXMgDQpyb3dzX3RvX2tlZXAgPC0gZGYkR3JhZGUgIT0gIkFsbCBHcmFkZXMiDQpkZiA8LSBkZltyb3dzX3RvX2tlZXAsIF0NCg0Kcm93c190b19rZWVwMiA8LSBkZiRNZWFuLlNjYWxlLlNjb3JlICE9ICJzIg0KZGYgPC0gZGZbcm93c190b19rZWVwMiwgXQ0KDQojIEkgbm93IHB1dCB0aGUgeWVhcnMgYW5kIGdyYWRlIGluIGFzY2VuZGluZyBvcmRlciB0byBtYWtlIHRoZSBkYXRhIG1vcmUgY2xlYXIgDQpkZiA8LSBkZltvcmRlcihkZiRZZWFyKSwgXQ0KDQpkZiA8LSBkZltvcmRlcihkZiRHcmFkZSksIF0NCg0KIyBJIHRoZW4gY2hhbmdlZCB0aGUgcmVsZXZhbnQgY29sdW1ucyB0byBudW1lcmljIA0KZGYkWWVhciA8LSBhcy5udW1lcmljKGRmJFllYXIpDQpkZiRHcmFkZSA8LSBhcy5udW1lcmljKGRmJEdyYWRlKQ0KZGYkUGN0LkxldmVsLjEgPC0gYXMubnVtZXJpYyhkZiRQY3QuTGV2ZWwuMSkNCmRmJFBjdC5MZXZlbC4yIDwtIGFzLm51bWVyaWMoZGYkUGN0LkxldmVsLjIpDQpkZiRQY3QuTGV2ZWwuMyA8LSBhcy5udW1lcmljKGRmJFBjdC5MZXZlbC4zKQ0KZGYkUGN0LkxldmVsLjQgPC0gYXMubnVtZXJpYyhkZiRQY3QuTGV2ZWwuNCkNCmBgYA0KDQpOb3cgdGhhdCBteSBkYXRhIHNldCBpcyBjbGVhbmVkIEkgd2lsbCBiZWdpbiBteSBleHBsb3JhdG9yeSBhbmFseXNpcy4gDQoNCkkgZmlyc3QgbWFkZSBhIHNjYXR0ZXIgcGxvdCBjb21wYXJpbmcgdGhlIHBlcmNlbnRhZ2Ugb2YgbGV2ZWwgMSBzdHVkZW50cyB2cy4gbGV2ZWwgNCBzdHVkZW50cy4gDQoNCmBgYHtyfQ0KbGlicmFyeSgiZ2dwbG90MiIpDQoNCmdncGxvdChkZiwgYWVzKHggPSBQY3QuTGV2ZWwuMSwgeSA9IFBjdC5MZXZlbC40KSkgKw0KICBnZW9tX3BvaW50KGNvbG9yID0gImJsdWUiKSArDQogIGxhYnModGl0bGUgPSAiU2NhdHRlciBQbG90IG9mIFBjdC5MZXZlbC4xIHZzLiBQY3QuTGV2ZWwuNCIsDQogICAgICAgeCA9ICJQY3QuTGV2ZWwuMSIsDQogICAgICAgeSA9ICJQY3QuTGV2ZWwuNCIpICsNCiAgdGhlbWVfbWluaW1hbCgpDQpgYGANCg0KVGhlbiBJIG1hZGUgYSBsaW5lIHBsb3QgY29tcGFyaW5nIHRob3NlIHdobyBzY29yZWQgaW4gdGhlIGxldmVsIDEgcGVyY2VudGlsZSBiYXNlZCBvbiB5ZWFyIA0KDQpgYGB7cn0NCmxpYnJhcnkoZ2dwbG90MikNCmxpYnJhcnkoZHBseXIpDQoNCiMgR3JvdXAgdGhlIGRhdGEgYnkgeWVhciANCmF2ZXJhZ2Vfc2NvcmVzIDwtIGRmICU+JQ0KICBncm91cF9ieShZZWFyKSAlPiUNCiAgc3VtbWFyaXNlKGF2Z19wZXJjZW50YWdlID0gbWVhbihQY3QuTGV2ZWwuMSwgbmEucm0gPSBUUlVFKSkNCg0KIyBDcmVhdGUgdGhlIGxpbmUgcGxvdA0KZ2dwbG90KGF2ZXJhZ2Vfc2NvcmVzLCBhZXMoeCA9IFllYXIsIHkgPSBhdmdfcGVyY2VudGFnZSwgZ3JvdXAgPSAxKSkgKw0KICBnZW9tX2xpbmUoY29sb3IgPSAiZGFyayBncmVlbiIpICsNCiAgZ2VvbV9wb2ludChjb2xvciA9ICJkYXJrIGdyZWVuIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IFllYXIiLA0KICAgICAgIHggPSAiWWVhciIsDQogICAgICAgeSA9ICJBdmVyYWdlIFBlcmNlbnRhZ2Ugb2YgTGV2ZWwgMSIpICsNCiAgdGhlbWVfbWluaW1hbCgpICsNCiAgdGhlbWUoYXhpcy50ZXh0LnggPSBlbGVtZW50X3RleHQoYW5nbGUgPSA0NSwgaGp1c3QgPSAxKSkNCmBgYA0KDQpBZnRlciBzZWVpbmcgdGhhdCB0aGVzZSBncmFwaHMgZG9uJ3QgdGVsbCBtZSB2ZXJ5IG11Y2gsIEkga25ldyB0aGF0IEkgbmVlZGVkIHRvIGxvb2sgYXQgbW9yZSBzcGVjaWZpYyBmYWN0b3JzLiANCg0KU28gSSBjcmVhdGVkIGEgZ3JhcGggdGhhdCBjb21wYXJlZCBtYWxlIGFuZCBmZW1hbGUgcGVyY2VudGFnZSBzY29yZXMuICANCg0KYGBge3J9DQojIEZpbHRlciB0aGUgZGF0YSB0byBpbmNsdWRlIG9ubHkgcm93cyB3aXRoICJNYWxlIiBvciAiRmVtYWxlIiBpbiBTdHVkZW50LkNhdGVnb3J5DQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJNYWxlIiwgIkZlbWFsZSIpKQ0KDQojIENhbGN1bGF0ZSB0aGUgYXZlcmFnZSBwZXJjZW50YWdlIGJ5IFN0dWRlbnQuQ2F0ZWdvcnkgKE1hbGUvRmVtYWxlKQ0KYXZlcmFnZV9zY29yZXMgPC0gZmlsdGVyZWRfZGYgJT4lDQogIGdyb3VwX2J5KFN0dWRlbnQuQ2F0ZWdvcnkpICU+JQ0KICBzdW1tYXJpc2UoYXZnX3BlcmNlbnRhZ2UgPSBtZWFuKFBjdC5MZXZlbC4xLCBuYS5ybSA9IFRSVUUpKQ0KDQojIENyZWF0ZSB0aGUgYmFyIGdyYXBoDQpnZ3Bsb3QoYXZlcmFnZV9zY29yZXMsIGFlcyh4ID0gU3R1ZGVudC5DYXRlZ29yeSwgeSA9IGF2Z19wZXJjZW50YWdlLCBmaWxsID0gU3R1ZGVudC5DYXRlZ29yeSkpICsNCiAgZ2VvbV9iYXIoc3RhdCA9ICJpZGVudGl0eSIsIHBvc2l0aW9uID0gImRvZGdlIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IEdlbmRlciIsDQogICAgICAgeCA9ICJHZW5kZXIiLA0KICAgICAgIHkgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEiKSArDQogIHRoZW1lX21pbmltYWwoKQ0KYGBgDQoNCkkgY3JlYXRlZCBhIHNpbWlsYXIgYmFyIGdyYXBoLCBjb21wYXJpbmcgZXRobmljaXR5IGluc3RlYWQgb2YgZ2VuZGVyLiANCg0KYGBge3J9DQojIEZpbHRlciB0aGUgZGF0YSB0byBkaWZmZXJlbnQgZXRobmljaXRpZXMgDQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJXaGl0ZSIsICJCbGFjayIsICJIaXNwYW5pYyIpKQ0KDQojIENhbGN1bGF0ZSB0aGUgYXZlcmFnZSBwZXJjZW50YWdlIGJ5IFN0dWRlbnQuQ2F0ZWdvcnkgKFdoaXRlL0JsYWNrL0hpc3BhbmljKQ0KYXZlcmFnZV9zY29yZXMgPC0gZmlsdGVyZWRfZGYgJT4lDQogIGdyb3VwX2J5KFN0dWRlbnQuQ2F0ZWdvcnkpICU+JQ0KICBzdW1tYXJpc2UoYXZnX3BlcmNlbnRhZ2UgPSBtZWFuKFBjdC5MZXZlbC4xLCBuYS5ybSA9IFRSVUUpKQ0KDQojIENyZWF0ZSB0aGUgYmFyIGdyYXBoDQpnZ3Bsb3QoYXZlcmFnZV9zY29yZXMsIGFlcyh4ID0gU3R1ZGVudC5DYXRlZ29yeSwgeSA9IGF2Z19wZXJjZW50YWdlLCBmaWxsID0gU3R1ZGVudC5DYXRlZ29yeSkpICsNCiAgZ2VvbV9iYXIoc3RhdCA9ICJpZGVudGl0eSIsIHBvc2l0aW9uID0gImRvZGdlIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IEV0aG5pY2l0eSIsDQogICAgICAgeCA9ICJFdGhuaWNpdHkiLA0KICAgICAgIHkgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEiKSArDQogIHRoZW1lX21pbmltYWwoKQ0KYGBgDQoNCkFub3RoZXIgYmFyIGdyYXBoIGNvbXBhcmluZyBzdHVkZW50cyB3aG8gYXJlIGF0IGFuIGVjb25vbWljIGRpc2F2YW5kdGFnZSBhbmQgdGhvc2Ugd2hvIGFyZSBub3QuIA0KDQpgYGB7cn0NCiMgRmlsdGVyIHRoZSBkYXRhIHRvIGluY2x1ZGUgb25seSByb3dzIHdpdGggIkVjb24gRGlzYWR2IiBvciAiTm90IEVjb24gRGlzYWR2IiBpbiBTdHVkZW50LkNhdGVnb3J5DQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJFY29uIERpc2FkdiIsICJOb3QgRWNvbiBEaXNhZHYiKSkNCg0KIyBDYWxjdWxhdGUgdGhlIGF2ZXJhZ2UgcGVyY2VudGFnZSBieSBTdHVkZW50LkNhdGVnb3J5IChFY29uIERpc2Fkdi9Ob24gRWNvbiBEaXNhZHYpDQphdmVyYWdlX3Njb3JlcyA8LSBmaWx0ZXJlZF9kZiAlPiUNCiAgZ3JvdXBfYnkoU3R1ZGVudC5DYXRlZ29yeSkgJT4lDQogIHN1bW1hcmlzZShhdmdfcGVyY2VudGFnZSA9IG1lYW4oUGN0LkxldmVsLjEsIG5hLnJtID0gVFJVRSkpDQoNCiMgQ3JlYXRlIHRoZSBiYXIgZ3JhcGgNCmdncGxvdChhdmVyYWdlX3Njb3JlcywgYWVzKHggPSBTdHVkZW50LkNhdGVnb3J5LCB5ID0gYXZnX3BlcmNlbnRhZ2UsIGZpbGwgPSBTdHVkZW50LkNhdGVnb3J5KSkgKw0KICBnZW9tX2JhcihzdGF0ID0gImlkZW50aXR5IiwgcG9zaXRpb24gPSAiZG9kZ2UiKSArDQogIGxhYnModGl0bGUgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEgYnkgRWNvbm9taWMgRGlzYWR2YW50YWdlIiwNCiAgICAgICB4ID0gIkVjb25vbWljIERpc2FkdmFudGFnZSIsDQogICAgICAgeSA9ICJBdmVyYWdlIFBlcmNlbnRhZ2Ugb2YgTGV2ZWwgMSIpICsNCiAgdGhlbWVfbWluaW1hbCgpDQpgYGANCg0KVGhpcyBpcyBteSBmaXJzdCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBnZW5kZXIgYW5kIHRoZSBwZXJjZW50YWdlIG9mIHN0dWRlbnRzIHdobyBzY29yZWQgaW4gbGV2ZWwgMS4gDQoNCmBgYHtyfQ0KIyBDaGFuZ2UgbXkgZGF0YSB0byBudW1lcmljIA0KZGYgPC0gd2l0aGluKGRmLCB7DQogIGdlbmRlcl9tYWxlIDwtIGFzLm51bWVyaWMoU3R1ZGVudC5DYXRlZ29yeSA9PSAiTWFsZSIpDQogIGdlbmRlcl9mZW1hbGUgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJGZW1hbGUiKQ0KfSkNCg0KIyBGaXQgYSBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbA0KbW9kZWwgPC0gbG0oUGN0LkxldmVsLjEgfiBnZW5kZXJfbWFsZSArIGdlbmRlcl9mZW1hbGUsIGRhdGEgPSBkZikNCg0KIyBQcmludCBhIHN1bW1hcnkgb2YgdGhlIHJlZ3Jlc3Npb24gcmVzdWx0cw0Kc3VtbWFyeShtb2RlbCkNCmBgYA0KDQpUaGlzIGlzIG15IHNlY29uZCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBldGhuaWNpdHkgYW5kIHRoZSBwZXJjZW50YWdlIG9mIHN0dWRlbnRzIHdobyBzY29yZWQgaW4gbGV2ZWwgMS4gDQoNCmBgYHtyfQ0KIyBDaGFuZ2UgbXkgZGF0YSB0byBudW1lcmljIA0KZGYgPC0gd2l0aGluKGRmLCB7DQogIHJhY2Vfd2hpdGUgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJXaGl0ZSIpDQogIHJhY2VfYmxhY2sgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJCbGFjayIpDQogIHJhY2VfaGlzcGFuaWMgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJIaXNwYW5pYyIpDQp9KQ0KDQojIEZpdCBhIGxpbmVhciByZWdyZXNzaW9uIG1vZGVsDQptb2RlbDIgPC0gbG0oUGN0LkxldmVsLjEgfiByYWNlX3doaXRlICsgcmFjZV9ibGFjayArIHJhY2VfaGlzcGFuaWMsIGRhdGEgPSBkZikNCg0KIyBQcmludCBhIHN1bW1hcnkgb2YgdGhlIHJlZ3Jlc3Npb24gcmVzdWx0cw0Kc3VtbWFyeShtb2RlbDIpDQpgYGANCg0KVGhpcyBpcyBteSBmaW5hbCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBlY29ub21pYyBkaXNhZHZhbnRhZ2VkIGFuZCB0aGUgcGVyY2VudGFnZSBvZiBzdHVkZW50cyB3aG8gc2NvcmVkIGluIGxldmVsIDEuIA0KDQpgYGB7cn0NCiMgQ2hhbmdlIG15IGRhdGEgdG8gbnVtZXJpYyANCmRmIDwtIHdpdGhpbihkZiwgew0KICBlY29uX2Rpc2FkdiA8LSBhcy5udW1lcmljKFN0dWRlbnQuQ2F0ZWdvcnkgPT0gIkVjb24gRGlzYWR2IikNCn0pDQoNCiMgRml0IGEgbGluZWFyIHJlZ3Jlc3Npb24gbW9kZWwNCm1vZGVsMyA8LSBsbShQY3QuTGV2ZWwuMSB+IGVjb25fZGlzYWR2LCBkYXRhID0gZGYpDQoNCiMgUHJpbnQgYSBzdW1tYXJ5IG9mIHRoZSByZWdyZXNzaW9uIHJlc3VsdHMNCnN1bW1hcnkobW9kZWwzKQ0KYGBgDQo=