This is my R notebook for my final project.

I first loaded my data set.

data <- read.csv("Math_Test_Results_2013-2023") 
Warning: cannot open file 'Math_Test_Results_2013-2023': No such file or directoryError in file(file, "rt") : cannot open the connection

These are the necessary steps I took to clean my data set.

Now that my data set is cleaned I will begin my exploratory analysis.

I first made a scatter plot comparing the percentage of level 1 students vs. level 4 students.

Then I made a line plot comparing those who scored in the level 1 percentile based on year

After seeing that these graphs don’t tell me very much, I knew that I needed to look at more specific factors.

So I created a graph that compared male and female percentage scores.

I created a similar bar graph, comparing ethnicity instead of gender.

Another bar graph comparing students who are at an economic disavandtage and those who are not.

This is my first linear regression model. That looks at the correlation between gender and the percentage of students who scored in level 1.

# Print a summary of the regression results
summary(model)

Call:
lm(formula = Pct.Level.1 ~ gender_male + gender_female, data = df)

Residuals:
    Min      1Q  Median      3Q     Max 
-35.180 -19.837  -2.587  16.714  65.513 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    34.4868     0.3474  99.259   <2e-16 ***
gender_male     0.6937     1.1048   0.628    0.530    
gender_female  -1.2012     1.0699  -1.123    0.262    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 23.73 on 5724 degrees of freedom
Multiple R-squared:  0.0003175, Adjusted R-squared:  -3.184e-05 
F-statistic: 0.9088 on 2 and 5724 DF,  p-value: 0.403

This is my second linear regression model. That looks at the correlation between ethnicity and the percentage of students who scored in level 1.

# Print a summary of the regression results
summary(model2)

Call:
lm(formula = Pct.Level.1 ~ race_white + race_black + race_hispanic, 
    data = df)

Residuals:
    Min      1Q  Median      3Q     Max 
-44.949 -18.813  -2.413  16.006  66.006 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    33.9941     0.3408  99.762  < 2e-16 ***
race_white    -17.0903     1.6448 -10.390  < 2e-16 ***
race_black     10.9553     1.3097   8.365  < 2e-16 ***
race_hispanic   4.8186     1.1031   4.368 1.27e-05 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 23.32 on 5723 degrees of freedom
Multiple R-squared:  0.03489,   Adjusted R-squared:  0.03439 
F-statistic: 68.97 on 3 and 5723 DF,  p-value: < 2.2e-16

This is my final linear regression model. That looks at the correlation between economic disadvantaged and the percentage of students who scored in level 1.

# Print a summary of the regression results
summary(model3)

Call:
lm(formula = Pct.Level.1 ~ econ_disadv, data = df)

Residuals:
    Min      1Q  Median      3Q     Max 
-34.591 -19.891  -2.591  16.709  65.409 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)  34.5912     0.3265 105.940   <2e-16 ***
econ_disadv  -2.0212     1.1687  -1.729   0.0838 .  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 23.73 on 5725 degrees of freedom
Multiple R-squared:  0.0005221, Adjusted R-squared:  0.0003476 
F-statistic: 2.991 on 1 and 5725 DF,  p-value: 0.08379
LS0tDQp0aXRsZTogIkZpbmFsIFByb2plY3QiDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KDQpUaGlzIGlzIG15IFIgbm90ZWJvb2sgZm9yIG15IGZpbmFsIHByb2plY3QuIA0KDQpJIGZpcnN0IGxvYWRlZCBteSBkYXRhIHNldC4gDQoNCmBgYHtyfQ0KZGF0YSA8LSByZWFkLmNzdigiTWF0aF9UZXN0X1Jlc3VsdHNfMjAxMy0yMDIzIikgDQoNCiMgQmVjYXVzZSBteSBkYXRhIHNldCBoYWQgc28gbWFueSB2YWx1ZXMsIEkgbmVlZGVkIHRvIHRha2UgYSByYW5kb20gc2FtcGxlIG9mIDEwMDAgcm93cyBpbiBvcmRlciBmb3IgaXQgdG8gYmUgZWFzaWVyIHRvIHdvcmsgd2l0aCANCg0KZGYgPC0gZGF0YVtzYW1wbGUobnJvdyhkYXRhKSwgMTAwMDApLCBdDQpgYGANCg0KVGhlc2UgYXJlIHRoZSBuZWNlc3Nhcnkgc3RlcHMgSSB0b29rIHRvIGNsZWFuIG15IGRhdGEgc2V0LiANCg0KYGBge3J9IA0KIyBGaXJzdCBkZWxldGluZyB1bm5lY2Vzc2FyeSBjb2x1bW5zIA0KZGYgPC0gZGZbLCAhbmFtZXMoZGYpICVpbiUgYygiR2VvZ3JhcGhpYy5TdWJkaXZpc2lvbiIsICJTY2hvb2wuTmFtZSIsICJOdW1iZXIuVGVzdGVkIiwgIk51bS5MZXZlbC4xIiwgIk51bS5MZXZlbC4yIiwgIk51bS5MZXZlbC4zIiwgIk51bS5MZXZlbC40IiwgIk51bS5MZXZlbC4zLmFuZC40IiwgIlBjdC5MZXZlbC4zLmFuZC40IildDQoNCiMgTm93IEkgbmVlZCB0byBnZXQgcmlkIG9mIGFueSByb3cgY29udGFpbmluZyAiQWxsIGdyYWRlcyIgYW5kICJzIiBmb3IgbnVtZXJpYyB2YWx1ZXMgDQpyb3dzX3RvX2tlZXAgPC0gZGYkR3JhZGUgIT0gIkFsbCBHcmFkZXMiDQpkZiA8LSBkZltyb3dzX3RvX2tlZXAsIF0NCg0Kcm93c190b19rZWVwMiA8LSBkZiRNZWFuLlNjYWxlLlNjb3JlICE9ICJzIg0KZGYgPC0gZGZbcm93c190b19rZWVwMiwgXQ0KDQojIEkgbm93IHB1dCB0aGUgeWVhcnMgYW5kIGdyYWRlIGluIGFzY2VuZGluZyBvcmRlciB0byBtYWtlIHRoZSBkYXRhIG1vcmUgY2xlYXIgDQpkZiA8LSBkZltvcmRlcihkZiRZZWFyKSwgXQ0KDQpkZiA8LSBkZltvcmRlcihkZiRHcmFkZSksIF0NCg0KIyBJIHRoZW4gY2hhbmdlZCB0aGUgcmVsZXZhbnQgY29sdW1ucyB0byBudW1lcmljIA0KZGYkWWVhciA8LSBhcy5udW1lcmljKGRmJFllYXIpDQpkZiRHcmFkZSA8LSBhcy5udW1lcmljKGRmJEdyYWRlKQ0KZGYkUGN0LkxldmVsLjEgPC0gYXMubnVtZXJpYyhkZiRQY3QuTGV2ZWwuMSkNCmRmJFBjdC5MZXZlbC4yIDwtIGFzLm51bWVyaWMoZGYkUGN0LkxldmVsLjIpDQpkZiRQY3QuTGV2ZWwuMyA8LSBhcy5udW1lcmljKGRmJFBjdC5MZXZlbC4zKQ0KZGYkUGN0LkxldmVsLjQgPC0gYXMubnVtZXJpYyhkZiRQY3QuTGV2ZWwuNCkNCmBgYA0KDQpOb3cgdGhhdCBteSBkYXRhIHNldCBpcyBjbGVhbmVkIEkgd2lsbCBiZWdpbiBteSBleHBsb3JhdG9yeSBhbmFseXNpcy4gDQoNCkkgZmlyc3QgbWFkZSBhIHNjYXR0ZXIgcGxvdCBjb21wYXJpbmcgdGhlIHBlcmNlbnRhZ2Ugb2YgbGV2ZWwgMSBzdHVkZW50cyB2cy4gbGV2ZWwgNCBzdHVkZW50cy4gDQoNCmBgYHtyfQ0KbGlicmFyeSgiZ2dwbG90MiIpDQoNCmdncGxvdChkZiwgYWVzKHggPSBQY3QuTGV2ZWwuMSwgeSA9IFBjdC5MZXZlbC40KSkgKw0KICBnZW9tX3BvaW50KGNvbG9yID0gImJsdWUiKSArDQogIGxhYnModGl0bGUgPSAiU2NhdHRlciBQbG90IG9mIFBjdC5MZXZlbC4xIHZzLiBQY3QuTGV2ZWwuNCIsDQogICAgICAgeCA9ICJQY3QuTGV2ZWwuMSIsDQogICAgICAgeSA9ICJQY3QuTGV2ZWwuNCIpICsNCiAgdGhlbWVfbWluaW1hbCgpDQpgYGANCg0KVGhlbiBJIG1hZGUgYSBsaW5lIHBsb3QgY29tcGFyaW5nIHRob3NlIHdobyBzY29yZWQgaW4gdGhlIGxldmVsIDEgcGVyY2VudGlsZSBiYXNlZCBvbiB5ZWFyIA0KDQpgYGB7cn0NCmxpYnJhcnkoZ2dwbG90MikNCmxpYnJhcnkoZHBseXIpDQoNCiMgR3JvdXAgdGhlIGRhdGEgYnkgeWVhciANCmF2ZXJhZ2Vfc2NvcmVzIDwtIGRmICU+JQ0KICBncm91cF9ieShZZWFyKSAlPiUNCiAgc3VtbWFyaXNlKGF2Z19wZXJjZW50YWdlID0gbWVhbihQY3QuTGV2ZWwuMSwgbmEucm0gPSBUUlVFKSkNCg0KIyBDcmVhdGUgdGhlIGxpbmUgcGxvdA0KZ2dwbG90KGF2ZXJhZ2Vfc2NvcmVzLCBhZXMoeCA9IFllYXIsIHkgPSBhdmdfcGVyY2VudGFnZSwgZ3JvdXAgPSAxKSkgKw0KICBnZW9tX2xpbmUoY29sb3IgPSAiZGFyayBncmVlbiIpICsNCiAgZ2VvbV9wb2ludChjb2xvciA9ICJkYXJrIGdyZWVuIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IFllYXIiLA0KICAgICAgIHggPSAiWWVhciIsDQogICAgICAgeSA9ICJBdmVyYWdlIFBlcmNlbnRhZ2Ugb2YgTGV2ZWwgMSIpICsNCiAgdGhlbWVfbWluaW1hbCgpICsNCiAgdGhlbWUoYXhpcy50ZXh0LnggPSBlbGVtZW50X3RleHQoYW5nbGUgPSA0NSwgaGp1c3QgPSAxKSkNCmBgYA0KDQpBZnRlciBzZWVpbmcgdGhhdCB0aGVzZSBncmFwaHMgZG9uJ3QgdGVsbCBtZSB2ZXJ5IG11Y2gsIEkga25ldyB0aGF0IEkgbmVlZGVkIHRvIGxvb2sgYXQgbW9yZSBzcGVjaWZpYyBmYWN0b3JzLiANCg0KU28gSSBjcmVhdGVkIGEgZ3JhcGggdGhhdCBjb21wYXJlZCBtYWxlIGFuZCBmZW1hbGUgcGVyY2VudGFnZSBzY29yZXMuICANCg0KYGBge3J9DQojIEZpbHRlciB0aGUgZGF0YSB0byBpbmNsdWRlIG9ubHkgcm93cyB3aXRoICJNYWxlIiBvciAiRmVtYWxlIiBpbiBTdHVkZW50LkNhdGVnb3J5DQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJNYWxlIiwgIkZlbWFsZSIpKQ0KDQojIENhbGN1bGF0ZSB0aGUgYXZlcmFnZSBwZXJjZW50YWdlIGJ5IFN0dWRlbnQuQ2F0ZWdvcnkgKE1hbGUvRmVtYWxlKQ0KYXZlcmFnZV9zY29yZXMgPC0gZmlsdGVyZWRfZGYgJT4lDQogIGdyb3VwX2J5KFN0dWRlbnQuQ2F0ZWdvcnkpICU+JQ0KICBzdW1tYXJpc2UoYXZnX3BlcmNlbnRhZ2UgPSBtZWFuKFBjdC5MZXZlbC4xLCBuYS5ybSA9IFRSVUUpKQ0KDQojIENyZWF0ZSB0aGUgYmFyIGdyYXBoDQpnZ3Bsb3QoYXZlcmFnZV9zY29yZXMsIGFlcyh4ID0gU3R1ZGVudC5DYXRlZ29yeSwgeSA9IGF2Z19wZXJjZW50YWdlLCBmaWxsID0gU3R1ZGVudC5DYXRlZ29yeSkpICsNCiAgZ2VvbV9iYXIoc3RhdCA9ICJpZGVudGl0eSIsIHBvc2l0aW9uID0gImRvZGdlIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IEdlbmRlciIsDQogICAgICAgeCA9ICJHZW5kZXIiLA0KICAgICAgIHkgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEiKSArDQogIHRoZW1lX21pbmltYWwoKQ0KYGBgDQoNCkkgY3JlYXRlZCBhIHNpbWlsYXIgYmFyIGdyYXBoLCBjb21wYXJpbmcgZXRobmljaXR5IGluc3RlYWQgb2YgZ2VuZGVyLiANCg0KYGBge3J9DQojIEZpbHRlciB0aGUgZGF0YSB0byBkaWZmZXJlbnQgZXRobmljaXRpZXMgDQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJXaGl0ZSIsICJCbGFjayIsICJIaXNwYW5pYyIpKQ0KDQojIENhbGN1bGF0ZSB0aGUgYXZlcmFnZSBwZXJjZW50YWdlIGJ5IFN0dWRlbnQuQ2F0ZWdvcnkgKFdoaXRlL0JsYWNrL0hpc3BhbmljKQ0KYXZlcmFnZV9zY29yZXMgPC0gZmlsdGVyZWRfZGYgJT4lDQogIGdyb3VwX2J5KFN0dWRlbnQuQ2F0ZWdvcnkpICU+JQ0KICBzdW1tYXJpc2UoYXZnX3BlcmNlbnRhZ2UgPSBtZWFuKFBjdC5MZXZlbC4xLCBuYS5ybSA9IFRSVUUpKQ0KDQojIENyZWF0ZSB0aGUgYmFyIGdyYXBoDQpnZ3Bsb3QoYXZlcmFnZV9zY29yZXMsIGFlcyh4ID0gU3R1ZGVudC5DYXRlZ29yeSwgeSA9IGF2Z19wZXJjZW50YWdlLCBmaWxsID0gU3R1ZGVudC5DYXRlZ29yeSkpICsNCiAgZ2VvbV9iYXIoc3RhdCA9ICJpZGVudGl0eSIsIHBvc2l0aW9uID0gImRvZGdlIikgKw0KICBsYWJzKHRpdGxlID0gIkF2ZXJhZ2UgUGVyY2VudGFnZSBvZiBMZXZlbCAxIGJ5IEV0aG5pY2l0eSIsDQogICAgICAgeCA9ICJFdGhuaWNpdHkiLA0KICAgICAgIHkgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEiKSArDQogIHRoZW1lX21pbmltYWwoKQ0KYGBgDQoNCkFub3RoZXIgYmFyIGdyYXBoIGNvbXBhcmluZyBzdHVkZW50cyB3aG8gYXJlIGF0IGFuIGVjb25vbWljIGRpc2F2YW5kdGFnZSBhbmQgdGhvc2Ugd2hvIGFyZSBub3QuIA0KDQpgYGB7cn0NCiMgRmlsdGVyIHRoZSBkYXRhIHRvIGluY2x1ZGUgb25seSByb3dzIHdpdGggIkVjb24gRGlzYWR2IiBvciAiTm90IEVjb24gRGlzYWR2IiBpbiBTdHVkZW50LkNhdGVnb3J5DQpmaWx0ZXJlZF9kZiA8LSBkZiAlPiUNCiAgZmlsdGVyKFN0dWRlbnQuQ2F0ZWdvcnkgJWluJSBjKCJFY29uIERpc2FkdiIsICJOb3QgRWNvbiBEaXNhZHYiKSkNCg0KIyBDYWxjdWxhdGUgdGhlIGF2ZXJhZ2UgcGVyY2VudGFnZSBieSBTdHVkZW50LkNhdGVnb3J5IChFY29uIERpc2Fkdi9Ob24gRWNvbiBEaXNhZHYpDQphdmVyYWdlX3Njb3JlcyA8LSBmaWx0ZXJlZF9kZiAlPiUNCiAgZ3JvdXBfYnkoU3R1ZGVudC5DYXRlZ29yeSkgJT4lDQogIHN1bW1hcmlzZShhdmdfcGVyY2VudGFnZSA9IG1lYW4oUGN0LkxldmVsLjEsIG5hLnJtID0gVFJVRSkpDQoNCiMgQ3JlYXRlIHRoZSBiYXIgZ3JhcGgNCmdncGxvdChhdmVyYWdlX3Njb3JlcywgYWVzKHggPSBTdHVkZW50LkNhdGVnb3J5LCB5ID0gYXZnX3BlcmNlbnRhZ2UsIGZpbGwgPSBTdHVkZW50LkNhdGVnb3J5KSkgKw0KICBnZW9tX2JhcihzdGF0ID0gImlkZW50aXR5IiwgcG9zaXRpb24gPSAiZG9kZ2UiKSArDQogIGxhYnModGl0bGUgPSAiQXZlcmFnZSBQZXJjZW50YWdlIG9mIExldmVsIDEgYnkgRWNvbm9taWMgRGlzYWR2YW50YWdlIiwNCiAgICAgICB4ID0gIkVjb25vbWljIERpc2FkdmFudGFnZSIsDQogICAgICAgeSA9ICJBdmVyYWdlIFBlcmNlbnRhZ2Ugb2YgTGV2ZWwgMSIpICsNCiAgdGhlbWVfbWluaW1hbCgpDQpgYGANCg0KVGhpcyBpcyBteSBmaXJzdCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBnZW5kZXIgYW5kIHRoZSBwZXJjZW50YWdlIG9mIHN0dWRlbnRzIHdobyBzY29yZWQgaW4gbGV2ZWwgMS4gDQoNCmBgYHtyfQ0KIyBDaGFuZ2UgbXkgZGF0YSB0byBudW1lcmljIA0KZGYgPC0gd2l0aGluKGRmLCB7DQogIGdlbmRlcl9tYWxlIDwtIGFzLm51bWVyaWMoU3R1ZGVudC5DYXRlZ29yeSA9PSAiTWFsZSIpDQogIGdlbmRlcl9mZW1hbGUgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJGZW1hbGUiKQ0KfSkNCg0KIyBGaXQgYSBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbA0KbW9kZWwgPC0gbG0oUGN0LkxldmVsLjEgfiBnZW5kZXJfbWFsZSArIGdlbmRlcl9mZW1hbGUsIGRhdGEgPSBkZikNCg0KIyBQcmludCBhIHN1bW1hcnkgb2YgdGhlIHJlZ3Jlc3Npb24gcmVzdWx0cw0Kc3VtbWFyeShtb2RlbCkNCmBgYA0KDQpUaGlzIGlzIG15IHNlY29uZCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBldGhuaWNpdHkgYW5kIHRoZSBwZXJjZW50YWdlIG9mIHN0dWRlbnRzIHdobyBzY29yZWQgaW4gbGV2ZWwgMS4gDQoNCmBgYHtyfQ0KIyBDaGFuZ2UgbXkgZGF0YSB0byBudW1lcmljIA0KZGYgPC0gd2l0aGluKGRmLCB7DQogIHJhY2Vfd2hpdGUgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJXaGl0ZSIpDQogIHJhY2VfYmxhY2sgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJCbGFjayIpDQogIHJhY2VfaGlzcGFuaWMgPC0gYXMubnVtZXJpYyhTdHVkZW50LkNhdGVnb3J5ID09ICJIaXNwYW5pYyIpDQp9KQ0KDQojIEZpdCBhIGxpbmVhciByZWdyZXNzaW9uIG1vZGVsDQptb2RlbDIgPC0gbG0oUGN0LkxldmVsLjEgfiByYWNlX3doaXRlICsgcmFjZV9ibGFjayArIHJhY2VfaGlzcGFuaWMsIGRhdGEgPSBkZikNCg0KIyBQcmludCBhIHN1bW1hcnkgb2YgdGhlIHJlZ3Jlc3Npb24gcmVzdWx0cw0Kc3VtbWFyeShtb2RlbDIpDQpgYGANCg0KVGhpcyBpcyBteSBmaW5hbCBsaW5lYXIgcmVncmVzc2lvbiBtb2RlbC4gVGhhdCBsb29rcyBhdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiBlY29ub21pYyBkaXNhZHZhbnRhZ2VkIGFuZCB0aGUgcGVyY2VudGFnZSBvZiBzdHVkZW50cyB3aG8gc2NvcmVkIGluIGxldmVsIDEuIA0KDQpgYGB7cn0NCiMgQ2hhbmdlIG15IGRhdGEgdG8gbnVtZXJpYyANCmRmIDwtIHdpdGhpbihkZiwgew0KICBlY29uX2Rpc2FkdiA8LSBhcy5udW1lcmljKFN0dWRlbnQuQ2F0ZWdvcnkgPT0gIkVjb24gRGlzYWR2IikNCn0pDQoNCiMgRml0IGEgbGluZWFyIHJlZ3Jlc3Npb24gbW9kZWwNCm1vZGVsMyA8LSBsbShQY3QuTGV2ZWwuMSB+IGVjb25fZGlzYWR2LCBkYXRhID0gZGYpDQoNCiMgUHJpbnQgYSBzdW1tYXJ5IG9mIHRoZSByZWdyZXNzaW9uIHJlc3VsdHMNCnN1bW1hcnkobW9kZWwzKQ0KYGBgDQo=