We use the data from US Department of Education’s College Scorecard at https://collegescorecard. ed.gov/data/ (November 2019)
I propose the following 10 questions based on my own understanding of the data.
-1. What is the median debt of graduates? -2. What is the standard deviation of instructional spending per student? -3. What is the mean faculty salary? -4. What is Mean of in-state tuition? -5. Bar plot of Colleges by Region. -6. Histogram of Net Price? -7. How many colleges have over 60% female students? -8. What is the most common region colleges area located in? -9. What is the variance in average SAT scored? -10.What is the mean percentage of first-generation students?
We will explore the questions in detail.
college = read.csv("https://www.lock5stat.com/datasets3e/CollegeScores4yr.csv")
head(college)
## Name State ID Main
## 1 Alabama A & M University AL 100654 1
## 2 University of Alabama at Birmingham AL 100663 1
## 3 Amridge University AL 100690 1
## 4 University of Alabama in Huntsville AL 100706 1
## 5 Alabama State University AL 100724 1
## 6 The University of Alabama AL 100751 1
## Accred
## 1 Southern Association of Colleges and Schools Commission on Colleges
## 2 Southern Association of Colleges and Schools Commission on Colleges
## 3 Southern Association of Colleges and Schools Commission on Colleges
## 4 Southern Association of Colleges and Schools Commission on Colleges
## 5 Southern Association of Colleges and Schools Commission on Colleges
## 6 Southern Association of Colleges and Schools Commission on Colleges
## MainDegree HighDegree Control Region Locale Latitude Longitude AdmitRate
## 1 3 4 Public Southeast City 34.78337 -86.56850 0.9027
## 2 3 4 Public Southeast City 33.50570 -86.79935 0.9181
## 3 3 4 Private Southeast City 32.36261 -86.17401 NA
## 4 3 4 Public Southeast City 34.72456 -86.64045 0.8123
## 5 3 4 Public Southeast City 32.36432 -86.29568 0.9787
## 6 3 4 Public Southeast City 33.21187 -87.54598 0.5330
## MidACT AvgSAT Online Enrollment White Black Hispanic Asian Other PartTime
## 1 18 929 0 4824 2.5 90.7 0.9 0.2 5.6 6.6
## 2 25 1195 0 12866 57.8 25.9 3.3 5.9 7.1 25.2
## 3 NA NA 1 322 7.1 14.3 0.6 0.3 77.6 54.4
## 4 28 1322 0 6917 74.2 10.7 4.6 4.0 6.5 15.0
## 5 18 935 0 4189 1.5 93.8 1.0 0.3 3.5 7.7
## 6 28 1278 0 32387 78.5 10.1 4.7 1.2 5.6 7.9
## NetPrice Cost TuitionIn TuitonOut TuitionFTE InstructFTE FacSalary
## 1 15184 22886 9857 18236 9227 7298 6983
## 2 17535 24129 8328 19032 11612 17235 10640
## 3 9649 15080 6900 6900 14738 5265 3866
## 4 19986 22108 10280 21480 8727 9748 9391
## 5 12874 19413 11068 19396 9003 7983 7399
## 6 21973 28836 10780 28100 13574 10894 10016
## FullTimeFac Pell CompRate Debt Female FirstGen MedIncome
## 1 71.3 71.0 23.96 1068 56.4 36.6 23.6
## 2 89.9 35.3 52.92 3755 63.9 34.1 34.5
## 3 100.0 74.2 18.18 109 64.9 51.3 15.0
## 4 64.6 27.7 48.62 1347 47.6 31.0 44.8
## 5 54.2 73.8 27.69 1294 61.3 34.3 22.1
## 6 74.0 18.0 67.87 6430 61.5 22.6 66.7
## [1] 713.5
The median of debt of graduates is $713.5. This tells us the typical student loan burden, the “middle” student. this is of course what they owe after completing a four-year degree.
## [1] 11095.06
The standard deviation of instructional spending is $11,905.06. This shows how spread out the colleges area in how much they spend on teaching per student. A large value would mean spending varies a lot across colleges.
## [1] 7465.778
The mean faculty salary is $7,465.78 which gives the idea of what professors make. This is at four year institutions and can reflect the financial investments of the schools in their teaching staff.
## [1] 21948.55
The mean in-state tuition was $21,948.55. This shows the cost of attending a college in the same state as a resident. This typically helps students with affordability.
This bar plot visualizes how many colleges exist in each U.S. region, helping show the geographic distribution of higher education.And looking at it you can see the northeast is leading in attendance.
This histogram shows the spread of net prices students might expect. It helps highlight whether most colleges are affordable or expensive.
## [1] 59.29588
There are around 59 colleges with an over 60% female population. This result gives the average percentage of female students across all colleges. It’s a useful way to understand gender representation in higher education.
## [1] "Northeast" "Midwest" "Southeast" "West" "Territory"
This shows the most common geographic region among all colleges in the data set, helping us see where most schools are located. You can see the most common region is the northeast.
## [1] 16617.2
The variance in average SAT scores is 16,617 which reflects how much SAT scores differ between colleges. A higher value means there’s more variation in how strong the typical student body is academically.
## [1] 33.55713
The mean percentage of first-generation students is 33.56%.This gives the average percentage of students who are the first in their family to attend college. It gives insight into how inclusive and accessible colleges are for new generations.
In this project, I used basic statistics to explore different parts of the U.S. four-year Colleges. I looked at costs, salaries, admission rates, and student demographics. using numbers like the mean, median, and variance helped me understand the data better. I also used a few visuals to see some patterns of the data. Overall, this helped me learn how colleges can be different from region to region. Looking into colleges is very smart for upcoming students.